Gene Pnap_1162 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_1162 
Symbol 
ID4689168 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008781 
Strand
Start bp1234254 
End bp1236530 
Gene Length2277 bp 
Protein Length758 aa 
Translation table11 
GC content63% 
IMG OID639834166 
Producthydantoinase B/oxoprolinase 
Protein accessionYP_981399 
Protein GI121604070 
COG category[E] Amino acid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAACA AATCAATAGC ACAAGGCACG GCCTTGCCCA GCGCGGACGA GATTCAATTG 
ATCCGCAAAT TCCTCAACGA CACCACGCTG TTTCTGGGGC CGGACCCGGA GATCATGCAA
AACCACGACA TCATGCCGCG CACCGATGTC GAGGACGCCT GCATCCACAA AGTCAGCGAT
GCGCACACGG TGGCCAAGAT CCGGGACCGC ATCCAGGCCG GTTGCGATGA AGGCTACGAG
ATGGTGGAGC AGATGGGCGC GGCCCCCGGC GCCAAGTGGG GCGATGTGAT CACCGGCGTG
TACTCGGCCT CGGGCGACCT GGCCATTGCC AGTGCCGGTG GCGTCCTGAT TTTCTCGGCC
CTGGTGCATC ACCCCATCAA GTTCATCATC AAGAACTGGA TCAACGACCC CACGGTAGGT
GTGCGGGATG GGGACGGCTT CATCCACAAC GACTCCCGCT ACGGCAACGT CCACAACACC
GACCAGAGCA TGATCATCCC CGTCTTTCAC GACGGCAAGC TGGTCTGCTG GGTGGCCTCC
ACCGTGCACG AGGGCGAGAA CGGCGCCATT GAGCCCGGAG GCATGCCCTC GATGGCCGAG
AGCCCGAGCG ATGAAGGCTT GAAGATGTCG CCGTTCAAGG TGGTGGAGAA CTACCAGATC
AAGCGCGACA TCCTGACCTT CCTGCAAAAC TCGGTGCGCG AACCCAAGCT GCAGTACGAG
GACATGAAAG TCAAGCTGTT CGCCTGCCTG CGCATCAAGC AGCGCATCGA GGAGACGCTC
AACACCGACG GCCCCGAGTC GTTGATCTCC ACCTTGCGCC TGACGATGGA AAACGTGCGG
GCCGAAGTCA AGCGCCGCGT CAGCGCCTGG CCCGACATGA CGGTGCGCAC CTACATCATC
CAGGACTCCA CCCTGCGCGA GAACTGCGTG GTGAAGATCA ACTGCAAGCT CACCAAGACC
GGCGACCGGC TGATCTTTGA CTTCCGGGGC TCCAGCCCGG AATTCACCAA CCGCGCCACC
AACACCATCG TGGCCGGTCT CAAGGGCATG CTGGCGCAGG TGTTCCTGTG CTACGTCTGG
CCGGACCTGC CGCGCGGCCA GGCGGCGTTT GCACCCATCG AGGTCATCAC CGATCCGCAC
TCGATTGTCA ACTGCTCCTA CGATGCGCCG AACTCGCAGA GCCTGATGTC CATCTTCACC
GGCTTCACGG CCGGCCAGCA CGCGGTGGCG AAGTTTCTCT ACAGCTGCCC CGAAAAGTAC
ACCAAGGTTC ACGCGCCGAC CTTCAACATG ATCAACACCT TCGTCTGGGG CGGTGTGAGC
CAGCACGGCG AAACCCTTGG CAACCTGTGC GCCGACCTGA ACGGCATGGG TGCCGGCGCG
GTCTCGGACC GCGATGGCGA GCATGCCCTG GCGCCGATCT TTGCCACCAT GGCCGACATT
GGCGAGCAAG AACTCAATGA GGAGGAAGTG CCTTTCCTGC AGCTCGTCTC CAAGAAGATG
ACCCGCGACG CCATCGCCCC CGGCAAGTAC CGGGGCGGAC AGGGCTACAC CATGATGGTG
GCGACCAAGG ACAGCGACCA GTGGGGCTTC ATGACCGTCT GCCAGGGCGC CAAGATTCCG
CCCCTGCAAG GCCTGTTCGG CGGCTATGCC TGCGGCACCT ACCCGCTGTG CAAGGTCAAG
GATGTGGATG TCTATGACGT GCTGCTCAAC AAGCCGCACG AATTCAGGCA CTCCATCGAG
GAAATCATGA ACGAGCGCCC GTTTGACGAG GCCAGCTACA CCACCCACCA CATGGGCATG
GGCTTTGAAA TCTCCAAGCG CGGCGAGCTG TTCATGATCT CCCAGGGGGC CGGCGCCGGG
TATGGCGACC TGCTCGAACG CGACCCGGAA GCCGTGGTCA AGGACATCGA GGAAGGACTG
ATGTCTCCCG GCGTCGCCAC GCGCCTGTAC AAGGTGAAGT TCGACCCGGC CACGCTGGCG
ATTGACCACG CCGCCACGGC CGCACTGCGT GAGCAGGAGC GCCAGGCCCG CCGGGCGCGG
GGCGTGCCGT ATGCCGAGTT TGTCAAGACC TGGAATCAGC CCCGGCCGCC AGCCCATCTG
CAGTACTTCG GCTGCTGGGG TGACGATGTG GGCACGCTGT ACAGCGGGCA TCCCGAGCTG
ACCCATCCGG CGGACCAGCC GCAGCCCAAC TACATGACCA ACCCCAAGGA TGTGCGCATC
GCCGAGCTGG AGAGTCGCCT GGCCGCCCTG GGCGGACTCA AGGAAGAAAA GCAATGA
 
Protein sequence
MSNKSIAQGT ALPSADEIQL IRKFLNDTTL FLGPDPEIMQ NHDIMPRTDV EDACIHKVSD 
AHTVAKIRDR IQAGCDEGYE MVEQMGAAPG AKWGDVITGV YSASGDLAIA SAGGVLIFSA
LVHHPIKFII KNWINDPTVG VRDGDGFIHN DSRYGNVHNT DQSMIIPVFH DGKLVCWVAS
TVHEGENGAI EPGGMPSMAE SPSDEGLKMS PFKVVENYQI KRDILTFLQN SVREPKLQYE
DMKVKLFACL RIKQRIEETL NTDGPESLIS TLRLTMENVR AEVKRRVSAW PDMTVRTYII
QDSTLRENCV VKINCKLTKT GDRLIFDFRG SSPEFTNRAT NTIVAGLKGM LAQVFLCYVW
PDLPRGQAAF APIEVITDPH SIVNCSYDAP NSQSLMSIFT GFTAGQHAVA KFLYSCPEKY
TKVHAPTFNM INTFVWGGVS QHGETLGNLC ADLNGMGAGA VSDRDGEHAL APIFATMADI
GEQELNEEEV PFLQLVSKKM TRDAIAPGKY RGGQGYTMMV ATKDSDQWGF MTVCQGAKIP
PLQGLFGGYA CGTYPLCKVK DVDVYDVLLN KPHEFRHSIE EIMNERPFDE ASYTTHHMGM
GFEISKRGEL FMISQGAGAG YGDLLERDPE AVVKDIEEGL MSPGVATRLY KVKFDPATLA
IDHAATAALR EQERQARRAR GVPYAEFVKT WNQPRPPAHL QYFGCWGDDV GTLYSGHPEL
THPADQPQPN YMTNPKDVRI AELESRLAAL GGLKEEKQ