Gene Pnap_3689 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_3689 
Symbol 
ID4686207 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008781 
Strand
Start bp3927720 
End bp3928970 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content62% 
IMG OID639836707 
Productextracellular solute-binding protein 
Protein accessionYP_983906 
Protein GI121606577 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGAAAC TTTCCAAGCT GGCTGCGGCC CTTGCCTTGG TGGCCGCTGG CACGGCGGCC 
ATGGCCGGTG AAGTCGAAGT CCTGCATTAC TGGACTTCCG GTGGCGAGGC CAAGTCGGTC
GCCGAACTGA AAAAAATCAT GGAAGCCAAA GGCCATGTCT GGAAGGATTT CGCCGTAGCG
GGCGGCGGCG GCGACAACGC GGCCACCGTG CTCAAAAGCC GCGTGGTGTC GGGCAACCCG
CCCGCAGCCG CCCAGATCAA AGGCCCGGCG ATCCAGGAAT GGGCGGCTGA AGGCGTTCTG
GCCAACATGG ACCCTGTCGC CCAGGCCGAA AAGTGGGACA GCCTGCTGCC CAAGGTGGTG
GCCGACGTCA TGAAGTACAA AGGCAATTTC GTGGCGGTTC CCGTCAACGT GCACCGCGTG
AACTGGATGT GGGCCAACGC CGCCGTGCTG AAAAAAGCCG GGGTCGCGGG CATGCCCAAG
AATTGGGACG AGTTCTTTGT CGCGGCCGAC AAGATCAAGA AAGCAGGGCT GATTCCGGTC
GCCCACGGTG GCCAGAACTG GCAGGACTTC ACCACCTTCG AGTCCGTGGT GCTCGGCGTG
GGCGGCCCGA AGTTCTACAG CGACGCGCTC GTCAAGCTTG ACCAAAAGGC GCTGACCGGC
GAGACCATGA AGAAGTCGCT GGAAACCTTC CGCAAGATCA AGGGCTACAC CGACGCTGCC
GCGCCCGGGC GCGACTGGAA CCTGGCCACC GCCATGGTGA TGCAGGAAAA AGCGGCCTTC
CAGTTCATGG GCGACTGGGC CAAGGGCGAG TTCATTGCGG CCGGCAAGGT GCCCGGCAAG
GACTTCCTGT GCGCCGCCGC CCCCGGCACC GCCAATGCGT TCACCTTCAA TGTGGACTCG
TTTGCCATGT TCAAGCTCAA AGGCGCCGCA GCGCAAAAGG CGCAGGCGGA TTTGTCGGCG
GCCATCATGG GCACGGAATT CCAGGAGATT TTCAACCTGA ACAAGGGCTC GATTCCGGTG
CGCCTGAACA TGAACATGGC CAAGTTCGAC GACTGCGCCA AGCTGTCCGG CAAGGACTTT
GTCGAGACCG CCAAAACCGG CGGGCTGGTG CCCTCGGCGG CCCACGGCAT GGCCATCAGC
CCTGCGGCTG AAGGCGCCAT CAAGGACGCG GTCAGCCAGT TCTGGAACGA CGACAAGATC
TCGGTGGACG AGGCGCAAAA GCGCATCGCT GCCGCAGCAA AAACCAAATA A
 
Protein sequence
MLKLSKLAAA LALVAAGTAA MAGEVEVLHY WTSGGEAKSV AELKKIMEAK GHVWKDFAVA 
GGGGDNAATV LKSRVVSGNP PAAAQIKGPA IQEWAAEGVL ANMDPVAQAE KWDSLLPKVV
ADVMKYKGNF VAVPVNVHRV NWMWANAAVL KKAGVAGMPK NWDEFFVAAD KIKKAGLIPV
AHGGQNWQDF TTFESVVLGV GGPKFYSDAL VKLDQKALTG ETMKKSLETF RKIKGYTDAA
APGRDWNLAT AMVMQEKAAF QFMGDWAKGE FIAAGKVPGK DFLCAAAPGT ANAFTFNVDS
FAMFKLKGAA AQKAQADLSA AIMGTEFQEI FNLNKGSIPV RLNMNMAKFD DCAKLSGKDF
VETAKTGGLV PSAAHGMAIS PAAEGAIKDA VSQFWNDDKI SVDEAQKRIA AAAKTK