Gene Pnap_0043 details


Gene Information

Locus tag: Pnap_0043
Symbol:
ID: 4689983
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 43586
End bp: 44605
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 61%
IMG OID: 639833037
Product: TRAP dicarboxylate transporter, DctP subunit
Protein accession: YP_980290
Protein GI: 121602961
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component
TIGRFAM ID: [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family
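
The start, end, gene length, and protein length fields above are related by simple arithmetic. A minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the gene length counts the terminal stop codon while the protein length does not (the function names are illustrative and not part of any database API):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from this record.
length_bp = gene_length(43586, 44605)
print(length_bp)                  # 1020
print(protein_length(length_bp))  # 339
```

Both results agree with the Gene Length (1020 bp) and Protein Length (339 aa) fields listed for Pnap_0043.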


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.482624
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.263408
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTTTTC TTCGTAGAAC CGTGATTGCC GCCGCCGCTG CCGTGGCCTT GAGCGCCTCG 
TTTTCCGCGC TGGCGCAGGA CATCAAGCCG CGCCTGATCC GCTTCGGCTA CGGCCTGAAC
GAGCAAAGCA ACCAGGGCCG CGCCGCCAAG GTATTCGCCG ACGAGGTCGC CAAGGCCTCG
GGCGGCAAGA TGAAGGTGCG CGCCATCGGC GCCGCCGCGC TCGGCCCCGA CACGCAGATG
CAGCAGGCCC TGATTGGCGG CGCGCAGGAA ATGATGGTCG GCTCGACCGC CACGCTGGTC
GGCATCACCA AGGAAATGGC GCTGTGGGAC ACGCCCTTCC TGATCAACAA CACCAAGGAA
GCCGACGCCC TGCTTGACGG CCCGATTGGC GAGAAGATCA AGGACAAGCT GCAGGACAAA
GGGCTGGTCG GCCTGGTCTA TTGGGAAAAC GGCTTTCGCA ACCTGACCAA CAGCAAGCGC
CCGGTCACCA AGGTCGAGGA TCTGGACGGC ATCAAGCTGC GCGTGATGCA GAACAACGTG
TTTTTGAGCA GCTTCAAGAC GCTGGGCGCC AACGCCATTC CGATGGCGTT TTCCGAACTC
TTCGGCGCGC TGGAAACCAA GACCGTCGAT GGCCAGGAAA ACCCGTACAA CACGATTTTG
TCGAGCAAGT TCTACGAGGT GCAGAAGTAC CTGACGGTGA CCAACCACGT TTACAGCCCG
TGGATCGTGC TGGTCAGCAA GAAGTGGTGG GACCAGCTGT CCAAGGCCGA GCAGAAGGTG
CTGATGGACG CGGCCAAGAC CAGCCGCGAC TACGAGCGCA AGGACACGCG CGAGGAAGCC
TCCAAAGCCA TGGCCGACCT CAAGGCCAAG GGCATGCTGG TCAATGAGCT GGCGCCCGCA
GAAGCCGACC GCATGCGCAA CAAGCTGACC CGCGTGTATG CCGAAATCGG CACCGAAGTC
GGCATGGACC TGTGGATTGC CACGCAGAAC GAACTGCTGA AGATTCGCGG CAAGAAATGA
 
Protein sequence
MTFLRRTVIA AAAAVALSAS FSALAQDIKP RLIRFGYGLN EQSNQGRAAK VFADEVAKAS 
GGKMKVRAIG AAALGPDTQM QQALIGGAQE MMVGSTATLV GITKEMALWD TPFLINNTKE
ADALLDGPIG EKIKDKLQDK GLVGLVYWEN GFRNLTNSKR PVTKVEDLDG IKLRVMQNNV
FLSSFKTLGA NAIPMAFSEL FGALETKTVD GQENPYNTIL SSKFYEVQKY LTVTNHVYSP
WIVLVSKKWW DQLSKAEQKV LMDAAKTSRD YERKDTREEA SKAMADLKAK GMLVNELAPA
EADRMRNKLT RVYAEIGTEV GMDLWIATQN ELLKIRGKK
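
The sequences above are displayed in space-separated blocks of 10 characters, so they need cleaning before any computation. A minimal sketch of that cleanup, with a GC-fraction helper and a stop-codon check, assuming the three standard stop codons (TAA, TAG, TGA) of translation table 11; the helper names are hypothetical:

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons used by translation table 11


def clean_sequence(raw: str) -> str:
    """Remove the display whitespace (spaces, line breaks) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence, ignoring whitespace."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(raw: str) -> bool:
    """True if the last three bases form a stop codon."""
    return clean_sequence(raw)[-3:] in STOP_CODONS


# The last block of the gene sequence shown above ends in TGA.
print(ends_with_stop("CAAGAAATGA"))
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field in the Gene Information section.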