Gene Pnap_4043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnap_4043 
Symbol 
ID4687714 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolaromonas naphthalenivorans CJ2 
KingdomBacteria 
Replicon accessionNC_008781 
Strand
Start bp4309559 
End bp4310542 
Gene Length984 bp 
Protein Length327 aa 
Translation table11 
GC content59% 
IMG OID639837056 
ProductTRAP dicarboxylate transporter, DctP subunit 
Protein accessionYP_984255 
Protein GI121606926 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component 
TIGRFAM ID[TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.187299 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGCGA TTCGTCTGAT ATTCCTCGCC GGGCTGGCCT GCGCTGCGAT GGCGCACGCC 
GAGCCGATCA CCTTGACGCT TGCGCATGCC ACCATGACCA CGCACCCGGC TCACCTGGCG
GCGCTGCAGT TTGCCAGGCG GGTCGAAGAG CGCACCAACG GGCAAATCAA GACCGAGATA
TTCCCGGCGG CTCAGCTCGG CAGCGAAAAC GAAATGCTAA AAAAGGTCAA ACTCGGCGCG
ATTGACATGG ACGTGTCCAC GCCGAACTAC ATGATCAAGT ACGAAAAGGC CTTCGCGGTC
GTGGTCATGC CCTACGTATT CGACAACTAC GAGCATGCGC ACCGGGTGCT CGACGGCCCG
GCGATGGCCT GGCTCGCGCC GCTGGCCGAG AAGCAGGGCT TCGTGATCTT GTCCAACTGG
GAATGGGGCT TTCGCAACCT GACCAACAAC CAGCGCCCGA TCAACCAGCC CGGGGATGTG
CGCGGCCTGA AAATACGCGT GCCGCCCGTG GCTGAAGTCG AGACCACCAT GCAGGCGCTG
GGCGCGCAGG TCAGCAAGAT CAGCTTCAAA GACCTCTACG CAGCGTTGTC GCAAGGACGG
GTCGATGGCC AGGAAAACCC GCTCAACGTG ATTTATTACA ACAAGCTGTA CGAGGTGCAG
AAGCACCTCG CGCTGACGCG GCATGTTTAC TACAACACCG TGCACCTGAT CAGCGCCAAA
AGCTGGGCGA TGCTCACGCC GGCGCAGCAA AAAATTGTGC GCGAAGAAAG CAAGGCGGCG
GGCGACGGCA TGCGCAAAAA AATCATTGCC GAAGAGGACG AGCTGATCGC CAAAATGGCC
GCTGCCGGGG TGAAGGTCAC GCGCCCCGAC CTCAAGGCGT TTCGCGCCAC AGTGGAACCC
GTTTATCAGG AAATTGCCGC CTACACGGGC GAAGCGAATG TGCAAAGGTT TCTGAAAATG
GTCGAAGATG AGCGCAAGAA ATGA
 
Protein sequence
MKAIRLIFLA GLACAAMAHA EPITLTLAHA TMTTHPAHLA ALQFARRVEE RTNGQIKTEI 
FPAAQLGSEN EMLKKVKLGA IDMDVSTPNY MIKYEKAFAV VVMPYVFDNY EHAHRVLDGP
AMAWLAPLAE KQGFVILSNW EWGFRNLTNN QRPINQPGDV RGLKIRVPPV AEVETTMQAL
GAQVSKISFK DLYAALSQGR VDGQENPLNV IYYNKLYEVQ KHLALTRHVY YNTVHLISAK
SWAMLTPAQQ KIVREESKAA GDGMRKKIIA EEDELIAKMA AAGVKVTRPD LKAFRATVEP
VYQEIAAYTG EANVQRFLKM VEDERKK