Gene EcolC_3809 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3809 
Symbol 
ID6067842 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4169435 
End bp4170406 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content45% 
IMG OID641603221 
ProductTRAP dicarboxylate transporter, DctP subunit 
Protein accessionYP_001726740 
Protein GI170021786 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component 
TIGRFAM ID[TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0641152 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATAA AAGTATCCGC TGGCATTATC GGTGCTGTTC TTATGTTATC CGCAAGCCAG 
TCCTGGGCAG TGACATTAAA ACTGAGTCAT AATCAGGATA AGTCTCATCC TGTTCATAAA
GCGATGGAGT TCTTTGCGAA AAAGAGCAAA GAGTACTCTA ACGGTGATAT TACTATTCGT
ATTTATCCAA ATGGAACATT GGGTACTCAA CGAGAAACAA TGGAGCTGAT TCGTTCTGGC
GCTATTCCAC TGGTAAAAAC CAATGCGGCA GAAATGGAAG CATTTGAAAA TTCCTATAAA
TTATTTAGCC TGCCTTATTT GTTCCGCGAT CGTGATCATT ATTATCAGGT CATGCAGGGC
GATATCGGGA GAAAAATCCT CGACTCAACG AAAAGCAAAG GTTATTTCGG GCTGACTTTT
TATGATGGAG GCGCCCGCAG TTTCTATGGC AATAAACCAG TACTGAAACC AGACGATCTG
AAAGGCATGA AAGTCCGTGT CCAGCCAAGC CCTGGCGCAG TTGAAATGAT CAAAGTCATG
GGCGGTAACC CGACGCCACT GGATTACGGC GAGTTGTATA CAGCCTTACA GCAGGGTGTG
GTCGATATGG CAGAAAACAG CGTGATGGCG CTGACCACCA TGCGTCACGG TGAAGTGGCA
AAATCCTTCA GCCTTGACGA ACACACTATG GTTCCCGATG TGGTTCTGAT GAGCAATGCT
GCGTTTGATA AACTTAGCCC GGAAAATCAG GCAGTTATAT TAAAAGCAGC TAAAGAATCA
ATGAGCTACA TGAAAGACTT GTGGAGCGAG GAAGAGAAAC AAGAATTTGC AAAACTGGAT
AAAATGGGCG TGAAAGTCTA CCAGGTAGAT AAAGCTCCGT TTATCGAGAA AGTACAGCCA
ATGTACGCAA ACTTCGCTAA GGATAACCCA GCCCTTGCCC CAATGCTGGC TGATATTCAG
GCGGCTAAGT AA
 
Protein sequence
MKIKVSAGII GAVLMLSASQ SWAVTLKLSH NQDKSHPVHK AMEFFAKKSK EYSNGDITIR 
IYPNGTLGTQ RETMELIRSG AIPLVKTNAA EMEAFENSYK LFSLPYLFRD RDHYYQVMQG
DIGRKILDST KSKGYFGLTF YDGGARSFYG NKPVLKPDDL KGMKVRVQPS PGAVEMIKVM
GGNPTPLDYG ELYTALQQGV VDMAENSVMA LTTMRHGEVA KSFSLDEHTM VPDVVLMSNA
AFDKLSPENQ AVILKAAKES MSYMKDLWSE EEKQEFAKLD KMGVKVYQVD KAPFIEKVQP
MYANFAKDNP ALAPMLADIQ AAK