Gene EcSMS35_4674 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4674 
Symbol 
ID6144848 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4771451 
End bp4772422 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content45% 
IMG OID641619490 
ProductTRAP transporter solute receptor DctP family protein 
Protein accessionYP_001746598 
Protein GI170681376 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component 
TIGRFAM ID[TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.841471 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATAA AAGTATCCGC TGGCATTATC GGTGCTGTTC TTATGTTATC CGCAAGCCAG 
TCCTGGGCAG TGACATTAAA ACTGAGTCAT AATCAGGATA AGTCTCATCC TGTTCATAAA
GCGATGGAGT TCTTTGCGAA AAAGAGCAAA GAGTACTCTA ACGGTGATAT TACTATTCGT
ATTTATCCAA ATGGAACATT GGGTACTCAA CGAGAAACAA TGGAGCTGAT TCGTTCTGGC
GCTATTCCAC TGGTAAAAAC CAACGCGGCA GAAATGGAAG CATTTGAAAA TTCCTATAAA
TTATTTAGCC TGCCTTATTT GTTCCGCGAT CGTGATCATT ATTATCAGGT CATGCAGGGC
GATATCGGGA GAAAAATCCT CGACTCAACG AAAAGCAAAG GTTATTTCGG GCTGACTTTT
TATGATGGAG GCGCCCGCAG TTTCTACGGC AATAAACCAG TACTGAAACC AGACGATCTC
AAAGGCATGA AAGTCCGTGT CCAGCCAAGT CCTGGCGCAG TTGAAATGAT CAAAGTCATG
GGCGGTAACC CGACGCCACT GGATTACGGC GAGTTGTATA CAGCCTTACA GCAGGGTGTG
GTCGATATGG CAGAAAACAG CGTGATGGCG CTGACCACCA TGCGTCACGG TGAAGTGGCA
AAATCCTTCA GCCTTGACGA ACACACTATG GTTCCCGATG TGGTTCTGAT GAGCAATGCT
GCGTTTGATA AACTTAGCCC GGAAAATCAG GCAGTTATAT TAAAAGCAGC TAAAGAATCA
ATGAGCTACA TGAAAGACTT GTGGAGCGAG GAAGAGAAAC AAGAATTTGC AAAACTGGAT
AAAATGGGCG TGAAAGTCTA CCAGGTAGAT AAAGCTCCGT TTATCGAGAA AGTACAGCCA
ATGTACGCAA ACTTCGCTAA GGACAACCCA GCCCTTGCCC CAATGCTGGC TGATATTCAG
GCAGCTAAGT AA
 
Protein sequence
MKIKVSAGII GAVLMLSASQ SWAVTLKLSH NQDKSHPVHK AMEFFAKKSK EYSNGDITIR 
IYPNGTLGTQ RETMELIRSG AIPLVKTNAA EMEAFENSYK LFSLPYLFRD RDHYYQVMQG
DIGRKILDST KSKGYFGLTF YDGGARSFYG NKPVLKPDDL KGMKVRVQPS PGAVEMIKVM
GGNPTPLDYG ELYTALQQGV VDMAENSVMA LTTMRHGEVA KSFSLDEHTM VPDVVLMSNA
AFDKLSPENQ AVILKAAKES MSYMKDLWSE EEKQEFAKLD KMGVKVYQVD KAPFIEKVQP
MYANFAKDNP ALAPMLADIQ AAK