Gene EcE24377A_4765 details


Gene Information

Locus tag: EcE24377A_4765
Symbol:
ID: 5586156
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 4758793
End bp: 4759764
Gene Length: 972 bp
Protein Length: 323 aa
Translation table: 11
GC content: 45%
IMG OID: 640928376
Product: TRAP transporter solute receptor DctP family protein
Protein accession: YP_001465704
Protein GI: 157158149
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component
TIGRFAM ID: [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family
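
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the final codon is a stop codon; the record states neither convention explicitly, and the helper names are illustrative only.

```python
# Values copied from the record above.
START_BP = 4758793
END_BP = 4759764


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon is a stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 972
print(protein_length(length_bp))  # 323
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.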


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.844835
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATAA AAGTATCCGC TGGCATTATC GGTGCTGTTC TTATGTTATC CGCAAGCCAG 
TCCTGGGCAG TGACATTAAA ACTGAGTCAT AATCAGGATA AGTCTCATCC TGTTCATAAA
GCGATGGAGT TCTTTGCGAA AAAGAGCAAA GAGTACTCTA ACGGTGATAT TACTATTCGT
ATTTATCCAA ATGGAACATT GGGTACTCAA CGAGAAACAA TGGAGCTGAT TCGTTCTGGC
GCTATTCCCC TGGTAAAAAC CAATGCGGCA GAAATGGAAG CATTTGAAAA TTCCTATAAA
TTATTTAGCC TGCCTTATTT GTTCCGCGAT CGTGATCATT ATTATCAGGT CATGCAGGGC
GATATCGGGA GAAAAATCCT CGACTCAACG AAAAGCAAAG GTTATTTCGG GCTGACTTTT
TATGATGGAG GCGCCCGCAG TTTCTATGGC AATAAACCAG TACTGAAACC AGACGATCTG
AAAGGCATGA AAGTCCGTGT CCAGCCAAGC CCTGGCGCAG TTGAAATGAT CAAAGTCATG
GGCGGTAACC CGACGCCACT GGATTACGGC GAGTTGTATA CAGCCTTACA GCAGGGTGTG
GTCGATATGG CAGAAAACAG CGTGATGGCG CTGACCACCA TGCGTCACGG TGAAGTGGCA
AAATCCTTCA GCCTTGACGA ACACACTATG GTTCCCGATG TGGTTCTGAT GAGCAATGCT
GCGTTTGATA AACTTAGCCC GGAAAATCAG GCAGTTATAT TAAAAGCAGC TAAAGAATCA
ATGAGCTATA TGAAAGACTT GTGGAGCGAG GAAGAGAAAC AAGAATTTGC GAAACTGGAT
AAAATGGGCG TGAAAGTCTA CCAGGTAGAT AAAGCTCCGT TTATCGAGAA AGTACAGCCA
ATGTACGCAA ACTTCGCTAA GGACAACCCA GCCCTTGCCC CAATGCTGGC TGATATTCAG
GCAGCTAAGT AA
 
Protein sequence
MKIKVSAGII GAVLMLSASQ SWAVTLKLSH NQDKSHPVHK AMEFFAKKSK EYSNGDITIR 
IYPNGTLGTQ RETMELIRSG AIPLVKTNAA EMEAFENSYK LFSLPYLFRD RDHYYQVMQG
DIGRKILDST KSKGYFGLTF YDGGARSFYG NKPVLKPDDL KGMKVRVQPS PGAVEMIKVM
GGNPTPLDYG ELYTALQQGV VDMAENSVMA LTTMRHGEVA KSFSLDEHTM VPDVVLMSNA
AFDKLSPENQ AVILKAAKES MSYMKDLWSE EEKQEFAKLD KMGVKVYQVD KAPFIEKVQP
MYANFAKDNP ALAPMLADIQ AAK
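
The correspondence between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The following is a minimal sketch, not the database's own procedure: it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for elongation, and it does not model the table's alternative start codons (not needed here, since the gene begins with ATG). The function and variable names are hypothetical.

```python
# Standard amino-acid assignments, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon.

    Whitespace is removed first, so the 10-base blocks used in the
    record can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and final 12 bases of the gene sequence above.
first_bases = "ATGAAAATAA AAGTATCCGC TGGCATTATC"
last_bases = "GCAGCTAAGT AA"

print(translate(first_bases))  # MKIKVSAGII
print(translate(last_bases))   # AAK
```

Both fragments match the start (MKIKVSAGII) and end (AAK) of the listed protein sequence. The final line of the gene sequence is in frame because every preceding line holds 60 bases.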