Gene ECH74115_5637 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5637 
SymboldcuB 
ID6971984 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5277054 
End bp5278394 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content52% 
IMG OID643389271 
Productanaerobic C4-dicarboxylate transporter 
Protein accessionYP_002273668 
Protein GI209398687 
COG category[R] General function prediction only 
COG ID[COG2704] Anaerobic C4-dicarboxylate transporter 
TIGRFAM ID[TIGR00148] UbiD family decarboxylases
[TIGR00770] anaerobic c4-dicarboxylate membrane transporter family protein 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.363668 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTATTTA CTATCCAACT TATCATAATA CTGATATGTC TGTTTTATGG TGCCAGAAAG 
GGTGGTATCG CGCTGGGTTT ATTAGGCGGT ATCGGTCTGG TCATTCTGGT CTTCGTCTTC
CACCTTCAGC CAGGTAAACC ACCAGTTGAT GTCATGCTGG TTATCATTGC GGTGGTGGCG
GCATCGGCGA CCTTGCAAGC TTCGGGCGGT CTTGATGTCA TGCTGCAAAT TGCCGAGAAG
CTGCTGCGCC GCAACCCGAA ATATGTCTCA ATTGTCGCGC CGTTTGTGAC CTGTACGCTG
ACCATTCTTT GCGGTACGGG TCATGTGGTT TACACCATTC TGCCGATCAT CTACGACGTC
GCCATTAAGA ACAACATCCG TCCGGAACGT CCGATGGCGG CAAGTTCTAT CGGTGCACAG
ATGGGGATTA TCGCCAGTCC GGTGTCGGTT GCGGTCGTGT CTCTGGTTGC AATGCTGGGT
AATGTCACCT TTGATGGTCG CCATCTTGAG TTCCTCGACC TGCTGGCAAT CACCATTCCA
TCGACGTTAA TCGGTATCCT GGCGATCGGT ATCTTCAGCT GGTTCCGCGG TAAAGATCTG
GATAAAGACG AAGAGTTCCA GAAATTCATC TCCGTACCGG AAAACCGTGA GTATGTTTAC
GGTGATACCG CGACGCTGCT GGATAAAAAA CTGCCGAAAA GCAACTGGCT GGCAATGTGG
ATTTTCCTCG GGGCAATCGC TGTAGTCGCA CTTCTTGGTG CTGATTCGGA CCTGCGTCCA
TCCTTCGGCG GCAAACCGCT GTCGATGGTA CTGGTTATTC AGATGTTTAT GCTGCTGACC
GGGGCGCTGA TTATTATCCT GACCAAAACC AATCCCGCGT CTATCTCAAA AAACGAAGTC
TTCCGTTCCG GTATGATCGC CATCGTGGCG GTGTACGGTA TCGCATGGAT GGCAGAAACC
ATGTTCGGTG CGCATATGTC TGAAATTCAG GGCGTACTGG GTGAAATGGT GAAAGAGTAT
CCGTGGGCCT ATGCCATTGT TCTGCTGCTG GTTTCCAAGT TTGTAAACTC TCAGGCTGCG
GCGCTGGCGG CGATTGTTCC GGTCGCGCTA GCGATCGGCG TTGATCCGGC ATACATCGTG
GCTTCAGCAC CGGCTTGCTA CGGTTATTAC ATCCTGCCGA CTTATCCGAG CGATCTGGCA
GCGATTCAGT TTGACCGTTC CGGCACCACC CACATCGGTC GCTTCGTCAT CAACCACAGC
TTTATTCTGC CGGGGTTGAT TGGTGTGAGC GTATCGTGCG TCTTCGGCTG GATCTTCGCC
GCGATGTACG GGTTCTTATA A
 
Protein sequence
MLFTIQLIII LICLFYGARK GGIALGLLGG IGLVILVFVF HLQPGKPPVD VMLVIIAVVA 
ASATLQASGG LDVMLQIAEK LLRRNPKYVS IVAPFVTCTL TILCGTGHVV YTILPIIYDV
AIKNNIRPER PMAASSIGAQ MGIIASPVSV AVVSLVAMLG NVTFDGRHLE FLDLLAITIP
STLIGILAIG IFSWFRGKDL DKDEEFQKFI SVPENREYVY GDTATLLDKK LPKSNWLAMW
IFLGAIAVVA LLGADSDLRP SFGGKPLSMV LVIQMFMLLT GALIIILTKT NPASISKNEV
FRSGMIAIVA VYGIAWMAET MFGAHMSEIQ GVLGEMVKEY PWAYAIVLLL VSKFVNSQAA
ALAAIVPVAL AIGVDPAYIV ASAPACYGYY ILPTYPSDLA AIQFDRSGTT HIGRFVINHS
FILPGLIGVS VSCVFGWIFA AMYGFL