Gene ECH74115_4234 details

Gene Information

Locus tag: ECH74115_4234
Symbol:
ID: 6968995
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3922479
End bp: 3923867
Gene Length: 1389 bp
Protein Length: 462 aa
Translation table: 11
GC content: 50%
IMG OID: 643387972
Product: PTS system, mannitol-specific cryptic EIICB component
Protein accession: YP_002272411
Protein GI: 209398789
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2213] Phosphotransferase system, mannitol-specific IIBC component
TIGRFAM ID: [TIGR00851] PTS system, mannitol-specific IIC component
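
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 3922479
END_BP = 3923867


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS: codon count minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1389 462
```

Both values agree with the Gene Length and Protein Length fields listed in the record.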


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 69
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAAACA AGTCTGCTCG TGCAAAGGTC CAGGCTTTTG GGGGCTTTTT GACTGCAATG 
GTCATCCCCA ATATTGGTGC TTTTATTGCC TGGGGTTTTA TTACTGCGTT ATTTATTCCC
ACCGGTTGGC TGCCTAACGA ACATTTCGCC AAAATTGTCG GCCCGATGAT TACCTATTTA
TTGCCCGTGA TGATTGGTTC TACCGGTGGT CATCTGGTCG GCGGTAAACG CGGCGCGGTC
ATGGGCGGAA TAGGTACTAT TGGTGTGATC GTCGGCGCAG AGATCCCGAT GTTCCTTGGC
TCAATGATTA TGGGGCCGCT CGGTGGTCTG GTCATAAAAT ATGTTGATAA GGCACTGGAA
AAACGCATAC CTGCCGGTTT TGAGATGGTT ATCAATAACT TCTCATTAGG TATCGCGGGG
ATGCTCCTTT GCCTGCTGGG TTTTGAAGTG ATCGGCCCGG CGGTGTTAAT TGCCAATACT
TTCGTCAAAG AGTGTATTGA GGCGCTGGTA CATGCGGGTT ATCTGCCTCT GCTGTCAGTC
ATCAATGAAC CGGCGAAAGT GCTTTTCCTT AATAATGCGA TCGATCAGGG CGTCTATTAT
CCGCTGGGAA TGCAACAGGC TTCGGTTAAC GGTAAATCCA TCTTCTTTAT GGTGGCCTCT
AACCCAGGTC CGGGCCTGGG GCTGCTGCTG GCGTTTACCT TGTTTGGTAA AGGGATGAGT
AAACGTTCTG CGCCCGGGGC GATGATTATT CACTTCCTCG GTGGGATCCA CGAACTGTAT
TTCCCATATG TGCTGATGAA GCCGCTGACT ATTATTGCCA TGATTGCGGG CGGTATGTCT
GGCACCTGGA TGTTTAACTT ACTGGACGGT GGTCTGGTGG CTGGCCCAAG TCCGGGGTCT
ATCTTTGCTT ACCTGGCGCT GACGCCGAAA GGTTCGTTCC TGGCGACAAT TGCCGGTGTT
ACGGTAGGTA CCCTGGTGTC CTTTGCTATC ACTTCGCTGA TACTGAAGAT GGAAAAAACG
GTGGAAACGG AGAGCGAAGA TGAGTTTGCT CAGTCAGCCA ATGCGGTTAA GGCGATGAAA
CAAGAGGGTG CATTCTCGTT AAGCAGGGTT AAGCGTATCG CCTTTGTTTG CGATGCGGGG
ATGGGCTCCA GTGCGATGGG CGCGACCACC TTCCGTAAAC GCCTGGAAAA AGCGGGGCTG
GCAATTGAAG TAAAACATTA CGCCATAGAA AACGTGCCTG CGGATGCGGA TATCGTCGTT
ACTCATGCCA GTCTGGAAGG GCGCGTGAAA CGTGTGACGG ATAAACCACT GATATTGATT
AATAACTATA TTGGCGATCC AAAACTCGAC ACTTTATTTA ATCAATTAAC CGCCGAACAT
AAACACTGA
 
Protein sequence
MENKSARAKV QAFGGFLTAM VIPNIGAFIA WGFITALFIP TGWLPNEHFA KIVGPMITYL 
LPVMIGSTGG HLVGGKRGAV MGGIGTIGVI VGAEIPMFLG SMIMGPLGGL VIKYVDKALE
KRIPAGFEMV INNFSLGIAG MLLCLLGFEV IGPAVLIANT FVKECIEALV HAGYLPLLSV
INEPAKVLFL NNAIDQGVYY PLGMQQASVN GKSIFFMVAS NPGPGLGLLL AFTLFGKGMS
KRSAPGAMII HFLGGIHELY FPYVLMKPLT IIAMIAGGMS GTWMFNLLDG GLVAGPSPGS
IFAYLALTPK GSFLATIAGV TVGTLVSFAI TSLILKMEKT VETESEDEFA QSANAVKAMK
QEGAFSLSRV KRIAFVCDAG MGSSAMGATT FRKRLEKAGL AIEVKHYAIE NVPADADIVV
THASLEGRVK RVTDKPLILI NNYIGDPKLD TLFNQLTAEH KH
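
The protein sequence above follows from the gene sequence by codon-by-codon translation. The sketch below is a minimal illustration, not the pipeline used by the source database. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in which codons may act as start codons), and the `translate` helper is a hypothetical name introduced here. Whitespace in the input is ignored so that the 10-base blocks shown in this record can be pasted directly.

```python
# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, then its final 9 bases.
print(translate("ATGGAAAACA AGTCTGCTCG TGCAAAGGTC"))  # prints: MENKSARAKV
print(translate("AAACACTGA"))                         # prints: KH
```

The two outputs match the first ten and last two residues of the protein sequence listed in this record.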