Gene ECH74115_2546 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2546 
SymbolmanX 
ID6969990 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2410171 
End bp2411142 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content50% 
IMG OID643386414 
ProductPTS system, mannose-specific IIAB component 
Protein accessionYP_002270896 
Protein GI209399568 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2893] Phosphotransferase system, mannose/fructose-specific component IIA
[COG3444] Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IIB 
TIGRFAM ID[TIGR00824] PTS system, mannose/fructose/sorbose family, IIA component
[TIGR00854] PTS system, mannose/fructose/sorbose family, IIB component 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000579402 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCATTG CTATTGTTAT AGGCACACAT GGTTGGGCTG CAGAGCAGTT GCTTAAAACG 
GCAGAAATGC TGTTAGGCGA GCAGGAAAAC GTCGGCTGGA TCGATTTCGT TCCAGGTGAA
AATGCCGAAA CGCTGATTGA AAAGTACAAC GCTCAGTTGG CAAAACTCGA CACCACTAAA
GGCGTGCTGT TTCTCGTTGA TACATGGGGA GGCAGCCCGT TCAATGCTGC CAGCCGCATT
GTCGTCGACA AAGAGCATTA TGAAGTCATT GCAGGCGTTA ACATTCCAAT GCTCGTGGAA
ACGTTAATGG CTCGTGATGA TGACCCAAGC TTTGATGAAC TGGTTGCGCT GGCAGTAGAA
ACAGGCCGTG AAGGCGTGAA AGCACTGAAA GCCAAACCGG TTGAAAAAGC CGCGCCAGCA
CCCGCTGCCG CAGCACCCAA AGCGGCTCCA ACTCCGGCAA AACCAATGGG GCCAAACGAC
TACATGGTTA TTGGCCTTGC GCGTATCGAC GACCGTCTGA TTCACGGTCA GGTCGCCACC
CGCTGGACCA AAGAAACCAA TGTCTCCCGT ATTATTGTTG TTAGTGATGA AGTGGCTGCG
GATACTGTTC GTAAGACACT GCTCACCCAG GTTGCACCTC CGGGCGTAAC AGCACACGTA
GTTGATGTTG CCAAAATGAT TCGCGTCTAC AACAACCCGA AATATGCTGG CGAACGTGTA
ATGCTGTTAT TTACCAACCC AACAGATGTA GAGCGTCTCG TTGAAGGCGG CGTGAAAATC
ACCTCTGTTA ACGTCGGTGG TATGGCATTC CGTCAGGGTA AAACCCAAGT GAATAACGCG
GTTTCGGTTG ATGAAAAAGA TATCGAGGCG TTCAAAAAAC TGAATGCACG CGGTATTGAG
CTGGAAGTCC GTAAGGTTTC CACCGATCCG AAACTGAAAA TGATGGATCT GATCAGCAAA
ATCGATAAGT AA
 
Protein sequence
MTIAIVIGTH GWAAEQLLKT AEMLLGEQEN VGWIDFVPGE NAETLIEKYN AQLAKLDTTK 
GVLFLVDTWG GSPFNAASRI VVDKEHYEVI AGVNIPMLVE TLMARDDDPS FDELVALAVE
TGREGVKALK AKPVEKAAPA PAAAAPKAAP TPAKPMGPND YMVIGLARID DRLIHGQVAT
RWTKETNVSR IIVVSDEVAA DTVRKTLLTQ VAPPGVTAHV VDVAKMIRVY NNPKYAGERV
MLLFTNPTDV ERLVEGGVKI TSVNVGGMAF RQGKTQVNNA VSVDEKDIEA FKKLNARGIE
LEVRKVSTDP KLKMMDLISK IDK