Gene ECH74115_3071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3071 
SymbolgatC 
ID6966799 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2843259 
End bp2844614 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content51% 
IMG OID643386903 
ProductPTS system, galactitol-specific IIC component 
Protein accessionYP_002271371 
Protein GI209397449 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3775] Phosphotransferase system, galactitol-specific IIC component 
TIGRFAM ID[TIGR00827] PTS system, galactitol-specific IIC component 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.0000414376 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTTTCAG AAGTCATGCG TTATATTCTC GATCTCGGCC CTACGGTGAT GTTGCCGATT 
GTCATCATTA TTTTTTCTAA AATTTTAGGC ATGAAGGCAG GCGATTGCTT TAAAGCGGGT
CTGCATATCG GGATTGGCTT TGTTGGCATT GGCCTTGTGA TTGGCTTAAT GCTGGATTCC
ATTGGCCCGG CGGCGAAAGC GATGGCGGAA AATTTCGACC TGAATCTGCA TGTGGTCGAT
GTCGGCTGGC CGGGCTCTTC ACCAATGACC TGGGCGTCGC AAATTGCGCT GGTGGCGATT
CCGATTGCGA TTCTGGTTAA CGTGGCGATG TTACTGACCC GTATGACGCG GGTGGTAAAT
GTTGATATCT GGAATATCTG GCATATGACC TTCACCGGCG CGTTGCTGCA TCTGGCAACC
GGTTCATGGA TGATAGGGAT GGCAGGTGTG GTAATTCACG CGGCGTTTGT TTATAAGCTC
GGCGACTGGT TTGCCCGCGA TACCCGAAAT TTCTTTGAGC TGGAAGGCAT TGCTATTCCG
CACGGTACGT CGGCGTATAT GGGGCCGATT GCGGTGCTGG TCGATGCTAT CATCGAGAAA
ATCCCAGGCG TTAACCGTAT TAAATTTAGC GCCGACGATA TTCAGCGCAA ATTTGGCCCG
TTTGGCGAGC CTGTCACCGT GGGTTTTGTG ATGGGGCTGA TTATCGGCAT CCTCGCGGGT
TACGATGTCA AAGGTGTATT GCAGCTGGCG GTAAAAACGG CGGCGGTGAT GCTGCTAATG
CCACGGGTGA TTAAACCCAT CATGGATGGT TTAACGCCCA TCGCTAAGCA GGCCCGTAGT
CGTTTACAGG CGAAGTTCGG CGGTCAGGAG TTCCTGATTG GCCTGGATCC CGCATTACTG
CTGGGGCATA CGGCGGTGGT ATCGGCAAGC CTGATTTTTA TTCCGCTCAC CATTTTAATT
GCTGTTTGTG TTCCTGGTAA TCAGGTACTG CCGTTTGGCG ATCTTGCCAC TATCGGCTTC
TTCGTGGCGA TGGCGGTCGC GGTGCATCGT GGAAATCTGT TCCGCACCTT AATCTCGGGT
GTCATCATTA TGAGCATCAC CCTGTGGATC GCGACGCAAA CTATTGGTTT GCACACCCAA
CTGGCGGCTA ATGCTGGGGC GTTAAAAGCC GGGGGTATGG TGGCTTCAAT GGATCAGGGC
GGTTCTCCCA TTACCTGGTT ACTGATTCAG GTTTTCTCCC CGCAAAATAT TCCCGGTTTC
ATTATTATCG GCGCAATTTA TCTTACCGGT ATTTTCATGA CCTGGCGTAG AGCGCGTGGC
TTTATTAAAC AAGAGAAAGT CGTTCTCGCA GAATAA
 
Protein sequence
MFSEVMRYIL DLGPTVMLPI VIIIFSKILG MKAGDCFKAG LHIGIGFVGI GLVIGLMLDS 
IGPAAKAMAE NFDLNLHVVD VGWPGSSPMT WASQIALVAI PIAILVNVAM LLTRMTRVVN
VDIWNIWHMT FTGALLHLAT GSWMIGMAGV VIHAAFVYKL GDWFARDTRN FFELEGIAIP
HGTSAYMGPI AVLVDAIIEK IPGVNRIKFS ADDIQRKFGP FGEPVTVGFV MGLIIGILAG
YDVKGVLQLA VKTAAVMLLM PRVIKPIMDG LTPIAKQARS RLQAKFGGQE FLIGLDPALL
LGHTAVVSAS LIFIPLTILI AVCVPGNQVL PFGDLATIGF FVAMAVAVHR GNLFRTLISG
VIIMSITLWI ATQTIGLHTQ LAANAGALKA GGMVASMDQG GSPITWLLIQ VFSPQNIPGF
IIIGAIYLTG IFMTWRRARG FIKQEKVVLA E