Gene EcSMS35_0972 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0972 
SymbolgatC 
ID6144264 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp981806 
End bp983161 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content52% 
IMG OID641615859 
ProductPTS system, galactitol-specific IIC component 
Protein accessionYP_001743051 
Protein GI170683704 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3775] Phosphotransferase system, galactitol-specific IIC component 
TIGRFAM ID[TIGR00827] PTS system, galactitol-specific IIC component 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.487394 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTTCAG AAGTCATGCG TTATATTCTC GACCTCGGCC CTACGGTGAT GCTGCCGATT 
GTCATCATTA TTTTTTCTAA AATATTAGGC ATGAAGGCAG GCGATTGCTT TAAAGCGGGT
CTGCATATCG GGATTGGCTT TGTTGGCATT GGCCTTGTGA TTGGCTTAAT GCTGGATTCC
ATTGGTCCGG CGGCGAAAGC GATGGCGGAA AATTTCGACC TGAATCTGCA TGTGGTCGAT
GTCGGCTGGC CGGGCTCTTC ACCCATGACC TGGGCGTCGC AAATTGCGCT GGTGGCGATT
CCGATTGCGA TTCTGGTTAA CGTGGCGATG TTACTGACCC GTATGACGCG GGTGGTAAAT
GTTGATATCT GGAATATCTG GCATATGACC TTCACCGGCG CGTTGCTGCA TCTGGCAACC
GGTTCATGGA TGATAGGGAT GGCGGGTGTG GTAATTCACG CGGCGTTTGT TTATAAGCTC
GGCGACTGGT TTGCCCGCGA TACCCGAAAT TTCTTTGAGC TGGAAGGCAT TGCCATCCCG
CACGGTACGT CGGCGTATAT GGGGCCAATT GCGGTGCTGG TGGATGCTAT CATCGAGAAA
ATCCCAGGCG TTAACCGAAT TAAATTTAGT GCCGACGATA TTCAGCGCAA ATTTGGTCCG
TTTGGCGAGC CTGTCACCGT GGGTTTTGTG ATGGGGCTGA TTATCGGCAT CCTCGCGGGT
TACGATGTCA AAGGTGTATT GCAGCTGGCG GTAAAAACGG CGGCGGTTAT GCTGCTGATG
CCACGGGTGA TTAAACCCAT CATGGATGGT TTAACGCCCA TCGCTAAGCA GGCCCGTAGT
CGTTTACAGG CGAAGTTCGG CGGTCAGGAG TTCCTGATTG GCCTGGATCC CGCATTACTG
CTGGGGCATA CGGCGGTGGT ATCGGCAAGC CTGATTTTTA TCCCGCTCAC CATTTTAATT
GCTGTTTGTG TGCCGGGTAA TCAGGTGCTG CCGTTTGGCG ACCTTGCCAC CATCGGCTTC
TTTGTGGCGA TGGCGGTCGC GGTGCATCGT GGAAATCTGT TCCGCACCTT AATCTCGGGT
GTCATCATTA TGAGCATCAC CCTGTGGATC GCGACGCAAA CTATTGGTTT GCACACCCAA
CTGGCGGCTA ATGCGGGCGC GTTAAAAGCC GGGGGTATGG TGGCTTCAAT GGATCAGGGC
GGTTCTCCCA TTACCTGGTT ACTGATTCAG GTTTTCTCCC CGCAAAATAT TCCCGGTTTC
ATTATTATCG GCGCAATTTA TCTGACCGGT ATTTTCATGA CCTGGCGTAG AGCGCGTGGT
TTTATTAAAC AAGAGAAAGC CGTTCTCGCA GAATAA
 
Protein sequence
MFSEVMRYIL DLGPTVMLPI VIIIFSKILG MKAGDCFKAG LHIGIGFVGI GLVIGLMLDS 
IGPAAKAMAE NFDLNLHVVD VGWPGSSPMT WASQIALVAI PIAILVNVAM LLTRMTRVVN
VDIWNIWHMT FTGALLHLAT GSWMIGMAGV VIHAAFVYKL GDWFARDTRN FFELEGIAIP
HGTSAYMGPI AVLVDAIIEK IPGVNRIKFS ADDIQRKFGP FGEPVTVGFV MGLIIGILAG
YDVKGVLQLA VKTAAVMLLM PRVIKPIMDG LTPIAKQARS RLQAKFGGQE FLIGLDPALL
LGHTAVVSAS LIFIPLTILI AVCVPGNQVL PFGDLATIGF FVAMAVAVHR GNLFRTLISG
VIIMSITLWI ATQTIGLHTQ LAANAGALKA GGMVASMDQG GSPITWLLIQ VFSPQNIPGF
IIIGAIYLTG IFMTWRRARG FIKQEKAVLA E