Gene EcolC_1555 details


Gene Information

Locus tag: EcolC_1555
Symbol:
ID: 6065268
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1720719
End bp: 1722074
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 51%
IMG OID: 641600971
Product: PTS system, galactitol-specific IIC subunit
Protein accession: YP_001724541
Protein GI: 170019587
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3775] Phosphotransferase system, galactitol-specific IIC component
TIGRFAM ID: [TIGR00827] PTS system, galactitol-specific IIC component
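
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1720719
end_bp = 1722074

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1356
print(protein_length)  # 451
```

Both results agree with the Gene Length and Protein Length fields.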


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTTCAG AAGTCATGCG TTATATTCTC GACCTCGGCC CTACGGTGAT GCTGCCGATT 
GTCATCATTA TTTTTTCTAA AATATTAGGC ATGAAGGCAG GCGATTGCTT TAAAGCGGGT
CTGCATATCG GGATTGGCTT TGTTGGCATT GGCCTTGTGA TTGGCTTAAT GCTGGATTCC
ATTGGCCCGG CGGCGAAAGC GATGGCGGAA AATTTCGACC TGAATCTGCA TGTGGTCGAT
GTTAGCTGGC CGGGCTCTTC ACCAATGACC TGGGCGTCGC AAATTGCGCT GGTGGCGATT
CCGATTGCGA TTCTGGTTAA CGTGGCGATG TTACTGACCC GTATGACGCG GGTGGTAAAT
GTTGATATCT GGAATATCTG GCATATGACC TTCACCGGCG CGTTGCTGCA TCTGGCAACC
GGTTCATGGA TGATAGGGAT GGCAGGTGTG GTAATTCACG CGGCGTTTGT TTATAAGCTC
GGCGACTGGT TTGCCCGCGA TACCCGAAAT TTCTTTGAGC TGGAAGGTAT TGCTATTCCG
CACGGTACGT CGGCGTATAT GGGGCCGATT GCGGTGCTGG TCGATGCTAT CATCGAGAAA
ATCCCAGGCG TTAACCGAAT TAAATTTAGC GCCGACGATA TTCAGCGCAA ATTTGGTCCA
TTTGGCGAGC CTGTCACCGT GGGTTTTGTG ATGGGGCTGA TTATCGGCAT CCTCGCGGGT
TACGATGTCA AAGGTGTATT GCAGCTGGCG GTAAAAACGG CGGCAGTGAT GCTGCTAATG
CCACGGGTGA TTAAACCCAT CATGGATGGT TTAACGCCCA TCGCTAAGCA GGCTCGTAGT
CGTTTACAGG CGAAGTTCGG CGGTCAGGAG TTCCTGATTG GCCTTGATCC GGCGTTGCTG
CTGGGACATA CGGCGGTGGT ATCGGCAAGC CTGATTTTTA TCCCACTCAC CATTTTAATT
GCTGTTTGTG TGCCGGGTAA TCAGGTGCTG CCGTTTGGCG ATCTTGCCAC CATCGGCTTC
TTCGTGGCGA TGGCGGTCGC CGTGCATCGT GGAAATCTGT TCCGCACCTT AATCTCGGGT
GTCATCATTA TGAGCATCAC CCTGTGGATC GCGACGCAAA CTATTGGTTT GCACACCCAA
CTGGCGGCTA ATGCTGGGGC GTTAAAAGCC GGGGGTATGG TGGCTTCAAT GGATCAGGGC
GGTTCTCCCA TTACCTGGTT ACTGATTCAG GTTTTCTCCC CGCAAAATAT TCCCGGTTTC
ATTATTATCG GTGCAATTTA TCTGACCGGT ATTTTCATGA CCTGGCGTAG AGCGCGTGGC
TTTATTAAAC AAGAGAAAGT CGTTCTCGCA GAATAA
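
A GC percentage can be computed directly from a sequence laid out in space-separated blocks like the one above. This is a minimal sketch: `gc_percent` is a hypothetical helper, and the way the GC content field was originally rounded is not stated in the record. Running it on the full gene sequence gives a value to compare against that field.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First 10-base block of the gene sequence: 3 of its 10 bases are G or C.
print(gc_percent("ATGTTTTCAG"))  # 30.0
```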
 
Protein sequence
MFSEVMRYIL DLGPTVMLPI VIIIFSKILG MKAGDCFKAG LHIGIGFVGI GLVIGLMLDS 
IGPAAKAMAE NFDLNLHVVD VSWPGSSPMT WASQIALVAI PIAILVNVAM LLTRMTRVVN
VDIWNIWHMT FTGALLHLAT GSWMIGMAGV VIHAAFVYKL GDWFARDTRN FFELEGIAIP
HGTSAYMGPI AVLVDAIIEK IPGVNRIKFS ADDIQRKFGP FGEPVTVGFV MGLIIGILAG
YDVKGVLQLA VKTAAVMLLM PRVIKPIMDG LTPIAKQARS RLQAKFGGQE FLIGLDPALL
LGHTAVVSAS LIFIPLTILI AVCVPGNQVL PFGDLATIGF FVAMAVAVHR GNLFRTLISG
VIIMSITLWI ATQTIGLHTQ LAANAGALKA GGMVASMDQG GSPITWLLIQ VFSPQNIPGF
IIIGAIYLTG IFMTWRRARG FIKQEKVVLA E
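
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below builds a codon table from the compact amino-acid string that NCBI publishes for the standard code; translation table 11 assigns the same amino acids and differs in which codons may act as start codons, which this sketch does not model. `translate` is a hypothetical helper, not a function from any library.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing and line breaks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence.
print(translate("ATGTTTTCAG AAGTCATGCG TTATATTCTC"))  # MFSEVMRYIL
```

The output matches the first block of the protein sequence. Applied to the last line of the gene sequence, the same function gives FIKQEKVVLAE and stops at the terminal TAA.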