Gene EcolC_1647 details


Gene Information

Locus tag: EcolC_1647
Symbol:
ID: 6065248
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1825809
End bp: 1827062
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 44%
IMG OID: 641601061
Product: PTS system galactitol-specific IIC component
Protein accession: YP_001724631
Protein GI: 170019677
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3775] Phosphotransferase system, galactitol-specific IIC component
TIGRFAM ID:
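
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the record above.
START_BP = 1825809
END_BP = 1827062
GENE_LENGTH_BP = 1254
PROTEIN_LENGTH_AA = 417


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                 # True
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

Under these assumptions the reported span, gene length, and protein length agree with one another.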


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000444163
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGAAATGT TGAAAAATTT AATGGAAACA ATCACCGGAA TGGGGGCAAC TGCAATACTG 
CCGCTGGTGA TATTTATTTT AGGTCTGGTG TTCAGAATGA AACCAGGTGC CGCGATTAAG
TCGGGCATCA CAGTCGGTAT TGGTTTTATT GGTTTAGGTT TGGTTGTTGG TTTATTAAAT
AGTTCATTAC AGCCAGCTAT TGAGTATTAC TCAAAAGTAG GTAGTGGTTT TACGGTTGCA
GATATTGGTT GGCCAGCAGT CGGTGCTGCT GCATGGGTAG CACCTTTTGC CGCATTAGTG
ATACCAGTTG GCATCGTACT TAACCTGATT CTGGTGCGCC TGAAATTAAC CAAAACGTTG
AATGTTGATA TCTGGAACTA TATGCATTTT TTAGTTCCGG GTGCTCTGGC ATATTTTGTG
TTCGACAGTT TTATCATCGG CTTCTCTGTC GCGGTGGCTT TGAGTATTGC GGCTCTGTTT
ATTGGCGATT TGATTGCGCC CAGATGGCAA AAATATTATG GACTCGAGGG AACGACCTGT
ACCACGATGA TTCATATCGG CTGGACTCTA CCGTTCGCCT GGGTAGTAAA TAAAATCATT
GATTACATCC CTGGGTTAAA TAAGTTAGAT GTTGATTTAA ACAGCGTGCA AAAACGCCTG
GGTGTCTTTG GTGAACCTGC AATTATCGGT GTTATCGTTG GCGCATTACT CGGAGTTTTA
ACGAAACAGG CAATTACCAC AATTGTTCCC ATGGCGATGG GGGTTGCTGG TGTAATGGTA
TTGTTACCTA AAGTGGTCGG TGTGCTGATG GAAGGTCTTA ACCCGATTGG GAAAAGTGCC
AAAGAAATCA TGCAAAAACA GATGGGTAAA GATGCTGAAT TAAACATCGG TATGGATTGT
GCACTAGCGT TGGGGGATCC GGCGACGGTC ACCGTGACAG TAATTACCAT TCCTTTAACC
ATGCTATGTG CTCTGGTATT GCCTGATATT AAGATCTTCC CAATTGGCGT ATTGATGTCA
ATTATTTATA TGACCACCAT GACCGTAATG GCGAGCAACG GTAACGTGAT TCGTTCGATT
ATCTCGACCT TGTTATTCTG CGTTGTAGTG ATGTATTTAG GCGGTTATGT CGCACCAGGG
GCAACGCAAT TTTTAGCTGG AGCCGGTGTA GGCTTGCAAG GACAAGGTAC TGATTTTGTA
TTAACCGGCC CGTGGGAAAT TTTAACCTAT TGGTTGAGTA CCGTATTACA TTGA
 
Protein sequence
MEMLKNLMET ITGMGATAIL PLVIFILGLV FRMKPGAAIK SGITVGIGFI GLGLVVGLLN 
SSLQPAIEYY SKVGSGFTVA DIGWPAVGAA AWVAPFAALV IPVGIVLNLI LVRLKLTKTL
NVDIWNYMHF LVPGALAYFV FDSFIIGFSV AVALSIAALF IGDLIAPRWQ KYYGLEGTTC
TTMIHIGWTL PFAWVVNKII DYIPGLNKLD VDLNSVQKRL GVFGEPAIIG VIVGALLGVL
TKQAITTIVP MAMGVAGVMV LLPKVVGVLM EGLNPIGKSA KEIMQKQMGK DAELNIGMDC
ALALGDPATV TVTVITIPLT MLCALVLPDI KIFPIGVLMS IIYMTTMTVM ASNGNVIRSI
ISTLLFCVVV MYLGGYVAPG ATQFLAGAGV GLQGQGTDFV LTGPWEILTY WLSTVLH