Gene Information |
Locus tag | EcolC_1647 |
Symbol | |
ID | 6065248 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 1825809 |
End bp | 1827062 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641601061 |
Product | PTS system galactitol-specific IIC component |
Protein accession | YP_001724631 |
Protein GI | 170019677 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3775] Phosphotransferase system, galactitol-specific IIC component |
TIGRFAM ID | |
|
|
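The coordinate and length fields above are mutually consistent, which a short check can confirm. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1825809
end_bp = 1827062

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Each codon is 3 bp; the terminal stop codon encodes no amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the reported Gene Length (1254 bp) and Protein Length (417 aa).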
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000444163 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAAATGT TGAAAAATTT AATGGAAACA ATCACCGGAA TGGGGGCAAC TGCAATACTG CCGCTGGTGA TATTTATTTT AGGTCTGGTG TTCAGAATGA AACCAGGTGC CGCGATTAAG TCGGGCATCA CAGTCGGTAT TGGTTTTATT GGTTTAGGTT TGGTTGTTGG TTTATTAAAT AGTTCATTAC AGCCAGCTAT TGAGTATTAC TCAAAAGTAG GTAGTGGTTT TACGGTTGCA GATATTGGTT GGCCAGCAGT CGGTGCTGCT GCATGGGTAG CACCTTTTGC CGCATTAGTG ATACCAGTTG GCATCGTACT TAACCTGATT CTGGTGCGCC TGAAATTAAC CAAAACGTTG AATGTTGATA TCTGGAACTA TATGCATTTT TTAGTTCCGG GTGCTCTGGC ATATTTTGTG TTCGACAGTT TTATCATCGG CTTCTCTGTC GCGGTGGCTT TGAGTATTGC GGCTCTGTTT ATTGGCGATT TGATTGCGCC CAGATGGCAA AAATATTATG GACTCGAGGG AACGACCTGT ACCACGATGA TTCATATCGG CTGGACTCTA CCGTTCGCCT GGGTAGTAAA TAAAATCATT GATTACATCC CTGGGTTAAA TAAGTTAGAT GTTGATTTAA ACAGCGTGCA AAAACGCCTG GGTGTCTTTG GTGAACCTGC AATTATCGGT GTTATCGTTG GCGCATTACT CGGAGTTTTA ACGAAACAGG CAATTACCAC AATTGTTCCC ATGGCGATGG GGGTTGCTGG TGTAATGGTA TTGTTACCTA AAGTGGTCGG TGTGCTGATG GAAGGTCTTA ACCCGATTGG GAAAAGTGCC AAAGAAATCA TGCAAAAACA GATGGGTAAA GATGCTGAAT TAAACATCGG TATGGATTGT GCACTAGCGT TGGGGGATCC GGCGACGGTC ACCGTGACAG TAATTACCAT TCCTTTAACC ATGCTATGTG CTCTGGTATT GCCTGATATT AAGATCTTCC CAATTGGCGT ATTGATGTCA ATTATTTATA TGACCACCAT GACCGTAATG GCGAGCAACG GTAACGTGAT TCGTTCGATT ATCTCGACCT TGTTATTCTG CGTTGTAGTG ATGTATTTAG GCGGTTATGT CGCACCAGGG GCAACGCAAT TTTTAGCTGG AGCCGGTGTA GGCTTGCAAG GACAAGGTAC TGATTTTGTA TTAACCGGCC CGTGGGAAAT TTTAACCTAT TGGTTGAGTA CCGTATTACA TTGA
|
Protein sequence | MEMLKNLMET ITGMGATAIL PLVIFILGLV FRMKPGAAIK SGITVGIGFI GLGLVVGLLN SSLQPAIEYY SKVGSGFTVA DIGWPAVGAA AWVAPFAALV IPVGIVLNLI LVRLKLTKTL NVDIWNYMHF LVPGALAYFV FDSFIIGFSV AVALSIAALF IGDLIAPRWQ KYYGLEGTTC TTMIHIGWTL PFAWVVNKII DYIPGLNKLD VDLNSVQKRL GVFGEPAIIG VIVGALLGVL TKQAITTIVP MAMGVAGVMV LLPKVVGVLM EGLNPIGKSA KEIMQKQMGK DAELNIGMDC ALALGDPATV TVTVITIPLT MLCALVLPDI KIFPIGVLMS IIYMTTMTVM ASNGNVIRSI ISTLLFCVVV MYLGGYVAPG ATQFLAGAGV GLQGQGTDFV LTGPWEILTY WLSTVLH
|
| |
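The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a hedged sketch, not the database's own pipeline: it builds the codon-to-amino-acid mapping from the standard genetic code layout (translation table 11 uses the same codon assignments and differs mainly in which codons may act as starts), strips the spaces that separate the 10-base blocks, and stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here for illustration.

```python
from itertools import product

# Standard codon layout: first base varies slowest, order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGGAAATGT TGAAAAATTT AATGGAAACA"))  # MEMLKNLMET
```

The output matches the first block of the protein sequence. Passing the full gene sequence should reproduce the full protein sequence in the same way, provided it is pasted with its spacing intact or removed.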