Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2904 |
Symbol | |
ID | 6142718 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2975794 |
End bp | 2977272 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641617773 |
Product | carbohydrate kinase, FGGY family protein |
Protein accession | YP_001744928 |
Protein GI | 170679797 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1070] Sugar (pentulose and hexulose) kinases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0969713 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGAAAA AATACATCAT AGGGATTGAT GGCGGAAGTC AGAGCACAAA AGTGGTGATG TATGATCTGG AAGGTAATGT GGTTTGCGAA GGCAAAGGCT TATTGCAGCC GATGCACACG CCAGATGCCG ATACCGCAGA ACATCCTGAC GACGATTTAT GGGCATCATT ATGTTTTGCC GGTCACGATT TGATGAGTCA GTTTGCCGGA AATAAAGAAG ATATTGTCGG TATTGGTCTG GGATCCATCC GTTGCTGCCG TGCGTTATTG AAAGCCGATG GCACGCCAGC TGCGCCGTTG ATTAGCTGGC AGGATGCACG CGTTACACGT CCCTACGAGC ACACCAATCC TGATGTGGCA TATGTCACCT CTTTTTCAGG TTATCTGACA CACCGTTTAA CCGGCGAGTT TAAAGACAAT ATCGCCAACT ATTTTGGTCA GTGGCCGGTG GATTATAAGA CCTGGGCATG GAGCGAAGAT ACCGCGGTAA TGGAGAAGTT TAATATTCCC CGACAGATGC TTTTTGATGT GCAAATGCCG GGCACTGTTC TTGGGCATAT CACACCACAA GCCGCGCTGG CGACACATTT CCCGGTTGGA CTGCCCGTTG TTTGTACCAC CAGTGATAAA CCGGTAGAAG CACTGGGAGC AGGGTTACTG GATGATGAAA CGGCGGTAAT TTCTCTTGGC ACTTACATCG CGTTGATGAT GAACGGCAAA GCACTGCCGA AAGATCCGGT AGCGTACTGG CCGATTATGT CTTCTATTCC GCAAACCTTG CTGTATGAAG GTTACGGTAT TCGCAAAGGC ATGTGGACGG TGAGCTGGCT ACGCGACATG CTAGGCGAGT CGTTAATTCA GGATGCCAAA GCACAGGATC TTTCACCGGA AGATTTGCTC AACAAAAAAG CGTCTTGTGT GCCGCCTGGC TGTAATGGTC TGATGACGGT GCTGGACTGG CTGACCAATC CGTGGGAACC ATACAAACGC GGGATTATGA TCGGCTTTGA TTCCAGCATG GATTACGCAT GGATATATCG TTCGATACTG GAAAGCGTGG CGCTGACGCT GAAGAACAAT TACGACAATA TGTGTAATGA AATGAATCAC TTTGCAAAGC ATGTGATCAT TACTGGCGGC GGTTCGAACA GCGATCTGTT TATGCAGATT TTTGCCGACG TGTTCAACCT TCCGGCACGC CGAAACGCCA TTAATGGTTG TGCAAGTCTG GGGGCTGCGA TCAATACGGC GGCAGGTCTG GGGCTATACC CGGATTACGC AACGGCTGTC GACAAAATGG TTCGCGTGAA AGATATCTTT ATGCCTGTTG AGAGCAACGC CAAACGCTAC GACGCGATGA ATAAAGGCAT TTTCAAAGAC CTAACCAAGC ACACTGATGT GATCCTGAAA AAATCGTATG AAGTGATGCA TGGTGAGCTG GGGAATGAAG ATTCGATCCA GAGCTGGTCG AATGCGTAA
|
Protein sequence | MSKKYIIGID GGSQSTKVVM YDLEGNVVCE GKGLLQPMHT PDADTAEHPD DDLWASLCFA GHDLMSQFAG NKEDIVGIGL GSIRCCRALL KADGTPAAPL ISWQDARVTR PYEHTNPDVA YVTSFSGYLT HRLTGEFKDN IANYFGQWPV DYKTWAWSED TAVMEKFNIP RQMLFDVQMP GTVLGHITPQ AALATHFPVG LPVVCTTSDK PVEALGAGLL DDETAVISLG TYIALMMNGK ALPKDPVAYW PIMSSIPQTL LYEGYGIRKG MWTVSWLRDM LGESLIQDAK AQDLSPEDLL NKKASCVPPG CNGLMTVLDW LTNPWEPYKR GIMIGFDSSM DYAWIYRSIL ESVALTLKNN YDNMCNEMNH FAKHVIITGG GSNSDLFMQI FADVFNLPAR RNAINGCASL GAAINTAAGL GLYPDYATAV DKMVRVKDIF MPVESNAKRY DAMNKGIFKD LTKHTDVILK KSYEVMHGEL GNEDSIQSWS NA
|
| |