Gene Information |
Locus tag | EcSMS35_3770 |
Symbol | |
ID | 6147031 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3833883 |
End bp | 3835391 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641618596 |
Product | carbohydrate kinase |
Protein accession | YP_001745736 |
Protein GI | 170682100 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1070] Sugar (pentulose and hexulose) kinases |
TIGRFAM ID | |
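The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon. The record reports "Is gene spliced: No", but the coordinate convention is an assumption here. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 3833883
END_BP = 3835391
GENE_LENGTH_BP = 1509
PROTEIN_LENGTH_AA = 502


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of residues encoded by an unspliced CDS.

    Assumes the CDS length is a multiple of three and that the
    final codon is a stop codon, which is not translated.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the reported values: the span is 1509 bp, which corresponds to 503 codons, or 502 residues plus one stop codon.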
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCCTGATA ACAGCGCAGC TATCGTTATC GATATTGGCA CCACCAATTG CAAAGTCACC TGCTTTTCCT GCCTGGATGC AACGACGTTG GGCGCGCATA AATTCGTGAC GGCAAAACAG ATCTCCCCAC AGGGCGATGT CGATTTCGAT ATCGACGCCC TCTGGCAGGA GGTCCGCCAG GCAATAGCGC AACTGAGCGC CGCTTCGCCG CTGCCGGTCA GGCGGATAAG CATTGCCAGT TTTGGCGAGT CAGGGGTGTT TCTTGACGAG CATGGCGAGA TCCTGACGCC AATGCTGGCA TGGTATGACC GCCGCGGTGA AGAGTACCTG GCAACGCTTA GTGAGGCAGA CAGTGCGGCG CTTTATGACA TCTGCGGGCT ACCGCTGCAC AGCAATTACT CTGCCTTCAA AATGCGTTGG TTGCTGGAGC ATTATCCACT GCGTAATCGC CGCGGCCTGC GTTGGCTACA TGCGCCGGAA GTACTGCTCT GGCGGCTGAC TGGCGAACAG CGTACGGATA TCACCTTAGC CAGCCGTACG CTGTGTCTGG ACGTGCGTAA AGGCGAATGG TCAGCGAAAG CGGCGGCGTT GTTACACGTT CCCTGTTCAG CATTTGCGCC ATTGGTGCAG CCAGGCGAGC ACGCCGGATG GGTCAGCGAG TCGCTCTGCA AAACGCTTGG GTTCTTGCAA CCGGTCAGCG TGACGCTGGC CGGGCATGAC CATATGGTGG GTGCGCGGGC GTTGCAGATG ACGCCGGGCG ATATCCTTAA CTCGACGGGC ACCACAGAAG GCATTCTGCA ACTGGATACA CAACCGACGC TGGATGAACA TGCCAAACGT GACAAGCTGG CAAACGGCTG TTATTCGCTT GCAAACCAGT TCACCCTGTT TGCGTCGCTG CCGGTGGGCG GTTTCGCTCT GGAGTGGCTG CGCAACACGT TCCGGCTAAC CGATGAGGAG ATCGCCGCAT CACTTACTCG CGGACATGCG GATTACCTGG CGGGGAACTG GTCGCTCGAT GACATTCCCG TTTTTATTCC GCATCTTCGC GGTTCGGGTT CGCCCTATAA AAATCGCCAT ACCCGAGGAT TATTTTATGG GCTTGGCGAT ACGTTAAATA TCGATATGTT AATTGCCAGC GTCTCACTGG GATTAACCAT GGAATTTGCC AACTGCTTCG CCTGTTTTAA TGTGCCCGGC ACCAGCGCGT TAAAAGTGAT AGGTCCGGCA ACCCGTAATC CCCTTTGGCT GCAATTAAAG GCGGATATTT TACAGCGTCC GGTTGAAGCA ATTGCATTTA ACGAGGCGGT TTCTGTCGGA GCATTATTAA CCGCCGCACC GGATATTCCA CCGCCGCAAG TCACCATAGC CCAACGTTTG TTACCGAATC GGGCGAGATA CCATCAATTA CAGCGTTATC AGCACAAATG GAAAAGCTGG TATCAGTTGA AATTACAGCA AGAAGGCGTG ATGCCATTAC ATCATCGGGA GGAGCACTAT GTTGAGTAA
Protein sequence | MPDNSAAIVI DIGTTNCKVT CFSCLDATTL GAHKFVTAKQ ISPQGDVDFD IDALWQEVRQ AIAQLSAASP LPVRRISIAS FGESGVFLDE HGEILTPMLA WYDRRGEEYL ATLSEADSAA LYDICGLPLH SNYSAFKMRW LLEHYPLRNR RGLRWLHAPE VLLWRLTGEQ RTDITLASRT LCLDVRKGEW SAKAAALLHV PCSAFAPLVQ PGEHAGWVSE SLCKTLGFLQ PVSVTLAGHD HMVGARALQM TPGDILNSTG TTEGILQLDT QPTLDEHAKR DKLANGCYSL ANQFTLFASL PVGGFALEWL RNTFRLTDEE IAASLTRGHA DYLAGNWSLD DIPVFIPHLR GSGSPYKNRH TRGLFYGLGD TLNIDMLIAS VSLGLTMEFA NCFACFNVPG TSALKVIGPA TRNPLWLQLK ADILQRPVEA IAFNEAVSVG ALLTAAPDIP PPQVTIAQRL LPNRARYHQL QRYQHKWKSW YQLKLQQEGV MPLHHREEHY VE
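The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the standard codon assignments, which translation table 11 shares with the standard code for elongation (the two differ in which codons may act as start codons). The helper names `clean` and `translate` are hypothetical. Only the first 30 and last 9 nucleotides of the gene sequence above are used, so this is an illustration, not a check of the full record.

```python
from itertools import product

# Codon assignments in the usual T, C, A, G ordering ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(sequence: str) -> str:
    """Remove the spaces that separate 10-character blocks and upper-case."""
    return "".join(sequence.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons from the first base until a stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks and last nine bases of the gene sequence above.
    print(translate("ATGCCTGATA ACAGCGCAGC TATCGTTATC"))
    print(translate("GTTGAGTAA"))
```

The first call yields the opening ten residues of the protein sequence (MPDNSAAIVI), and the second yields its final two residues (VE) before stopping at the TAA codon.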