Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4181 |
Symbol | |
ID | 5736043 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5332317 |
End bp | 5334020 |
Gene Length | 1704 bp |
Protein Length | 567 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641281336 |
Product | carbohydrate kinase, YjeF related protein |
Protein accession | YP_001546941 |
Protein GI | 159900694 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related [TIGR00197] yjeF N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAAAAA TTGTGCCTGG GTATCACATT ACAACCGCTG CCCAAATTCG GGCAATCGAG CAACATGCTG TTGACGAAGG GGCGACGTGG GCTGGCCTGA TGGCCGAGGC TGCACGCGGC ATGGCCGATG TTGGATTAAC CGTGATCGCC AAGCAAACCA ACCCCAGCGT GTTAGTGCTG GTTGGTTCTG GCAACAATGG CGGCGATGCT TTGGTGATTG CACGGCATAT CAAAGAGGCT GGCTTTCCAG TCACCTGCTA TCTATATAAA CGCAAGCCGC ACGCTGATGA TTGGCCGTTT GCTGCTGCCC AAGCCGAAGC CATTCCAATG ATCTTCGCTG CTGATGATCC ACAAAATCAG CAACTCCAGC AACTGTTGCA AACCAACACA TTCATCATTG ATGGGTTGTT TGGCATTGGC CTGAGTCGTT CGTTGGCGGC TGAGGTAGTC CAAATTATCG ATTTGGTCAA TGCCAGCAAA TTGCCAGTTT TGGCAGTCGA TGTGCCTTCG GGCCTTGATG CTGACAATGG CAAGATTTGG GGCACAATCA TCAATGCTGC CTACACCGTA GCGGCTGGCC TAACCAAACG CGGCCATCAT TTGTACCCAG GTGCGGCCTA TGTTGGCAAA TTGGCAATCG CCCCCTTTAC CCTACCAGAT TCTATGGAGG AGCCGATGAC TACAACTGAA CTGAATCTTG CGACAATCCG TTCGCTGGTG CCGGCCCGTC CGGTTGATGG CCATAAAGAT ACTTTCGGGC GCGTGATGGT TGTGGCTGGC TCGTATCTCT ATCCTGGCGC GGCTTGGCTT GCGGCCACGG CGGCGGCTCG TTCGGGTGCT GGGGTAGTAA CCTTGGCTTG CCCACGCTCA ATTTATGGCA GCACCGCCGC CCATCTGCAT GAAGTAACCT ATTTGCCGTT GCCAGAAGTT GAACCAGGCG AGTTGACCGA AGCCGCTGCT AAGTTGGTGC ACGAAAAACT AGCCAAATAT AAAGCCTTGC TGGTTGGACC AGGCCTTGGC ACCGAAACAG GCACTGGCGA TTTTCTACGG GCGCTGATTG GCTTGGCCTC AAGCAAACGT CGGTTAGGCG TAGGTTTTCT TGGCTCAAGC GAGCTTGAGT TGCCGACCAA ACGCAAAGGC GGGGTTGGTT TCGGCCTAGC TGCTCGGCCC AAAGAAGAAG CAAAGGCTGA AGAAGATGGC CCAGTCGTGC TGCCACCGCT GGTTATCGAT GCCGATGGCT TGAATATTTT GGCCACAATT GAACATTGGG AAGAAAAATT ACGCGATCAG CCTGTCGTGC TCACACCACA CATCGGCGAG ATGGCCCGCT TGTTGGGCGA AGAAAAAATC GGTGAAGATC ACCCACAAAT TGCACTCGAA GCGGCAGCCC GCTGGGGCGT GACCGTGGTG CTGAAATCGG CCCACACAAT TATTGCAAGC CCTGATGGCC GCTTGGCGTT GCATGGTTTG GCTAACCCAG CATTGGCAAC CGCCGGATCT GGTGATATCC TTGCAGGACT AACTGCTGGC TTGATGGCAC AAGGCTTAGC CCCATTTGAA GCCGCCCAGC TTGCAGTTGG CGTGCATGGC GTTGCAGGCG CTCTGGTTCG TGAAGAACTC GGCGAACGCG GCACCATTGC TAGCGATATT CTTAATCGCC TGCCCTTAGC ATGGCGAAAA CTTACAGAAG GCGGATTAAA ATAA
|
Protein sequence | MPKIVPGYHI TTAAQIRAIE QHAVDEGATW AGLMAEAARG MADVGLTVIA KQTNPSVLVL VGSGNNGGDA LVIARHIKEA GFPVTCYLYK RKPHADDWPF AAAQAEAIPM IFAADDPQNQ QLQQLLQTNT FIIDGLFGIG LSRSLAAEVV QIIDLVNASK LPVLAVDVPS GLDADNGKIW GTIINAAYTV AAGLTKRGHH LYPGAAYVGK LAIAPFTLPD SMEEPMTTTE LNLATIRSLV PARPVDGHKD TFGRVMVVAG SYLYPGAAWL AATAAARSGA GVVTLACPRS IYGSTAAHLH EVTYLPLPEV EPGELTEAAA KLVHEKLAKY KALLVGPGLG TETGTGDFLR ALIGLASSKR RLGVGFLGSS ELELPTKRKG GVGFGLAARP KEEAKAEEDG PVVLPPLVID ADGLNILATI EHWEEKLRDQ PVVLTPHIGE MARLLGEEKI GEDHPQIALE AAARWGVTVV LKSAHTIIAS PDGRLALHGL ANPALATAGS GDILAGLTAG LMAQGLAPFE AAQLAVGVHG VAGALVREEL GERGTIASDI LNRLPLAWRK LTEGGLK
|
| |