Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0671 |
Symbol | |
ID | 4710260 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 753907 |
End bp | 755436 |
Gene Length | 1530 bp |
Protein Length | 509 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 639855133 |
Product | carbohydrate kinase, YjeF related protein |
Protein accession | YP_001002255 |
Protein GI | 121997468 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related [TIGR00197] yjeF N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGACTG ATCCCTCCAT ACGGAACCCT GAACCCGCTC CCATGACGGA ACTCTACACC CCCGATCAAG TGGCGCGCCT CGATCAGGCG GCCATCCGTG CCGGCCTCCC CGGCGAGGTG CTCATGGACC GCGCCGGGCG GCGGCTGTGG CGGGAGATCC GTCGGCGCTG GCCCGAGGCG CGGCGGCTGG TGGTCGTCTG TGGCGGCGGC AACAACGGCG GCGACGGCTA CGTGGTGGCG CGCCTGGCGG CGCGGGCCGG GCTGGCGGTG GAGGTGCTCC ACCGGGTCCC GCCCGAGCGC CTGGGCGGGG ATGCGGCCCG TCACGCCCAG CGGTATCTCG AGGGCGGCGG CGTCTGCCGC CCCTTCGACG CGGCCGCCCT GGCCGAGGCG GATGTGATCG TCGATGCCCT GCTCGGCACC GGGCTGGATC GGCCGGTGAG CGGGGCCTTC GCCGAGGCGG TGGCGGCCAT CAATGCCGCG CCGGCGCCGG TGGCCGCCGT GGATATCCCC TCGGGGATCC ACGGGCGGAC CGGTGCCGAG ATGGGGGTGG CCGTGCGCGC GCAGGTAACC GCGACCTTCG TCGCCCGCAA GAGCGGGCTG TTCACCGGCC GCGGTCCAGC GTGCAGCGGG GCGGTGGTCT TCGACGATCT GGGCACCGGG GCGCTGGTCG CCGGCAGCGA GTCGCCCCAT ACACGGCAGG TGACGGCGGC GGATCGGGCC GCGCTGCTGC CGCCGCGGCC GCGGGATGCC CACAAGGGGC ACTATGGGCA CGTGCTGGTG GTCGGCGGCG ATGCCGGCAT GGCCGGTGCG GTGCGCCTGG CCGCGGAGGC GGCGGCGCGC TGCGGCGCCG GTCTGGTCAG CGTGGCGACC CGCCCGGAGC ACGTCCCCGT GGTGGTGGGG GCCTGTCCGG CGGTCATGGC CCACGGCGTG ACCGACGCCC AGGAGCTGGC GCCGCTGCTG GAGCGGGCCA GTGTGGTGGC CATCGGCCCC GGCCTGGGGC AGGACCCCTG GGGGCAGGCG ATGTGGGCGG CCTGCCGGGA CGTGGTGCGC CCCCGGGTGG TGGACGCCGA CGGCCTCAAC CTGCTTGCTG TCGACGGGCA GCCGGTGACC GACGCCGTGC TCACCCCGCA CCCTGGCGAG GCGGTGCGGC TCCTGGGTCC GGGTTGGGAC ACGGCGGCGA TCGCCGCGGA TCGCTTCGCA GCCGTGCGGG CGCTGGCCAC GCAGTGGCAG GCGGTGGCCC TGCTCAAGGG GGCGGGCAGC CTGGTGGATG ACGGCGCCTC TCGCTACCTG GCCGGCACCG GTACGCCGGG GATGGCCAGC GGCGGTATGG GCGATGTGCT CACCGGGGTG GTGGCGGGCC TGCGCGCCCA GCGACCGGAC GCGGACCCGG CCTGGCTGGC GGCGGTGGCC GCCGAGGTCC ACGGCCGCGC GGGGGAGCGG GCCGCCGAGG CCCTGGGCGG CGAGCGCGGG CTGCTCGCCA GCGATCTGCT CGGCTGGTTG CCGGCGGTGC TGGCGGAGGA GCCGGCGTGA
|
Protein sequence | MQTDPSIRNP EPAPMTELYT PDQVARLDQA AIRAGLPGEV LMDRAGRRLW REIRRRWPEA RRLVVVCGGG NNGGDGYVVA RLAARAGLAV EVLHRVPPER LGGDAARHAQ RYLEGGGVCR PFDAAALAEA DVIVDALLGT GLDRPVSGAF AEAVAAINAA PAPVAAVDIP SGIHGRTGAE MGVAVRAQVT ATFVARKSGL FTGRGPACSG AVVFDDLGTG ALVAGSESPH TRQVTAADRA ALLPPRPRDA HKGHYGHVLV VGGDAGMAGA VRLAAEAAAR CGAGLVSVAT RPEHVPVVVG ACPAVMAHGV TDAQELAPLL ERASVVAIGP GLGQDPWGQA MWAACRDVVR PRVVDADGLN LLAVDGQPVT DAVLTPHPGE AVRLLGPGWD TAAIAADRFA AVRALATQWQ AVALLKGAGS LVDDGASRYL AGTGTPGMAS GGMGDVLTGV VAGLRAQRPD ADPAWLAAVA AEVHGRAGER AAEALGGERG LLASDLLGWL PAVLAEEPA
|
| |