Gene Information |
Locus tag | Hlac_1235 |
Symbol | |
ID | 7399503 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 1245922 |
End bp | 1247370 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643708299 |
Product | carbohydrate kinase, YjeF related protein |
Protein accession | YP_002565897 |
Protein GI | 222479660 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related; [TIGR00197] yjeF N-terminal region |
|
|
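The coordinates and lengths in the Gene Information table can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1245922
END_BP = 1247370


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS: codon count minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1449 482
```

Under these assumptions the result agrees with the reported Gene Length (1449 bp) and Protein Length (482 aa).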
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.00615219 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATAACCA CCGACCGAAT GGCGGCGGTC GACGCTAACG CGGCCGCCCT CGGCGTTCCT CGGAAGCAGC TGATGGAGTC GTCCGGCAAC GCCGTCGCCC GCGAGGTCCG AGCGATCGCG GACCCCGGCG CGAGCGTCGA ACTGCTCTGC GGACGCGGGA ACAACGGCGG AGACGCGTTC GTCGCGGCGC GCTTCCTCTC CGCGTACGAC GTGACCGTCC GCCTGCTCGG GCGCCCCGAG TCGATCCGGA CCGAGATCGC TCGCGAGAAC TGGGACGCCC TCAAGAGCGC CGCGATCCCC ACCGAGACCG TCGCCGACGC CGCCGACCTC GCGCTCGACG ATCCGGACGT GATCGTCGAC GCGATGCTCG GAAGCGGAAT TACCGGCGCG CTCCGAGAGC CGGAACGGAC CGCGGCGCGG CTGGCGAACG AGAGCGACGC CGCGGTCGTC GCCGTCGACG TGCCCTCCGG GATCGACGCC GACACCGGCG AATCGACCGG GAGCGGCGAC GACGACGTGG TCCGTGTCGA GGCCGACCGC GTCGTCACCT TCCACGACGA GAAGCCCGGA CTGACGGCGC TCGACGCCGA CGTGACCGTC GCGGACATCG GTATCCCGGC GGCCGCCGAG CGGTTCACCG GTCCAGGCGA CCTGCTCGGG ATCGCGCGCG ACCCGAACTC TCACAAGGGC GAGAACGGCG AGGTGCTCGT GATCGGCGGC GGCCCGTACA CCGGCGCACC CTCGCTTTCG GCCCGATCGG CCCTCCGGAC CGGCGCCGAC CTCGTGCGCG TCGCCTGCCC GGAGACCGTC GCAAGGACGG TTCAGGGCTA CTCCGCAGAC CTGATCGTTC GCGGGCTGCC GGGCAACCGT ATCGGCCCCG CCCACGTCGA CCGCGCGCTA GAACTTGCCG CCGGCAACGA CGTGGTCGTG CTCGGCCCGG GGCTCGGCGA CAGCGACGGC GTGAGCGAGT TCGTCCGTGA GTTCCTGTCG CGGTACGACG GGCGGGCGGT CGTCGACGCC GACGCACTCC GGGTCGTCCC CGAGATCGAC ACGGACGCCG AACTGATCTG CACGCCGCAT CAGGGCGAAC TGGTCGGGAT GGGCGGCGAG ACCGCCGACG ACCCCGACGA GCGCGCGGCG CTCGTGCGGT CGTTCGCCGA CGAGATCGGT CACACGCTGC TGGTGAAGGG CGCGGTCGAC GTGGTTAGCG ACGGCGACGG GGTCCGGCTG AACCACACGG GGAACCCGGG GATGACCGTC GGCGGGACCG GCGACGTACT CGCGGGCGCG GTCGGCGCGC TCGCGGCCGT GACCGACTCG TTCCACGCGG CCGCGGTCGG GGTGTACGCC AACGGGCTGG CGGGCGACGC GGCGGCCGAC GATATGGGGT ACGGCCTCGT GGCGACGGAC TTACCCGACC GGCTTCCCGA GGCGATGCGT GATGAGTGA
|
Protein sequence | MITTDRMAAV DANAAALGVP RKQLMESSGN AVAREVRAIA DPGASVELLC GRGNNGGDAF VAARFLSAYD VTVRLLGRPE SIRTEIAREN WDALKSAAIP TETVADAADL ALDDPDVIVD AMLGSGITGA LREPERTAAR LANESDAAVV AVDVPSGIDA DTGESTGSGD DDVVRVEADR VVTFHDEKPG LTALDADVTV ADIGIPAAAE RFTGPGDLLG IARDPNSHKG ENGEVLVIGG GPYTGAPSLS ARSALRTGAD LVRVACPETV ARTVQGYSAD LIVRGLPGNR IGPAHVDRAL ELAAGNDVVV LGPGLGDSDG VSEFVREFLS RYDGRAVVDA DALRVVPEID TDAELICTPH QGELVGMGGE TADDPDERAA LVRSFADEIG HTLLVKGAVD VVSDGDGVRL NHTGNPGMTV GGTGDVLAGA VGALAAVTDS FHAAAVGVYA NGLAGDAAAD DMGYGLVATD LPDRLPEAMR DE
|
| |
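The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch, with hypothetical helper names, of how the Gene sequence field might be normalized and how a GC percentage comparable to the reported GC content could be computed. It assumes the field contains only the letters A, C, G and T plus whitespace. The example input is just the first two blocks of the gene sequence, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace between blocks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Rough check: length divisible by 3, ATG start, standard stop codon.

    Assumption: only ATG is accepted as a start codon here, although
    translation table 11 also allows alternative start codons.
    """
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in {"TAA", "TAG", "TGA"}
    )


# First two blocks of the Gene sequence field, used only as sample input.
sample = clean_sequence("ATGATAACCA CCGACCGAAT")
print(len(sample), gc_percent(sample))
```

Passing the full Gene sequence field through `clean_sequence` and `gc_percent` is one way to compare against the GC content reported in the table; `looks_like_cds` is only a coarse sanity check and does not validate internal codons.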