Gene Hlac_1235 details


Gene Information

Locus tag: Hlac_1235
Symbol:
ID: 7399503
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 1245922
End bp: 1247370
Gene Length: 1449 bp
Protein Length: 482 aa
Translation table: 11
GC content: 72%
IMG OID: 643708299
Product: carbohydrate kinase, YjeF related protein
Protein accession: YP_002565897
Protein GI: 222479660
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0063] Predicted sugar kinase
TIGRFAM ID: [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
            [TIGR00197] yjeF N-terminal region
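
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the protein length excludes the terminal stop codon; both are assumptions about this record's conventions, not something the record states. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1245922
end_bp = 1247370

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon is a stop,
# which does not contribute a residue to the protein.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length)     # 1449
print(remainder)       # 0
print(protein_length)  # 482
```

Under those assumptions the computed values match the listed Gene Length (1449 bp) and Protein Length (482 aa).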


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.00615219
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGATAACCA CCGACCGAAT GGCGGCGGTC GACGCTAACG CGGCCGCCCT CGGCGTTCCT 
CGGAAGCAGC TGATGGAGTC GTCCGGCAAC GCCGTCGCCC GCGAGGTCCG AGCGATCGCG
GACCCCGGCG CGAGCGTCGA ACTGCTCTGC GGACGCGGGA ACAACGGCGG AGACGCGTTC
GTCGCGGCGC GCTTCCTCTC CGCGTACGAC GTGACCGTCC GCCTGCTCGG GCGCCCCGAG
TCGATCCGGA CCGAGATCGC TCGCGAGAAC TGGGACGCCC TCAAGAGCGC CGCGATCCCC
ACCGAGACCG TCGCCGACGC CGCCGACCTC GCGCTCGACG ATCCGGACGT GATCGTCGAC
GCGATGCTCG GAAGCGGAAT TACCGGCGCG CTCCGAGAGC CGGAACGGAC CGCGGCGCGG
CTGGCGAACG AGAGCGACGC CGCGGTCGTC GCCGTCGACG TGCCCTCCGG GATCGACGCC
GACACCGGCG AATCGACCGG GAGCGGCGAC GACGACGTGG TCCGTGTCGA GGCCGACCGC
GTCGTCACCT TCCACGACGA GAAGCCCGGA CTGACGGCGC TCGACGCCGA CGTGACCGTC
GCGGACATCG GTATCCCGGC GGCCGCCGAG CGGTTCACCG GTCCAGGCGA CCTGCTCGGG
ATCGCGCGCG ACCCGAACTC TCACAAGGGC GAGAACGGCG AGGTGCTCGT GATCGGCGGC
GGCCCGTACA CCGGCGCACC CTCGCTTTCG GCCCGATCGG CCCTCCGGAC CGGCGCCGAC
CTCGTGCGCG TCGCCTGCCC GGAGACCGTC GCAAGGACGG TTCAGGGCTA CTCCGCAGAC
CTGATCGTTC GCGGGCTGCC GGGCAACCGT ATCGGCCCCG CCCACGTCGA CCGCGCGCTA
GAACTTGCCG CCGGCAACGA CGTGGTCGTG CTCGGCCCGG GGCTCGGCGA CAGCGACGGC
GTGAGCGAGT TCGTCCGTGA GTTCCTGTCG CGGTACGACG GGCGGGCGGT CGTCGACGCC
GACGCACTCC GGGTCGTCCC CGAGATCGAC ACGGACGCCG AACTGATCTG CACGCCGCAT
CAGGGCGAAC TGGTCGGGAT GGGCGGCGAG ACCGCCGACG ACCCCGACGA GCGCGCGGCG
CTCGTGCGGT CGTTCGCCGA CGAGATCGGT CACACGCTGC TGGTGAAGGG CGCGGTCGAC
GTGGTTAGCG ACGGCGACGG GGTCCGGCTG AACCACACGG GGAACCCGGG GATGACCGTC
GGCGGGACCG GCGACGTACT CGCGGGCGCG GTCGGCGCGC TCGCGGCCGT GACCGACTCG
TTCCACGCGG CCGCGGTCGG GGTGTACGCC AACGGGCTGG CGGGCGACGC GGCGGCCGAC
GATATGGGGT ACGGCCTCGT GGCGACGGAC TTACCCGACC GGCTTCCCGA GGCGATGCGT
GATGAGTGA
 
Protein sequence
MITTDRMAAV DANAAALGVP RKQLMESSGN AVAREVRAIA DPGASVELLC GRGNNGGDAF 
VAARFLSAYD VTVRLLGRPE SIRTEIAREN WDALKSAAIP TETVADAADL ALDDPDVIVD
AMLGSGITGA LREPERTAAR LANESDAAVV AVDVPSGIDA DTGESTGSGD DDVVRVEADR
VVTFHDEKPG LTALDADVTV ADIGIPAAAE RFTGPGDLLG IARDPNSHKG ENGEVLVIGG
GPYTGAPSLS ARSALRTGAD LVRVACPETV ARTVQGYSAD LIVRGLPGNR IGPAHVDRAL
ELAAGNDVVV LGPGLGDSDG VSEFVREFLS RYDGRAVVDA DALRVVPEID TDAELICTPH
QGELVGMGGE TADDPDERAA LVRSFADEIG HTLLVKGAVD VVSDGDGVRL NHTGNPGMTV
GGTGDVLAGA VGALAAVTDS FHAAAVGVYA NGLAGDAAAD DMGYGLVATD LPDRLPEAMR
DE
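
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such blocks could be joined, how a GC fraction could be computed, and how the nucleotide sequence could be translated for comparison with the protein sequence. The helper names (`join_blocks`, `gc_content`, `translate`) are hypothetical and not part of any database tooling. The codon table is the standard NCBI amino-acid assignment, which translation table 11 shares with table 1; the sketch does not handle table 11's alternative start codons, which is harmless here because the gene begins with ATG.

```python
# Standard codon table in NCBI base order (T, C, A, G); table 11 uses the
# same amino-acid assignments as table 1.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 21 bases and last 9 bases of the gene sequence listed above.
head = join_blocks("ATGATAACCA CCGACCGAAT G")
tail = join_blocks("GATGAGTGA")
print(translate(head))  # MITTDRM
print(translate(tail))  # DE
```

The two translated fragments agree with the start (MITTDRM) and end (DE) of the listed protein sequence. Passing the full joined gene sequence to `gc_content` gives a value to compare with the GC content field.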