Gene Hhal_0671 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0671 
Symbol 
ID4710260 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp753907 
End bp755436 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content77% 
IMG OID639855133 
Productcarbohydrate kinase, YjeF related protein 
Protein accessionYP_001002255 
Protein GI121997468 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein
[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGACTG ATCCCTCCAT ACGGAACCCT GAACCCGCTC CCATGACGGA ACTCTACACC 
CCCGATCAAG TGGCGCGCCT CGATCAGGCG GCCATCCGTG CCGGCCTCCC CGGCGAGGTG
CTCATGGACC GCGCCGGGCG GCGGCTGTGG CGGGAGATCC GTCGGCGCTG GCCCGAGGCG
CGGCGGCTGG TGGTCGTCTG TGGCGGCGGC AACAACGGCG GCGACGGCTA CGTGGTGGCG
CGCCTGGCGG CGCGGGCCGG GCTGGCGGTG GAGGTGCTCC ACCGGGTCCC GCCCGAGCGC
CTGGGCGGGG ATGCGGCCCG TCACGCCCAG CGGTATCTCG AGGGCGGCGG CGTCTGCCGC
CCCTTCGACG CGGCCGCCCT GGCCGAGGCG GATGTGATCG TCGATGCCCT GCTCGGCACC
GGGCTGGATC GGCCGGTGAG CGGGGCCTTC GCCGAGGCGG TGGCGGCCAT CAATGCCGCG
CCGGCGCCGG TGGCCGCCGT GGATATCCCC TCGGGGATCC ACGGGCGGAC CGGTGCCGAG
ATGGGGGTGG CCGTGCGCGC GCAGGTAACC GCGACCTTCG TCGCCCGCAA GAGCGGGCTG
TTCACCGGCC GCGGTCCAGC GTGCAGCGGG GCGGTGGTCT TCGACGATCT GGGCACCGGG
GCGCTGGTCG CCGGCAGCGA GTCGCCCCAT ACACGGCAGG TGACGGCGGC GGATCGGGCC
GCGCTGCTGC CGCCGCGGCC GCGGGATGCC CACAAGGGGC ACTATGGGCA CGTGCTGGTG
GTCGGCGGCG ATGCCGGCAT GGCCGGTGCG GTGCGCCTGG CCGCGGAGGC GGCGGCGCGC
TGCGGCGCCG GTCTGGTCAG CGTGGCGACC CGCCCGGAGC ACGTCCCCGT GGTGGTGGGG
GCCTGTCCGG CGGTCATGGC CCACGGCGTG ACCGACGCCC AGGAGCTGGC GCCGCTGCTG
GAGCGGGCCA GTGTGGTGGC CATCGGCCCC GGCCTGGGGC AGGACCCCTG GGGGCAGGCG
ATGTGGGCGG CCTGCCGGGA CGTGGTGCGC CCCCGGGTGG TGGACGCCGA CGGCCTCAAC
CTGCTTGCTG TCGACGGGCA GCCGGTGACC GACGCCGTGC TCACCCCGCA CCCTGGCGAG
GCGGTGCGGC TCCTGGGTCC GGGTTGGGAC ACGGCGGCGA TCGCCGCGGA TCGCTTCGCA
GCCGTGCGGG CGCTGGCCAC GCAGTGGCAG GCGGTGGCCC TGCTCAAGGG GGCGGGCAGC
CTGGTGGATG ACGGCGCCTC TCGCTACCTG GCCGGCACCG GTACGCCGGG GATGGCCAGC
GGCGGTATGG GCGATGTGCT CACCGGGGTG GTGGCGGGCC TGCGCGCCCA GCGACCGGAC
GCGGACCCGG CCTGGCTGGC GGCGGTGGCC GCCGAGGTCC ACGGCCGCGC GGGGGAGCGG
GCCGCCGAGG CCCTGGGCGG CGAGCGCGGG CTGCTCGCCA GCGATCTGCT CGGCTGGTTG
CCGGCGGTGC TGGCGGAGGA GCCGGCGTGA
 
Protein sequence
MQTDPSIRNP EPAPMTELYT PDQVARLDQA AIRAGLPGEV LMDRAGRRLW REIRRRWPEA 
RRLVVVCGGG NNGGDGYVVA RLAARAGLAV EVLHRVPPER LGGDAARHAQ RYLEGGGVCR
PFDAAALAEA DVIVDALLGT GLDRPVSGAF AEAVAAINAA PAPVAAVDIP SGIHGRTGAE
MGVAVRAQVT ATFVARKSGL FTGRGPACSG AVVFDDLGTG ALVAGSESPH TRQVTAADRA
ALLPPRPRDA HKGHYGHVLV VGGDAGMAGA VRLAAEAAAR CGAGLVSVAT RPEHVPVVVG
ACPAVMAHGV TDAQELAPLL ERASVVAIGP GLGQDPWGQA MWAACRDVVR PRVVDADGLN
LLAVDGQPVT DAVLTPHPGE AVRLLGPGWD TAAIAADRFA AVRALATQWQ AVALLKGAGS
LVDDGASRYL AGTGTPGMAS GGMGDVLTGV VAGLRAQRPD ADPAWLAAVA AEVHGRAGER
AAEALGGERG LLASDLLGWL PAVLAEEPA