Gene Hhal_0895 details


Gene Information

Locus tag: Hhal_0895
Symbol:
ID: 4709945
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 971853
End bp: 972827
Gene Length: 975 bp
Protein Length: 324 aa
Translation table: 11
GC content: 73%
IMG OID: 639855364
Product: thiamine-monophosphate kinase
Protein accession: YP_001002473
Protein GI: 121997686
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0611] Thiamine monophosphate kinase
TIGRFAM ID: [TIGR01379] thiamine-monophosphate kinase
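
The start, end, and length fields above agree with each other if the coordinates are read as inclusive and 1-based. The record does not state its coordinate convention, so that reading is an assumption. A minimal Python sketch of the check follows; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length(971853, 972827)
print(length)                  # 975
print(protein_length(length))  # 324
```

Both results match the Gene Length and Protein Length fields of the record.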


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0369638
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGATCGGCG GAGGCGAAGC GGCGCTCATC CAGCGCTATT TCCACGGTCT GACCGCGCAG 
CGCGACGGGG TCGAGCTGGG GGTCGGGGAC GACGCGGCAC TGCTCCAGAC CGAGACGGGG
CTGCTGGCTG CCTGTTGCGA CACCTTGGTC GAGGACATCC ACTTCCCTGG GGATATCCCA
CCGGAGGCGC TGGGTCACCG GGTCCTGGCG GTCAATCTCA GCGATCTGGC CGCCGTCGGT
GCCCGACCGG CCTGGACCCT GCTCTCCCTG ACCCTGCCAG AGAAAGACCC GCAGTGGTTG
GAGCGGTTCA GCGCCGGATT CAAGGCGCTG GCCGACCGCT ACGGCGTCGC CCTGGTCGGC
GGAGACACCA CCCAGGGTCC GCTGTCGGTC TCGGTCACCG CCCTCGGCCA GGTGGCCGGC
GATCACGGCC TTCGGCGCGG TGGCGCGCGG CCCGGCGACG GCGTCTGGGT GACCGGTACC
CTGGGGGATG CGGCCCTGGG GCTGGAGCTG TGGCAGGAGC GCGAGGAGGC GACCGCACTG
GCCGGCGATC CGGCCTACCT GGCGGGCCGG CTGTTCCGCC CCGAGCCGCG GGTGGCGGCC
GGTACCGCGC TGCTGGGGCG CGCGAGTGCT GCTATCGACG TCTCCGACGG GCTCGCCGCC
GACCTGTCGC GGGTGCTCGA TGAGAGCGGC GTCGGCGCCA CCCTGGAGCT GGAGGCATTG
CCCCGCTCCC AGGCGTTCAT CGATGAGCAG GGCGACCTCC GCCACCTGCT CCACGGCGGC
GACGACTACG AACTCTGCTT CACCCTGCCG GCGGAGCGGG AGGAGGAGAT GGCCTGCCTG
CGCGAGCACG CCGCCACGCC GGTGACGCGC ATCGGCACCG TGGAGGAGAC CCCCGGGCTG
CGCGGGGTGG ACGCCGGCGG CGTGGTCTGC GCCCTGGAGC CGGGTGGCTA CGACCACTTC
GCGGAGGGGT CGTGA
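
The gene sequence is printed in blocks of 10 bases, so it needs to be joined before any per-base statistic such as GC content is computed. The following Python sketch shows one way to do that. The helper names are hypothetical, and the example call uses only the first line of the sequence, so its result is not expected to equal the whole-gene GC content reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence only, for illustration.
first_line = clean_sequence(
    "GTGATCGGCG GAGGCGAAGC GGCGCTCATC CAGCGCTATT TCCACGGTCT GACCGCGCAG"
)
print(len(first_line))  # 60
print(round(gc_percent(first_line), 1))
```

Passing the full 975-base sequence through `clean_sequence` and `gc_percent` would give the figure to compare against the GC content field.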
 
Protein sequence
MIGGGEAALI QRYFHGLTAQ RDGVELGVGD DAALLQTETG LLAACCDTLV EDIHFPGDIP 
PEALGHRVLA VNLSDLAAVG ARPAWTLLSL TLPEKDPQWL ERFSAGFKAL ADRYGVALVG
GDTTQGPLSV SVTALGQVAG DHGLRRGGAR PGDGVWVTGT LGDAALGLEL WQEREEATAL
AGDPAYLAGR LFRPEPRVAA GTALLGRASA AIDVSDGLAA DLSRVLDESG VGATLELEAL
PRSQAFIDEQ GDLRHLLHGG DDYELCFTLP AEREEEMACL REHAATPVTR IGTVEETPGL
RGVDAGGVVC ALEPGGYDHF AEGS
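
The protein sequence can be checked against the gene sequence by translating it with translation table 11, as listed in the record. The sketch below is a plain-Python illustration under stated assumptions, not the pipeline that produced this record. It relies on table 11 assigning the same amino acid to each codon as the standard code while allowing alternative start codons such as GTG, which are read as methionine at the initiation position. The names `CODON_TABLE`, `START_CODONS`, and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids in NCBI's TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

# Initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(cds: str, first_codon_is_start: bool = True) -> str:
    """Translate a coding sequence; a trailing stop codon is dropped."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    # An alternative start codon is read as Met at the initiation position.
    if first_codon_is_start and codons and codons[0] in START_CODONS:
        residues[0] = "M"
    protein = "".join(residues)
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases of the gene sequence.
print(translate("GTGATCGGCG GAGGCGAAGC GGCGCTCATC"))  # MIGGGEAALI
```

The output matches the first 10 residues of the protein sequence. Set `first_codon_is_start=False` when translating a fragment taken from the middle of the gene.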