Gene Hhal_1725 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1725 
Symbol 
ID4710553 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1892127 
End bp1893653 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content68% 
IMG OID639856193 
Producthypothetical protein 
Protein accessionYP_001003291 
Protein GI121998504 
COG category[R] General function prediction only
[S] Function unknown 
COG ID[COG0645] Predicted kinase
[COG2187] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACGGATC GGATCCCTGA CCTGGAGGCC CACCGGGCCC TGGCCGAGGC ACTCCAGGCC 
CCGGAGTGCT ACCCGTGGCC GGTGGACACT GTCGAGTGTA TCGAGACCCA CATCTCCACG
GTGCTGCTCG CCGGGGCGTA CGTGGTCAAG CTCAAGAAAC CGCTGGACCT CGGCTTCCTG
GACTTCTCCA GCCTGGAGCG GCGACGCTAC TTCTGCGACG AAGAGGTTCG CCTCAACGGC
CGCTTGGCTC CGCAGATCTA CCTGCGCCGA GTGGCAATCG CCGGTCCGGC GACGGAGCCG
CGCATCGACG GCGAAGGGGA CGTGCTGGAG TACGCGGTGC TCATGCGCCG CTTCCCCGAG
AATGAGCTCA TGAGCCGTCT GCTGCGCGAG GGGCGTCTGC CCCATGGAGC CGTCGAGCGC
CTGGCTGAGA CTGTGGCGCG GTTCCATGCC GGGCTGCCGG CCGCCGGGGA GGATAGTGAA
TACGGGTCGC CGGAGGCCGT GGCCGATCCC ATGCGTGACA ACTTCCGCGC CCTGGAGACC
CAGTCCGCGG CCGCCTCGAT GCGCGGCGAA CTGCGTGCCC TGGAGCGCTG GACCGAGGCA
CAGCTGCAGC GGCTGGAGCC GCTGATCCGC CGGCGACGCG CCGAGGGCGC GGTGCGTGAA
TGCCACGGGG ACCTGCACTT GGGCAATGTC GCCTGGCATG AGGACGACCT GATCATCTTT
GATGGCATCG AGTTCAATCC GGCACTGCGC TGGATCGACA CGGCCAGCGA GGTCGCCTTC
ACCGTTATGG ATCTGGATTT TGAAGGGGCC CGTCGGCTTC GCCACTGCTT CCTGGACCGC
TACCTGGAGC AGAGTGGCGA CTACCAGGCG CTGCCGCTGC TGCCCCTGTA CGCCGTGTAT
CGGGCCCTGG TGCGGGCCAA GATCAACGGC CACGAGGTCG AACAGGGGGG TGGCAGTGCA
GCCCAGGAGG CGCTCTGCGA CCACGTCCAA CTGGCCAAGA GCTATACGGT GGCGCAGGTG
CCGGAGTTGG TCATTACCTA CGGGTTATCG GGATCCGGCA AGAGCGTACG CGCCCGCCGA
CTGGTCGAGG AGCGCGGGTT TATCCGGCTG CGATCGGATG TCGAGCGCAA GCGGCTGTTC
GGTTTGGAGC CCCGCGCGCG CTCGGACTCG ACCCTGGATA GCGGGCTCTA TTCGCCGGAG
GCAACGTGGC GGACTTACGA ACGACTGCAG GAGCAGGCCG AGGGCGCCCT GGAAGCCGGC
TTTTCGGTGG TGGTGGATGC CGCCTTCCTC AAGGCCGAGC GACGCCGGCC GTTCCTGGAG
CTGGCTGCGC GCACCGGATG TCGATTTCGG ATCCTGCATG TTCGGGCCGA CGAACAGACC
CTGCGGGAAC GCCTGCGCAA GCGGCTGGCC GAGGGGCGCG ATCCCTCCGA GGCCGATGAG
ACGGTGCTCG ATGCTCAGCT GCGTACAGCG CAACCGCCGT CCGGCGAGGA AGCGGCTTTC
GTCGAAACCG TCGACGCGGA CGGGTAA
 
Protein sequence
MTDRIPDLEA HRALAEALQA PECYPWPVDT VECIETHIST VLLAGAYVVK LKKPLDLGFL 
DFSSLERRRY FCDEEVRLNG RLAPQIYLRR VAIAGPATEP RIDGEGDVLE YAVLMRRFPE
NELMSRLLRE GRLPHGAVER LAETVARFHA GLPAAGEDSE YGSPEAVADP MRDNFRALET
QSAAASMRGE LRALERWTEA QLQRLEPLIR RRRAEGAVRE CHGDLHLGNV AWHEDDLIIF
DGIEFNPALR WIDTASEVAF TVMDLDFEGA RRLRHCFLDR YLEQSGDYQA LPLLPLYAVY
RALVRAKING HEVEQGGGSA AQEALCDHVQ LAKSYTVAQV PELVITYGLS GSGKSVRARR
LVEERGFIRL RSDVERKRLF GLEPRARSDS TLDSGLYSPE ATWRTYERLQ EQAEGALEAG
FSVVVDAAFL KAERRRPFLE LAARTGCRFR ILHVRADEQT LRERLRKRLA EGRDPSEADE
TVLDAQLRTA QPPSGEEAAF VETVDADG