Gene Hhal_0301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0301 
Symbol 
ID4711211 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp336617 
End bp338245 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content70% 
IMG OID639854761 
Productmetal dependent phosphohydrolase 
Protein accessionYP_001001897 
Protein GI121997110 
COG category[T] Signal transduction mechanisms 
COG ID[COG2206] HD-GYP domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGACGGAC TGAACCCACC GCCGCTGACT GAGCAGCGCC TGGCGCAGCT GCTGGAGGTC 
GGTGTCGCCC TCTCGGCGGA GCAGGACACC GATCGCCTAC TCGAACGCAT CCTCTCCGGT
GCGCGCGAGC TGACGGAGGC GGATGCCGGG ACCATCTACC GGGTCCACGA GGGGCAGCTG
CACTTCGACA CCGTCCACAA CGACACTCTC GGTCTCCACC TGGGCGGCAC GTCCGGGGTG
CCCATCGACT TCTCGCCCCT CCCCCTCTAC CTGGCCGACG ACACGCCGAA CACCCGCAAC
GTGGCCGCCT ACGCGGCGGT CTCCGGCGAA ACCGTGCGCA TCGACGATGC CTACCACGCC
GAGGGGTTCG ACTTCTCGGG CACCCGCGCG GTGGACGCGC GCACCGGATA CCGCTCGCAG
TCCTTCCTCA CTGTCCCGCT GCGCGACCAC GAACAGACCA TCATCGGTGT CCTGCAACTG
ATCAACGCCG TGCGCGAGGG ACAGACGGTC CCCTTCACCG GCCGCGATGC CCGGATTGCC
GAGTCATTGG CCTCCCAGGC GGCCATTGCC CTGACCAAAC AAGGCCTGAT CGACACCCAA
CGGCAGCTGT TCGAGTCGTT CACCGAGGTC CTCGCCCGGG CCATCGACCG CAAGAACCCG
ACCACCGGCC GCCACTGCGA GCGCGTCCCG CAGCTGACGC TGATGATCGC CGACGCCGCC
TGCCGCACCG GCGAGGGGGC GCTGGCCGAT TACCGCCTGA GCGCCGAGGA ACACGAAGAG
CTGCGCCTGG CCGCCTGGCT CCACGACTGC GGCAAGGTCA CCACCCCCGA GGCGGTGGTG
AACAAGGCCA CCAAGCTGGA ACGCCAGACC GATCGCATCG AAGAGGTCGC GACCCGTGCG
GCCGTGGTCC GCCAGGAGGC GGAGGCCGAA CGACTGCGCC GCCGGCTGGC CGCCCGCGAG
CAGGGCGTGG ACCGGGACGC CGAGCAGGCC GATGCCGACT ACCGCGCCCT CGTCCAGCGC
CTCGACGACG ACCTGGCCTT CCTGCGCAGC GCCAACATCG GCGGCGAGGC CATGGACGAC
GAGGCCTGCG AACGGGTCGC CGCCATCGCC CGCACCTACA GCTGGACGGA CGCCGCGGGC
GAGCGGCGCC CGCTACTCAG CGCCGAAGAG GTGGAGCACC TGCAGATCCG CCGCGGCACC
CTGAGCGCCG CGGAGATGGA GCAGATGCGC GACCACGTGC GGGTCAGCCG CGAGATGCTC
GAGCAGCTCA CCTACCCGCG CCACCTGCAG CGGGTGCCGG AGATCGCCTC GCAGCACCAT
GAGCGCATGG ACGGCGGCGG CTACCCCGAC GGCATCACCG GCGGGCAGAT GTGCCGGCGG
GCCCGAATGA TGGCGCTGGC CGACGTCTTC GAGGCCCTGA CCGCCGCCGA CCGCCCCTAC
AAGCCAGGCA AGAAGCTCAG CGAAGCGGTG CGCATCATGG GCTTCATGAC CCAGGACGGC
CACTTCGACC CGGAGCTGTT CGACCTCTTC ATCCGTGAAG GGGTCTACCT GGACTACGCC
CGCCAGTTCA TGCACCCGAG CGCCATCGAC GCGGTGGACG AGACCGCCAT CCCCGGCTAC
CGGCCGTAG
 
Protein sequence
MDGLNPPPLT EQRLAQLLEV GVALSAEQDT DRLLERILSG ARELTEADAG TIYRVHEGQL 
HFDTVHNDTL GLHLGGTSGV PIDFSPLPLY LADDTPNTRN VAAYAAVSGE TVRIDDAYHA
EGFDFSGTRA VDARTGYRSQ SFLTVPLRDH EQTIIGVLQL INAVREGQTV PFTGRDARIA
ESLASQAAIA LTKQGLIDTQ RQLFESFTEV LARAIDRKNP TTGRHCERVP QLTLMIADAA
CRTGEGALAD YRLSAEEHEE LRLAAWLHDC GKVTTPEAVV NKATKLERQT DRIEEVATRA
AVVRQEAEAE RLRRRLAARE QGVDRDAEQA DADYRALVQR LDDDLAFLRS ANIGGEAMDD
EACERVAAIA RTYSWTDAAG ERRPLLSAEE VEHLQIRRGT LSAAEMEQMR DHVRVSREML
EQLTYPRHLQ RVPEIASQHH ERMDGGGYPD GITGGQMCRR ARMMALADVF EALTAADRPY
KPGKKLSEAV RIMGFMTQDG HFDPELFDLF IREGVYLDYA RQFMHPSAID AVDETAIPGY
RP