Gene Hhal_0700 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0700 
Symbol 
ID4710752 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp783214 
End bp785154 
Gene Length1941 bp 
Protein Length646 aa 
Translation table11 
GC content73% 
IMG OID639855163 
Productdiguanylate cyclase/phosphodiesterase with PAS/PAC sensor(s) 
Protein accessionYP_001002284 
Protein GI121997497 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain
[COG2200] FOG: EAL domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.522723 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCACGG AGCTCGAGGT CTTCATCCGT CGGCGCGCCA ATGAGCAGGG GCTCTCCCTG 
GCCGAGGTGG CGCGGCGGGC CGGGATGAGC CGGCAGTCGC TCTACGATTG CTGGTCGCGG
GATGGCTACC CGAACCTGGC CACCATCGTC GAGCTGGCCG AGGTCCTCGG GGTGCACCCG
CTGCGGCTCC TGGAGCTGCA GTTCCCCGAC GACGGCGCGG CCGAGGATCA CTGCCACACC
CTGCGTTGGA ACCTGGACCG GGCCGAGCTG GCCACGACGC TGCTTGAGCA CACGCCCCAG
GCCGTTATCG GCCTGGACGC CGATGGGCAG GTCGCGCTGT TGAACCCGGC CGCCGCCGAG
CAGCTGCAGC GCCCCCCGGA GCGAGTGCTC GGCCAGCCGG CCGAGCGCAT CCTGCCCGGC
GAGGCGCTGG GGCGCATCCG CGCGCAGTGC CCGGTGATGA GCGGCGCACC GACTCAGCCG
CACGCCACGC CCCGCGGCAG CCTGCGGCTG AGCACCCGTC GCGGCGAGGA CGGCAGCCTG
CTGCAGACGC GCGCGCAGTT GGTCGAGGCG CCGCTCTACG AGCGTAATGT GGTCCTGCTG
CTCCTCGACC CCGGGGATCC CGCCAACGGG ACGGCGGCCG AGGCACCGCC CCCCGCCTTC
GACCGCGAGC CGCTCACCGG CCTCCCCGAC CGAGAGTCGA CCGAGGCGCT TCTGCGCTGG
GCGATGCAGC GCACCGAGTC GCGGGGCAGC GAGCTGGCGG TGCTGGTCAT CGAGCTCGAC
GGCTTCGACC GGACCGAAGC CGGGCTCGGC CCGGAGATCG CCGACCGGGT GCTGCACACC
GCAGCCCAGC GGGTCGGCGG TCAGCTGCGC CACGCCGATC TCCTCGGCCA CCTGCAACGG
GGCCAGTTCA CCGCCATCCT GCCGGATACC GGCTCGCGCA GCGGCGCCCT GCAGGTGGCC
CAGCGCATCT GCCAGAGCCT GGAGCATCCG GTGGAGACCG GCTCCTGCGC CTGTCAGTTG
CAGCCGCGTA TCGGCATTGC GCTGTTCCCC GAGCACAGCC ACCACGCCGT CACCCTCCTC
GAACAGGCCC GCTCGGCCGC CACCCGCACG GCGGGAGACC ACCAACCGGG GCAGATCGTG
GAGCCGGACC TGGGCGACCG GGTGGCCCGG CGCGTGGCGC TGATCCGCGA CCTGCGCCGG
GCCCTAGAGC GGCGCAGCCT GTGCGTGCAC TACCAGCCGG TCTTCGATCT GGCCAGCGGC
GAGCTGACCG CCCTGGAGGC GCTGCTGCGC TGGCCCGAAG GCGACACCTC CCAGCCGTCC
ATCGGCGAGG TCATCGACGC CGCCGAGCAG GCCGGGCTGC TCGGTGAGCT GGACCGCTGG
GTGGTCGAGC AGGTGTTGTC GGACCTCGCC CACTGGCAAT CCCAGGGCTG CTGCGTGCCG
CGGGTGAGCG TCAACGCCTC GGGGCGCACC CTGGGCACCG GGGCCCTGGA GCCGGCGGTG
CTCAGCAAGC GCCTGGAGCG CCTCGGACTA CCGCCGGGCA CACTGGACAT CGAGCTCTCC
GAGCGCCACC TGCTCGACAC CGGCGGCGCC GGCCACGACG GCCTGACCCA TATCCGCGAG
CACACCCTCG GGGTCACCAT CGACCACTTC GGCACCGGCT ACGCCTCGCT GATCTCCCTG
CGCGACCTGC CGGCGGGACG ACTCAAGGTG GACCAGACCC TGATCCGCGA CCTGCCCACG
GACCCGGATC AGCAGGCCCT GGTGGCCAGC ATCGCCCGCC TGGGCGAACG CTTCGGCCTG
GAGCTGGCCG CCGAGGGGGT GGAGACCCGG CAGGAGGCGG AACACCTTCG CCAGCTGGGC
TTCACCGAGG CGCAGGGCTA CCACTTCGGC TACCCGGCCC CGGCGGCGGC GGTGTCCGAG
CACCTGGCGA CCGCGCCCTA G
 
Protein sequence
MSTELEVFIR RRANEQGLSL AEVARRAGMS RQSLYDCWSR DGYPNLATIV ELAEVLGVHP 
LRLLELQFPD DGAAEDHCHT LRWNLDRAEL ATTLLEHTPQ AVIGLDADGQ VALLNPAAAE
QLQRPPERVL GQPAERILPG EALGRIRAQC PVMSGAPTQP HATPRGSLRL STRRGEDGSL
LQTRAQLVEA PLYERNVVLL LLDPGDPANG TAAEAPPPAF DREPLTGLPD RESTEALLRW
AMQRTESRGS ELAVLVIELD GFDRTEAGLG PEIADRVLHT AAQRVGGQLR HADLLGHLQR
GQFTAILPDT GSRSGALQVA QRICQSLEHP VETGSCACQL QPRIGIALFP EHSHHAVTLL
EQARSAATRT AGDHQPGQIV EPDLGDRVAR RVALIRDLRR ALERRSLCVH YQPVFDLASG
ELTALEALLR WPEGDTSQPS IGEVIDAAEQ AGLLGELDRW VVEQVLSDLA HWQSQGCCVP
RVSVNASGRT LGTGALEPAV LSKRLERLGL PPGTLDIELS ERHLLDTGGA GHDGLTHIRE
HTLGVTIDHF GTGYASLISL RDLPAGRLKV DQTLIRDLPT DPDQQALVAS IARLGERFGL
ELAAEGVETR QEAEHLRQLG FTEAQGYHFG YPAPAAAVSE HLATAP