Gene Haur_4216 details

Gene Information

Locus tag: Haur_4216
Symbol:
ID: 5736070
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5371329
End bp: 5373482
Gene Length: 2154 bp
Protein Length: 717 aa
Translation table: 11
GC content: 47%
IMG OID: 641281371
Product: diguanylate cyclase
Protein accession: YP_001546976
Protein GI: 159900729
COG category: [T] Signal transduction mechanisms
COG ID: [COG3605] Signal transduction protein containing GAF and PtsI domains
        [COG3706] Response regulator containing a CheY-like receiver domain and a GGDEF domain
TIGRFAM ID: [TIGR00254] diguanylate cyclase (GGDEF) domain
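
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


gene_len = feature_length(5371329, 5373482)
print(gene_len, protein_length(gene_len))  # 2154 717
```

Under these assumptions the reported gene length (2154 bp) and protein length (717 aa) agree with the listed coordinates.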


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCAAGC GACGATCCGT GCTACGGATC TCCTCGCGGC AACGAACTCA CTTTCAAGCC 
AATCCTTGGC GAATTCGGCG CATGATCACT GAGTTGCATG CGCTACACGA GATTCGCCAC
GCGATTAACC AGGCCTTGGA ATTGGATACG GTGCTTGAGG AGATTTATAG TCAAACATCG
CGATTAATGG ATACCCGCCA ATTTTACTTG GCGTTATACT CGCCTGAACA AGAATTGATC
GAGTTTGCCT TGGCAGTTGA AAATGGCCAG CGCGTCTCTT GGCCCGCCCG CCCCTATGCC
AATGGTTTAA CCGAATGGAT TTTGCAACAT CGCCGCCCCC TGCGGATCAA CGCCCATAAC
CTTGATCCTG CGGTTGCACC GATTATTGTT GGGCAGCCTG TCCAATCATT TTTAGGGGTT
CCAATTGTGC TTGGTGAGCA GGTTCTTGGG GTTATGAGTG TGCAATCGGT CGAACGAGCC
GAGGCCTATT CTGCCCACGA TGAACAAATT TTGACCATGA TCGCCGATCA GGCCGCTGTC
GCGATTGCCC ATGCCAAGCG CTTTGCCGCC GTTGATCAAC AGTTGCAAGC CCGCGTTGCT
CAACTTGAAA CCCTCGAAGC AACCATCCGC GATTTAAATG AAACCCTCGA TGTTGATACA
ATTCTGAATC GCTTACTTGA CCGTATCCAG CCGATTATGC ATGCTGATGC TGGGATGGTT
TGTTTGCTTG ATACCGAACA GCAACGGATG TTTATCCGCG CAATTCAGGG CTATCCAGCT
GAAGTTCGGC GTTTTCAATT TGAAGGTTGG CCAATTGATC GTGGCATTGC GGGCTTAGTT
GCACGTACCG GTCAACCTGA TTGGACTGCT GATATTCGTA ATTCGAGTTA CTATGTTAAT
TCGCGGCCAA CTACATTGAG CCAAATGACT GTGCCAATTG TGCATAGTGG CAAAGTTCTA
GGTGTAATGA TTCTCGAAAG TGATCGAGTT GGCACATTTA ATGATGATAC GACGCGCTTT
GTCAGCCAAA TTGCTGATCA TGCGGCGCTT TCAATTCATA ATGCGCGTAT TCATCAGCAA
GCAGTTGAAC AACAAGCGTT GTTGGCTCAA CGCTCACAAC AACTCAACGA AGTGCTGCGT
ATCAGCCAAG CATTAAGCGC CAACCTTAAT TTAAATGATC TGCTGCCAGA AATTGTTCGA
GCAATTCAAG CAAGTTTGGG TTTCAATATT GCCTTGCTGA GTTTAGTTGA TCAAGAACGA
CCAACCTTTA TGCGTCGCCG TGCCGTGGTT GGTGTGCCTG ATGAACGCTG GTTTGAATTA
CGTGATCAAT TAGTGCCAAT TGAATGGTAT CGCAGCGTAA TGCGCGAAGA ATTTCGGATT
AGCCGCTCGT ATTATATTCC GCATAGCCAT ACCAGCTATA CCGCCATTTG GGGCAATAAC
AGTGATACGT ATCGACCTGA TTTAGGTGAG CGTCGTCCAC ATGAATGGCA TCAAGATGAT
GCCTTGTTCG TGCCATTGTA CGATTCTGAT AATAATTTAA TTAGTATTCT CTCAGTTGAT
GATCCACGTG ATCGGCGGAA ACCATCGTTT GAATCGGTGC AGGTACTAGA AATTTTTGCC
ACTCAAGCAG CAATCGCGAT TGAAAACGCC CATTTATATA CAATTACCCA ACAATTAGCC
ATCACCGATG GCTTAACTGG CTTGTTCAAT CAACGCCATT TTATGACGAT GCTTGATCGT
GAAGTGGCCT TGGCCTATCG CTATAATTAT CCATTATCGT TATTAGCTTT GGATATTGAT
TATTTCAAGC AATATAATGA TAACTATGGC CACTTGGTTG GCAATGTGCT GCTGCGCGAT
TTTGCCCGGC TCATTTGCGA AAATGTACGC GATGTTGATA TTGTTTCGCG CAACGGTGGC
GAAGAATTTA CAATCATTTT GCCCAAAACT GATCAAGCAG GTGCAGTGTT GTTAGCCGAA
CGGCTGCGAG TTCGCACTGC CGAACATCTT TTTGGTCAAG GCCATATTAC GGTTAGCGTC
GGCGTTGCCA CCCTCAACAA CAATTGGGAT GCCCACACGT TGCACGATCA AGCTGATCAA
GCGCTTTATC GCGCCAAAAA TTCTGGGCGT AATCTGGTTA TCAGCGTCCC ATAA
 
Protein sequence
MTKRRSVLRI SSRQRTHFQA NPWRIRRMIT ELHALHEIRH AINQALELDT VLEEIYSQTS 
RLMDTRQFYL ALYSPEQELI EFALAVENGQ RVSWPARPYA NGLTEWILQH RRPLRINAHN
LDPAVAPIIV GQPVQSFLGV PIVLGEQVLG VMSVQSVERA EAYSAHDEQI LTMIADQAAV
AIAHAKRFAA VDQQLQARVA QLETLEATIR DLNETLDVDT ILNRLLDRIQ PIMHADAGMV
CLLDTEQQRM FIRAIQGYPA EVRRFQFEGW PIDRGIAGLV ARTGQPDWTA DIRNSSYYVN
SRPTTLSQMT VPIVHSGKVL GVMILESDRV GTFNDDTTRF VSQIADHAAL SIHNARIHQQ
AVEQQALLAQ RSQQLNEVLR ISQALSANLN LNDLLPEIVR AIQASLGFNI ALLSLVDQER
PTFMRRRAVV GVPDERWFEL RDQLVPIEWY RSVMREEFRI SRSYYIPHSH TSYTAIWGNN
SDTYRPDLGE RRPHEWHQDD ALFVPLYDSD NNLISILSVD DPRDRRKPSF ESVQVLEIFA
TQAAIAIENA HLYTITQQLA ITDGLTGLFN QRHFMTMLDR EVALAYRYNY PLSLLALDID
YFKQYNDNYG HLVGNVLLRD FARLICENVR DVDIVSRNGG EEFTIILPKT DQAGAVLLAE
RLRVRTAEHL FGQGHITVSV GVATLNNNWD AHTLHDQADQ ALYRAKNSGR NLVISVP
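
The gene and protein sequences above are related through translation table 11. As a rough sketch of how that relationship and the GC content field could be checked, the code below builds a codon table and translates a space-separated sequence block. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and it assumes translation begins at the first base and stops at the first stop codon. The helper names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace (the page groups bases in blocks of 10) and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
print(translate("ATGACCAAGC GACGATCCGT GCTACGGATC"))  # MTKRRSVLRI
```

The ten residues obtained from the first 30 bases match the start of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the Protein Length and GC content fields.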