Gene Haur_0044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0044 
Symbol 
ID5731916 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp53702 
End bp57784 
Gene Length4083 bp 
Protein Length1360 aa 
Translation table11 
GC content51% 
IMG OID641277165 
ProductTPR repeat-containing adenylate/guanylate cyclase 
Protein accessionYP_001542824 
Protein GI159896577 
COG category[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG2114] Adenylate cyclase, family 3 (some proteins contain HAMP domain)
[COG3899] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0143745 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGATTC AGCCCCCATC GTCAGAATTT GCCTCATTGT TGCGTCAAGT TCAACAGGCC 
GAGGTAATGC CCACCCAAAC GCAGCTCGAT GATGTGTATC GTCAGCTTGA CCAACGGATT
CAGCAATTGC AGCGCTACGT GCCCGACCCA ATTGTGCAGC GCTTAAAACA CGGCGAGTTG
CCCATTCCCT ATGGCGAACG TTGGTTAGGC TCGTTGCTCT TTGCCGATCT TTCGGGCTTT
ACTGCGCTTT CCGAGAGCCT AACCCGTTTG GGCAAAGAGG GAGCCGAGCA AGTAACCAGC
ATTATCAACC GCTTGTTTAA TCAGTTGGTA GCTGATATCG AGGCTCATGG CGGTGCATTA
ATCAAATTTG GCGGCGATGC AGTCACCGCC TTTTTTGATC AAACCAAACT TGGCGAGCAG
CATGCCTTGT ATGCCGCCCA CGCTGCCCAA GCTATGCAAA CATCAATGGC AACCTTAGGC
AAAATCCAGA TTCGGGCTGG TCAATTTAAT CTCACTTTGC GCATTGGCGT GCATAGCGGC
GAGGCTTTTG CTGCTTTGCT GGGCGATCAA CAGCATGCCG AGTTGATGCT GACTGGCATT
GCGGTTAACT TGGTGGCGCG TGCTCAAGAG TTGGCGCGAC CTGGCGAGGT GGTTTGTTCC
AAAGAAACCA TGCGGTTGCT AGGCTTGCCG ACCGATCCCA TGAATCCATT TACGGCCTTA
ACTACTGCGC TTGAACATCC GCCCTATCAA TCAACCAGTC AAGCGCCAAG TATCCCGCAA
ACTATCACTT TAGCCGATGT TCAAGCCCTT GCCAGCATCT ATGCAGGCAT TCAGCGTTTG
CTGCCCCAAC GCATGAGCGA AGAACAACTG CTCAGCAACG CCGCCAGCAG AACTGGTGAA
ATTCGGCTGG TCACAATTGT TTTTGCCCAT ATTGCTCCCT TCAGCACGAT TGTTGAATTG
TGCAATGATG CTAGCGCAAC TCAATTGCTC AACCACTACT ACCAACGCAT GCAGCAAATC
GTCAATCACT ATGGCGGGGT GGTCAATAAG CTCGATATGG CTGCCGATGG CGATAAATTG
CTGGCGATCT TTGGCGCACC GATTGCCAAC GAAAATGATA CTGAGTTGGC AGTACGAACT
GGCCTTGATA TGCAATCGGC CATGAACGAA ATTAATGCCT TGATTCAGCA GCACGACCCA
CGTTTGCCCT TGCTTGACCA ACAAATTGGA ATCAATCAAG GCCATGTGTT TGCGGGCATT
GTCGGCAGCG AAACCCGTCG CGAATATACC GTGATGGGTG ATCCAGTCAA TACTGCTGTG
CGTTTGATGA GCGTTTGTCG CTATGGCGAA GTGCTGGCCA GCCCCACGAT CAAGCGGCTT
ACCAGTCATG CCTTTGCCTA CGAAACCCTG CCAGCCTTAC CACTCAAAGG CAAAAGCCAA
CCAATCGAGC CAGCGCGGGT CGATCGGGCC TACAATGTGC GCCGCGAAAC CATGCGAGCC
GATTTGGTTG GGCGGGTGCA AGAACTCGAA CAATTAGCCA CAATCAGCAT TCAAGCGCTG
CAAGGCCAAG GCCAAATTAT CAATATCTTC GGTGAGGCCG GCATTGGCAA ATCGCGCTTG
CTCGAAGAGT TGCTCAGCCG TTTGTCGTTG GCTTCGTTCG ACCCAAACTT GAACGTGCCC
GAATTTATGC CAATTACGAT TGAATGCCAG TTGTATGAGC AAACGACTCC ATTTGCTTCA
GCCAGCGAAG TCTTGCGCCA AGTATTGCGC TTGAGCAGCA GCCTGAATGG CGAACGCTTG
GTGCAAGTCA TCAGCCAACG AGTCAATCGG CTTGTGCCCG AGCTAGAGCG CTTTTTGCCC
TTGCTCAGCG ATATTTTGCA TGTAAGTTTG GATGAAAACG AACTAACCCA AGCCCTAGCG
CCCGAACAAC GCCATGATCA AACGATTGAA TTGGTAGTAG CCTTGATGTT AGCCACTGCC
AACACCACGC CACTGGTGAT TTTATGGGAT GATGCCCACT GGATCGATGC CTCATCGCGC
CAATTGCTTG AACGTTTGGC CAAGCATGCG GATCAGGCCG CGATCGCCTT ACTGATTGGT
TCGCGCACCA AGGGCTTAAG TGCCATGACG TGGCCTGAGC AAACAGTGCA TCTTGAGTTA
AGCGAATTGA GCATGCCCGA AAGCGAAGCC CTTGCCAGCG CGATTATTGG CCATGCACCG
TTGCCAGCAG CCTTGCTCGA ACGCTGGCGA CGCAGCGACG AACGCTTTTT CAACCCACAG
GGCAATCCAT TTTTTATCGA AGAAATGATG CGTAGCTTGA TTCAACAAGG TGTGATTGTT
GAAACTGCGG CGGGCTGGGA TTTACGTGGC GAACTCGATA GCTTGCCAAC CACGATTGAA
GGCGTGATCA CCGCACGCTT GGATCGGCTG GATAATCGGA TTCGCGAAAC GGTGCAAGTT
GCTTCAGTGG TTGGGCGACG CTTTGAACTG AATATTTTGC GGGGCATTAC CAACGACGAC
GATTTAGTTA ATCAAATGGA TCGCTTGACC AATGCTGATG TGGTGTTGCC CGACCATATT
GCTGCCGAAC TAGCTTATTT ATTCAAACAT ACATTAACTC GCGATGTGGC CTACGAAGGC
ATTTTGTATG CCCGCCGCCG TGAGTTACAT CGACGCGTAG CTGCGCGGAT TCAAGAAGTG
CATCGCCAAA ATCTAGTTGA TGTTCTGCCG ATTTTGGCTC GCCACTATGC CCGTGCAGAG
GATTGGCACG AAGCAGTACG CTATTATCGC GAAGCTGGCA TCGCCTCGCA AAAACGTTAT
TACAATGCCG AAGCCTGCGC CCATTATCGT GATGCCCTTG AGCTATTGCC AAATCTGAGC
GCCTTTGATC GCTCGTTGGA GCAAGAACTA ACTGAACGCT TGGGCTATGT AATGATGCAA
GCTGGTGATT ATGCTGAAGC CCTGCCAATT CTCGCTCAAG CCCAAATTCA ATTACGCCAA
AGCAGCGTTT ATAGCAGTAG CCGCGAAGCA CGGATTTTGC GCCATATCAC CACAATTTAC
GAGCGGCGCG GCGAATACGA ACCAGCCTTC GAGCATCTGC AACAAGCACT CAATTTGTTA
GGCGAAAGTC AATCGGCGGA GCGAGTACGG GCCTTGTTGC TTGGCTCAGG GCTGCATCAA
CGCCAAGGTC GTTATCATGA AACAATTACC TGGGCCGAGC AAGCCTTAGC CATTGCCGAA
CAATTAAATG CCCAACCAGA ACAAGCCCGT GCCTATTTGC TAATTGGCGG AACCCATCGC
GTGCTTGGCT CGCGTGAGCA GGCCGTCGGC TATTTGGCCA AATCGATCGA GCTGTATCAG
GCGGTCAACG ACATTTCGCG TTTGGCCGAT GCCTACAACA ATGCCGCGAT CAACTTTAGC
GATCTTGAAC AATGGGAGGA ATCAATCGCC TTATATCAGG CTGCCTTGAA CATCAAAAAG
ACCATCAACG ATAGTTATGG TCAAGCGCTC GTTAGCCTCA ACCTTGGTGA GTTATATCGC
AAGACTCAGC AATTTGACCA AGCTTTAGAG ACCTTCAACC AAAGTTTACA GCTTTGGAAC
AAGCTTAACT CCAAACTTGG AGCCGCCGTC ACCCAGATGA ATATGGGCAA CACGCTGTTT
GCTCAAGGCC AAAGCGCCAC CGCTGAATCC ATGCTCGATC GCAGCCGCGC TATTTTTGAT
GAAATCCATG TCGATTCATT TTTACCCGAG CTGTATCGGA TTTATGCTGA ACTTTTTTAT
AGCCGCCACA ACTATGCCAA AGCTTTAGAT TATTGTGATT TTGCCTTGGA ACAGGCTCGC
CAACATTCAG CCAAAGCCGA TGAAGGCCTC GCCCAACTGA CTCAAAGTAA GATTCTGCTG
GCCTTAGACC ATTATGATCC AGCTCTACAT GCTGCGGAAC AAGCACTCAA GTTGTTGCAA
GAATGCGATA ACCAAGCTGA TGTTGAACGC TGTTTGGAGC AACTGATCGC TCTGACCCAG
ACTAACGATC CGCAACGCAT GGCTGATTAT CAGAATCAAC TAGACCAGTT AACCAAGGCT
TAG
 
Protein sequence
MSIQPPSSEF ASLLRQVQQA EVMPTQTQLD DVYRQLDQRI QQLQRYVPDP IVQRLKHGEL 
PIPYGERWLG SLLFADLSGF TALSESLTRL GKEGAEQVTS IINRLFNQLV ADIEAHGGAL
IKFGGDAVTA FFDQTKLGEQ HALYAAHAAQ AMQTSMATLG KIQIRAGQFN LTLRIGVHSG
EAFAALLGDQ QHAELMLTGI AVNLVARAQE LARPGEVVCS KETMRLLGLP TDPMNPFTAL
TTALEHPPYQ STSQAPSIPQ TITLADVQAL ASIYAGIQRL LPQRMSEEQL LSNAASRTGE
IRLVTIVFAH IAPFSTIVEL CNDASATQLL NHYYQRMQQI VNHYGGVVNK LDMAADGDKL
LAIFGAPIAN ENDTELAVRT GLDMQSAMNE INALIQQHDP RLPLLDQQIG INQGHVFAGI
VGSETRREYT VMGDPVNTAV RLMSVCRYGE VLASPTIKRL TSHAFAYETL PALPLKGKSQ
PIEPARVDRA YNVRRETMRA DLVGRVQELE QLATISIQAL QGQGQIINIF GEAGIGKSRL
LEELLSRLSL ASFDPNLNVP EFMPITIECQ LYEQTTPFAS ASEVLRQVLR LSSSLNGERL
VQVISQRVNR LVPELERFLP LLSDILHVSL DENELTQALA PEQRHDQTIE LVVALMLATA
NTTPLVILWD DAHWIDASSR QLLERLAKHA DQAAIALLIG SRTKGLSAMT WPEQTVHLEL
SELSMPESEA LASAIIGHAP LPAALLERWR RSDERFFNPQ GNPFFIEEMM RSLIQQGVIV
ETAAGWDLRG ELDSLPTTIE GVITARLDRL DNRIRETVQV ASVVGRRFEL NILRGITNDD
DLVNQMDRLT NADVVLPDHI AAELAYLFKH TLTRDVAYEG ILYARRRELH RRVAARIQEV
HRQNLVDVLP ILARHYARAE DWHEAVRYYR EAGIASQKRY YNAEACAHYR DALELLPNLS
AFDRSLEQEL TERLGYVMMQ AGDYAEALPI LAQAQIQLRQ SSVYSSSREA RILRHITTIY
ERRGEYEPAF EHLQQALNLL GESQSAERVR ALLLGSGLHQ RQGRYHETIT WAEQALAIAE
QLNAQPEQAR AYLLIGGTHR VLGSREQAVG YLAKSIELYQ AVNDISRLAD AYNNAAINFS
DLEQWEESIA LYQAALNIKK TINDSYGQAL VSLNLGELYR KTQQFDQALE TFNQSLQLWN
KLNSKLGAAV TQMNMGNTLF AQGQSATAES MLDRSRAIFD EIHVDSFLPE LYRIYAELFY
SRHNYAKALD YCDFALEQAR QHSAKADEGL AQLTQSKILL ALDHYDPALH AAEQALKLLQ
ECDNQADVER CLEQLIALTQ TNDPQRMADY QNQLDQLTKA