Gene Haur_1214 details

Gene Information

Locus tag: Haur_1214
Symbol:
ID: 5733107
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 1398604
End bp: 1401996
Gene Length: 3393 bp
Protein Length: 1130 aa
Translation table: 11
GC content: 50%
IMG OID: 641278354
Product: adenylate/guanylate cyclase
Protein accession: YP_001543990
Protein GI: 159897743
COG category: [R] General function prediction only
COG ID: [COG3899] Predicted ATPase
TIGRFAM ID:
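
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(1398604, 1401996)
print(length)                     # 3393
print(protein_length_aa(length))  # 1130
```

Both results agree with the Gene Length and Protein Length values listed in the record.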


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCGCCA ACCCTCAGAT CAGCAGCTTT CTGCCACGCT ATGTCCTGCA ACGGCTCATC 
GAAGAGCATC AACCGCGCCA GCCGCTCGCC GAAGATCAAC TGGCGACGGT GTTGTTTGCC
GATTTTTCTG GGTTTAGTCG CCTCAGCGAG CAATTTGCCT ACGACCAAAC CACCATCCTC
GAATATATTT CCGATGTACT GAACGACTCA TTTAGCCAAT TAATCGACAC CGTGATTGCC
CATGGCGGCG ATGTGGTGAA GTTTGCTGGC GATGCATTAA TTGCAATTTG GATGGCCGAC
GATCTTGATC AATTAACTCA AAATACCCAG CTTGCGGCTC AATGTGGCCT AGCGATTCAA
GCAAGCTTTA ATAATCCTAG TGAATCTGAG CGCTTGGCGA TTCGGGTGCA AATTGGGGCT
GGCTCGATTT CAAGTTTTAT CGTGGGTGGC GTGAATAATC ATTGGGAGCA ATTGATTACG
GGCGAGGCGT TGTTACAAGT GCATTTGCTT GGCTCGCAAA GCTTTGCTGG TCAAGTGATC
GTTTCGCCCG AAGCGCGTTC GTTAATTCGC GAATATGGGC TGGGCGAAGA TTTACGTGCT
CAGGCTTTTC GGCTGACTGC AATGTCCGAA AATCTGCCCT TGCCAATGAT GAACCAAACA
CTGCCCGAAT ATCCAACTGA GCAGTTGACT CCGTTTATGC CTACCGCGGT GATTGCTCGC
ATTAATGCAG GTTTGCACGA ATGGCTGGCT GAGCTACGCC ATCTGAGCGT GATGTTTATT
AATGTGCCAG CTTTGAGCTT TTTTACCACG TTAGAGCAAG TGCAAGCCGC AGTTAGTGCT
TTGCAAACCG TTATTTTTCG CTACGAAGGC AGCATCGACA AACTATCGGT CGATGATAAA
GGCGTAAGTT TGCTAGCGGC TTTTGGTTTG CCGCCACTCT CGCATCGCGA TGATGCTGAA
CGAGCGGTGC GCTCGGCCTT AGAAGCCTCA GCCGCATTGA GCAAACTGAG TTTAACCCAC
ACGATTGGCA TTGCCACTGG CCCAGCCTTT TGCGGCGAGA TTGGCAATAG CCAGCGCCGC
GAATTTACCA TGATTGGTGA TGTCGTCAAT CGGGCTGCCC GTTTGATGGA AGCTAATCTT
GCACCAATTC TCTGCGATCA AACCACCGCC CAAGCTAGCC AACAACATGT GCGCTTTCAA
GTACTACCGC CAATTACGAT TAAAGGCAAA AGCCAGCCAA TTACGATCTA TCGCCCGCAA
CAAACCAACC ATAGCCCCGA CCAAATTCGG CCTCGACGCT TGATTGGGCG ACGGCGCGAG
CGTGGTCAAA TTGAGGCGCT ATTCAATCAA ACCCAACCAA GCAGTCAGCA TGTGGCGATC
ACTGGCGATA GCGGTATGGG TAAATCGGCC TTGCTGTATG AGGCCGTTGA AATTGCTCAG
CATTACGAAC GCCAAGCTAT GTTGATTACC AGCAGCAGCC TGCGTCAATC GGCTCAGTAT
CCAGGCTGGC GTTTGTTGCT TGAGGCCTGT TTGGCGGTTG AGCAATGGCC GAATCAGGCG
GTTCAATCTT ATCAATTAAT TATTCAGCGT TTGCAATTGC CTGACCAGCT CGCTAAATCG
GTGCACTTGC TCCACGATGT TTTAGAGCTG CCAGGTCGCG CCGAAGCTAA CACCACCGAT
CATGCCCAAC AGGTGCAAAT CATCCATGAG TTAATTGGGC ATGCCCTCTA TCAACTGCAT
GCTCAACGCC CGTTAGTCTT ATGCATTGAT AATTTGCAAT GGTTCGATTC ATTAGCCTTG
GCGACGCTGG AATTATTGTT GCAGCAGCAT GACGACATTA TTCTGATTAC AACTGCGCCG
TCTGCTGTGG CATGTCTCGA ACCACAACAA ACCATCCATC TGCAAGCGCT TGATCCTGTG
GCTTGTATCG CGGTGGTTGC TCAATCATTG GGTGTGCAGG CGATTCCTCC AAGTGTGGCT
TTATTTATCA ATCAGCGTGC GGCAGGCCAT CCGTTGTGGA GCATCGAGCT AGCCCAAGCC
TTGCGCAATG CAGGGATGAT TCGAGTCAAC AATGGGGTTT GCAAGCTTGA TCAATTTAGC
CAGTTAGAAA AACTCAATCT CCCAAGCACA ATTCAGGGTG TGCTGGTCAG CCGCATCGAT
CAACTGCCGC CGCAACCCCA ATTAACCCTC AAAATTGCTA GTGTGATCGG CCATGATTTT
AGTTTGGCAG TGCTTGATGC GATTTATCCA GTGGCCCACG AGCGTGAGCA TATTCCAGCT
CACCTTGACT TATTGCTGCA ACAAGGTTTT ATTCACGAGG CTGCCGAGGG CTATCAATTT
AGCCAAGCAA TTATCCACGA TGTGGCTTAT TCATTGTTGT TGTTTGGTCA ACGACGAGCC
TTGCATCGAG CGATTGCTGA ATGGTACACC CGCGAACATC CCCATTTGGT TGAGGCAGGC
AGCAGCCTTT TGGCCCATCA CTGGAGCCAT GCGATCGATC CTGATGAGCC AGAAAGCCGT
CAACCTGCGA TCGATGCCTT ACGGCGAGCA GGCGAACAAA CCCTTGTGCG CTGTAGCTAT
CGCGAGGCGA TTCCGTTTTT CGAGCGAGCC TTGCATTTGT TGGCAATTGA TGATGATTTT
GCCTCGCAGC AACAAATTGT GCGCATGCAA TTTAATCTGA GTCAAGCCCG CTGGCGGTTG
GGTGACCACA GCATGGCCTT GACCAATTTG GATGCAGCAT TGGTAACCGC GCAACAAATT
GGCGATGGGA TTGGCGAGGC CGATGTGCTG CGCCAATTTG GCAATATTGC CTATGTCCAA
GGCGACCTTT ACACTGCCCA ACAACATTTT CTAGCCAGCG TCGCGCGAGC ACGCAAGGCC
AATTATCCCA GCGGAATTAT TAGTGGGGTT AGTAATGTCG GCGTGGTGGC GTTTGCACGG
GGCGATTATC AGGTTGCCCG CGAAGCTTAT CGTGATGGTT TAGCAATTTC GATCGAGCAA
GGTCACGATT TTGGCATCGC CGTCAATCAG CTCAATTTAG GTGGTTTGGC GATTGTTGAA
CAAGCTTGGG ATGAAGCACG CTCCTATTTG CAGCAAGCAT TGAGCTTGGG CTATGCCAAA
CACATGACTT TAGTCTGTCT GCATAGTTTG GTGGCCTTGG CTGAATGGCG TTTAGCGACC
AATCAAGCCG AGGCCAGTGC TAACTTGATT CAGATTGTGC TGCATCACGA AGCGATCGAT
AGCGAAATTC ATGCAGCAAT TGATAAACTC AAACCCAAGC TGATTCAAAT TTTAGGCGAG
ACCCAATGGC TGATTCTGAG CCAACGCCCA ACAACCCCGT TTGAACAAGT ACTGCCTGCG
ATTATGCAAG AATTGGCTAC TGAAAAAGCC TAA
 
Protein sequence
MAANPQISSF LPRYVLQRLI EEHQPRQPLA EDQLATVLFA DFSGFSRLSE QFAYDQTTIL 
EYISDVLNDS FSQLIDTVIA HGGDVVKFAG DALIAIWMAD DLDQLTQNTQ LAAQCGLAIQ
ASFNNPSESE RLAIRVQIGA GSISSFIVGG VNNHWEQLIT GEALLQVHLL GSQSFAGQVI
VSPEARSLIR EYGLGEDLRA QAFRLTAMSE NLPLPMMNQT LPEYPTEQLT PFMPTAVIAR
INAGLHEWLA ELRHLSVMFI NVPALSFFTT LEQVQAAVSA LQTVIFRYEG SIDKLSVDDK
GVSLLAAFGL PPLSHRDDAE RAVRSALEAS AALSKLSLTH TIGIATGPAF CGEIGNSQRR
EFTMIGDVVN RAARLMEANL APILCDQTTA QASQQHVRFQ VLPPITIKGK SQPITIYRPQ
QTNHSPDQIR PRRLIGRRRE RGQIEALFNQ TQPSSQHVAI TGDSGMGKSA LLYEAVEIAQ
HYERQAMLIT SSSLRQSAQY PGWRLLLEAC LAVEQWPNQA VQSYQLIIQR LQLPDQLAKS
VHLLHDVLEL PGRAEANTTD HAQQVQIIHE LIGHALYQLH AQRPLVLCID NLQWFDSLAL
ATLELLLQQH DDIILITTAP SAVACLEPQQ TIHLQALDPV ACIAVVAQSL GVQAIPPSVA
LFINQRAAGH PLWSIELAQA LRNAGMIRVN NGVCKLDQFS QLEKLNLPST IQGVLVSRID
QLPPQPQLTL KIASVIGHDF SLAVLDAIYP VAHEREHIPA HLDLLLQQGF IHEAAEGYQF
SQAIIHDVAY SLLLFGQRRA LHRAIAEWYT REHPHLVEAG SSLLAHHWSH AIDPDEPESR
QPAIDALRRA GEQTLVRCSY REAIPFFERA LHLLAIDDDF ASQQQIVRMQ FNLSQARWRL
GDHSMALTNL DAALVTAQQI GDGIGEADVL RQFGNIAYVQ GDLYTAQQHF LASVARARKA
NYPSGIISGV SNVGVVAFAR GDYQVAREAY RDGLAISIEQ GHDFGIAVNQ LNLGGLAIVE
QAWDEARSYL QQALSLGYAK HMTLVCLHSL VALAEWRLAT NQAEASANLI QIVLHHEAID
SEIHAAIDKL KPKLIQILGE TQWLILSQRP TTPFEQVLPA IMQELATEKA
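
The sequences above are printed in blocks of 10 characters, so the spaces and line breaks have to be removed before any computation. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its codons translated. It uses only the Python standard library, and the helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one. Translation table 11 assigns the same amino acids to codons and differs from the standard table only in which codons may act as start codons, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Standard codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence listing above.
first_line = "ATGGCCGCCA ACCCTCAGAT CAGCAGCTTT CTGCCACGCT ATGTCCTGCA ACGGCTCATC"
dna = clean_sequence(first_line)
print(translate(dna))  # MAANPQISSFLPRYVLQRLI
```

The printed residues match the first two blocks of the protein sequence listing. Passing the full gene sequence to `gc_fraction` would give the value that the record rounds to a whole percent in the GC content field.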