Gene Haur_4106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4106 
Symbol 
ID5735967 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5243532 
End bp5245370 
Gene Length1839 bp 
Protein Length612 aa 
Translation table11 
GC content50% 
IMG OID641281260 
Productadenylate/guanylate cyclase 
Protein accessionYP_001546866 
Protein GI159900619 
COG category[T] Signal transduction mechanisms 
COG ID[COG2114] Adenylate cyclase, family 3 (some proteins contain HAMP domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCTTA AGCCGTTTCG CCTAAAACAA TCGACTATCG TGCCACTAGG ATTGGCTGAG 
GTTTGGGAAA TTGCGGCTCA AACCAGCAAA GTTAACCACG CCGTTGGCCT ACCACCATTA
GTTTATAGCG AACGAACCCG CAGCGATGGT GTGCGCGAGG TGGTTGGTAA TGCCCGTCAA
TATGGCCTGC CAGTGCAATG GGTCGAACAT CCCTTTGAAT GGATTTATCA AAAAGAGCTG
CGGGTTGAAC GAGTTTTTGA GCATTTCCCC CCATTAGATC GCTTGATCGG TGGCACAATT
CTCGTCCCGC ATACCTCGAT TAGCACTGAA GTTGAAACCT TTGTCGATGT TTTCCCCAAG
AATCTGCTGG GCTATCCAGT GGGTTGGTTT GTGGCTAATC AGATGATTAC TGGGCAAGCA
GCTTTTTTTC GCTCGGTGGC CCAAAAGCCT AATGCAACTA ATTATTTCCC GCCAGCTCGC
AAAGTTCGCG CCAATCGCAA TGTGTTGCAA GAGTTGCGCA ATCGGATGCG CGATTTACCT
TGCGATCAGC AATTAATCGA TCGTTTGTTT GAGCTGATTC AAACCCAACG CGATGAAGAT
GTTGCGACCA TGCGGCCATT TGTGTTGGCG GATGCTTGGG GTGCTGAACG AATCAGCTTA
TTACGGGTCT TTTTATATGC TACTTGGACG GGCCTGCTCG ATATGAATTG GGATGTGTTG
TGTCCCAATT GTCGGGTTGC CCGCGAACCC TCGCCGCATT TACGCGACCT AACGGCAACC
GCCCATTGTG AAGTATGCAA TATTCGCTAC GATCTCAATT TTGATGAGTA TGTTGAATTG
CGTTTTTCGG TCAATCCCAA GGTGCGCAGT GCTAGTAATG CAACCTTCTG TGCGCTTGGC
TCGCCAGCCT TGAATGAGCA TATCGTTGCC CAAGCACGCC TCAAACCCAA CGAACAGCGT
GAATTAATGG TTCAGCTTGG CCCTGGTGGC TATCGCTTAC GCATGTTGGG GATTGAGCAA
CGCTGCACGA TTCAAGCCCA AGCTGCTGGC GCAACCAATG CCATAATCAA TCTAACCGCG
ACTGGCCCAG ATCAAGCGGC ATTAACGTTG GCACTTGAAC CTACAACCCT GAATGTGCAT
AACACCACCG ATCGCGAACA ACTGTTGATT ATCGAGCACG ATGCCTGGGG GGTTGGTCGG
GTCAGCGCCT CACTAGTTTC GACCTTGCCC GAATTTCGAT CGTTGTTCTC ATCGGAAGTG
CTAGCACCTG GCTTGGGCTT GGCAATCAAA AATCTGACAA TCTTATTTAG TGATATTAAA
AATTCAACCC CGATGTACGA GCAGCATGGC GACTCAAGCG CCTATGCCAT GGTTCGCGAC
CATTTCGATG TGCTATTCAA GGCGATTGAA CAGCACAATG GCTCGATTGT AAAAACGATT
GGTGACGCAG TGATGGCGGT GTTTGCCAGC CCAAACGATG GCGTTGCGGC GGCCTTGGCA
ATCCACCATA ATATGCAGCA AGCCAACCAA GCCCGCCCTG AACGCCCACC AATCGTAATC
AAAATTGGGT TGCATACAGG AACTTGTATT GCGGTTAATG CCAACGAAGT GCTTGATTAC
TTTGGCACAA CCGTCAATGC TGCGGCGCGT GCTCAAGGCC TGAGTGTTGG CGATGATGTG
GTCTTGACTT CTGATGTGAT GGAATCGGCA GGGGTGCGGG CAATACTTAG TCAACATAAT
CTGCTAAGCG AACCTTTTAC TCACAATCTC AAAGGCATAT CGCAGGTGTT TACGCTCTAT
CGGCTGATGC CAATGGCTCA GCGCGAGCAT CGGGCATAG
 
Protein sequence
MALKPFRLKQ STIVPLGLAE VWEIAAQTSK VNHAVGLPPL VYSERTRSDG VREVVGNARQ 
YGLPVQWVEH PFEWIYQKEL RVERVFEHFP PLDRLIGGTI LVPHTSISTE VETFVDVFPK
NLLGYPVGWF VANQMITGQA AFFRSVAQKP NATNYFPPAR KVRANRNVLQ ELRNRMRDLP
CDQQLIDRLF ELIQTQRDED VATMRPFVLA DAWGAERISL LRVFLYATWT GLLDMNWDVL
CPNCRVAREP SPHLRDLTAT AHCEVCNIRY DLNFDEYVEL RFSVNPKVRS ASNATFCALG
SPALNEHIVA QARLKPNEQR ELMVQLGPGG YRLRMLGIEQ RCTIQAQAAG ATNAIINLTA
TGPDQAALTL ALEPTTLNVH NTTDREQLLI IEHDAWGVGR VSASLVSTLP EFRSLFSSEV
LAPGLGLAIK NLTILFSDIK NSTPMYEQHG DSSAYAMVRD HFDVLFKAIE QHNGSIVKTI
GDAVMAVFAS PNDGVAAALA IHHNMQQANQ ARPERPPIVI KIGLHTGTCI AVNANEVLDY
FGTTVNAAAR AQGLSVGDDV VLTSDVMESA GVRAILSQHN LLSEPFTHNL KGISQVFTLY
RLMPMAQREH RA