Gene Haur_0797 details


Gene Information

Locus tag: Haur_0797
Symbol:
ID: 5732682
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 900212
End bp: 901438
Gene length: 1227 bp
Protein length: 408 aa
Translation table: 11
GC content: 48%
IMG OID: 641277928
Product: diguanylate cyclase
Protein accession: YP_001543573
Protein GI: 159897326
COG category: [T] Signal transduction mechanisms
COG ID: [COG2199] FOG: GGDEF domain
TIGRFAM ID: [TIGR00254] diguanylate cyclase (GGDEF) domain
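
The start, end, gene length and protein length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the record.

```python
# Coordinates copied from the record. Treating them as 1-based and
# inclusive is an assumption of this sketch.
START_BP = 900212
END_BP = 901438


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1227 408
```

Both values agree with the gene length (1227 bp) and protein length (408 aa) listed in the record.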


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.71948
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGCTTG ACCAGCCAAC AGTTGCCTTA ATTGTCGGAA TTGGCTTGTT GCTCCAAGCA 
TTGGTGATTG GGGCTTTTTT AGTCTTTGTG CGGCGCTATC AAGGGATTCG CTGCTTCTTG
ATTGGCACAA TTGGCTTGGG TTTGGGCTTT TTAGCCTTGG CGTTATTGCC TCAAACCTCA
ATCGTAGCAA TCACAAGTAG TTATATTCTG ATTCTCAGTG GCTTGTTTGG GCTAAGCAGT
GGCATTTTAC GCTTCATTGG AGCCGCCAGC TCAGCCAAAA TTTATGCGAT CTGCTGGCTA
ATCGTTGCTA GCATGCTCAG TCTGTGCGCA TGGTTGTTTG CCGATTCAGG CTTAATTATC
AATTTGGCGA TGCTGATCGG GAGTGCTGGC ATGGCCTATA CTGCGCTCCA GGTGTTTTTC
CAGCGTACCA TCCCGTATCA AAGTGCTAGC CGTTTGTTGG CTTTAGCCTT GACCCTCAAA
ATGGCATTTG TTGTTTTGGG ATTGGCCTAT TTTTGGTATA GCCAACGCTG GCCTAGCCTG
ACGACGTTGA AATTAGGCTA TAGCGTGTTG ATTTTGCTGC TTTCGTTGCT GTGGACTGGT
GGCTTTGCGG TAATGATTAC CCAACGCATG CAATGTGATT TATCGGCCTT AGCCAGTATT
GATGTCTTAA CTGGCATCGC CAATCGCCGC GCAATCAACG AATATTTAGA GCAGGCAATT
GCCAAATGGC GGCGCAACCA GCAGGGTTTT GCGGTAATCA TGCTCGATAT TGATTGTTTT
AAGCAGATTA ATGATCGCTA TGGTCACCAC GCGGGCGATT CCGTGTTACG CCATATTGCC
TTGGTGCTGA GCGAACAAGT GCGTATTAAT GATCTGATTG GGCGCTGGGG TGGTGAAGAG
TTTCTATTAA TTGTTGATGC CGACACGATT CAGCAGGCAA CACTGATGGC TGAACGCTTA
CGTCAAGCTA TTCAGCAACA GCCAACCGTC TGGAATGATC AGGTGATTCA GCATACAGTC
AGCATTGGGA TTGCGGTCTG TGGAATGCAT GGCTTCAACG AAGCCCAATT GCTGACTGCT
GCCGATTTAG CACTCTACGA GGCCAAGGAA ACAGGTAAAA ATCGCTGGAT TGTCTATGAG
CGCCGTTTGT TGAACCAACT TGATCCAACC CTAGAACTGG CCAACGAAGG ACTTATTTTC
GATGTTGATT CATCGATTTA TGCCTAA
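
The gene sequence is laid out in blocks of 10 bases, 60 per line. The following is a minimal sketch of how such a listing could be turned into a plain string and its GC content computed, for comparison with the GC content field of the record. The helper names are hypothetical, and only the first line of the sequence is used as sample input here.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied with its block spacing.
first_line = "ATGGCGCTTG ACCAGCCAAC AGTTGCCTTA ATTGTCGGAA TTGGCTTGTT GCTCCAAGCA "
seq = clean_sequence(first_line)
print(len(seq))                               # number of bases on the line
print(f"{100 * gc_content(seq):.0f}% GC")     # GC percentage of this line only
```

Passing the full listing through `clean_sequence` should give a 1227-base string if the listing is complete, and `gc_content` of that string can then be compared with the value reported in the record.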
 
Protein sequence
MALDQPTVAL IVGIGLLLQA LVIGAFLVFV RRYQGIRCFL IGTIGLGLGF LALALLPQTS 
IVAITSSYIL ILSGLFGLSS GILRFIGAAS SAKIYAICWL IVASMLSLCA WLFADSGLII
NLAMLIGSAG MAYTALQVFF QRTIPYQSAS RLLALALTLK MAFVVLGLAY FWYSQRWPSL
TTLKLGYSVL ILLLSLLWTG GFAVMITQRM QCDLSALASI DVLTGIANRR AINEYLEQAI
AKWRRNQQGF AVIMLDIDCF KQINDRYGHH AGDSVLRHIA LVLSEQVRIN DLIGRWGGEE
FLLIVDADTI QQATLMAERL RQAIQQQPTV WNDQVIQHTV SIGIAVCGMH GFNEAQLLTA
ADLALYEAKE TGKNRWIVYE RRLLNQLDPT LELANEGLIF DVDSSIYA
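
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is an illustration, not the database's own procedure: it builds the codon table from the compact NCBI-style amino acid string, which for the codon assignments used here is the same in translation table 11 as in the standard code, and it does not model the alternative start codons of table 11 (this gene starts with ATG, so that does not matter for this case). The name `translate` is hypothetical.

```python
from itertools import product

# Codon table in NCBI's compact layout: bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon.

    Whitespace (block spacing, line breaks) is ignored. Codons containing
    characters other than A, C, G, T are reported as 'X'.
    """
    seq = "".join(text.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 27 bases of the gene sequence in the record.
print(translate("ATGGCGCTTG ACCAGCCAAC AGTTGCCTTA"))  # MALDQPTVAL
print(translate("GATGTTGATT CATCGATTTA TGCCTAA"))     # DVDSSIYA
```

The two outputs match the first ten and last eight residues of the protein sequence listed above; the final TAA is the stop codon and is not part of the 408 aa product.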