Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1108 |
Symbol | |
ID | 5732999 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1268834 |
End bp | 1270624 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641278246 |
Product | diguanylate cyclase/phosphodiesterase with PAS/PAC sensor(s) |
Protein accession | YP_001543884 |
Protein GI | 159897637 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain |
TIGRFAM ID | [TIGR00229] PAS domain S-box [TIGR00254] diguanylate cyclase (GGDEF) domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0147764 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGTTC AAATGCCCAA CGTCCGATTG CCCGATTCGG CTGCCCAAGC GGCCATGTTG CCAACCGAAT GGTTATTAAT CGGCTTAGTT TGCGGGCTGG GAATCTGGTT GTGCTGGCGT TGGCAGCGCC GCCGCACCGC TCAGCGCATT AGCACCCAAT TAGGCTTGTG GCACTGGCCG CACAGTCAAC AAACGGTGAT TATCGATCAG CATGCGACGA TTATTGCAAT CAGCCCGCTG CTGAATCAAC AGCTTGGCTA TCCAGCCCTC AGCCTCAACG GTCAATCATT CAAGCACCTC CTGCATCCTG ATTCATCCAA CTATTTTGAA CAACTGCTGC ATACCCAAAC TACCCAAATT CACTGGCAAA TGCTGCGCTG TCGCCATGCT GATGGCTCGT GGCGTGGGCT GGAGTTTTGT CGGCATCAAG TTGGCCGCTA TTGGCTACTC AATTGCCGTG AAACTGAGCC TGTTGTAACT GCGCAACGAA CACCGCCTGC CTGCGATTGG CTGACGGGCT TGCCCAATCG CCAGAGCTTG GTTATTCAAC TACAACGCTT GATCGCCCAG CGCCAACTCG ATCAACAACA CTTTGCCGTC TTGTTTGTTG AAGTTGATCG ATTTAGTGCG ATTAACGATG CCTTGGGCTA CGATGTTGGC GATCAAGTGC TGCAAACGAT CGCCCAACGC CTCCAATCAA CGCTTGGCCC CAATGATTTA GCTGCCCGTT TAGGTGGCGA TGAATTTGTG GTTGTGCTTG GCCACGTGCA ACATGCGCAA CAAGCGCTTG AGCATGCTCA ACGAATTCAA GAGCAATTTA GCCAGCCAAT TGTGCTACGC GATAAGCCAA TTTATACCAC CCTCGGCATT GGTGTGACGC TCGGCGATGG CCAATCCCAG GCTGAAACTT TGCTGCGTGA GGCTGATACG GCCATGTATC AAGCCAAAAC CAATGGCATT AGCCAAATCT TTTTGTTTGA TCGTGCGTTG CATGCGGCGC TGGCTGAGCG TTGGCAACTG GAAATTGATT TGCGCGGAGC TTTAGAGCGC CATGAATTAA TTTTGCATTA TCAGCCAATT ACGGCACTGC CTACCGGCCA GATGGTTGGG GTTGAGGCTT TGTTGCGCTG GCATCACCCG CAGCGCGGCA TGATTCTGCC CAACGAGTTT ATTCATTTGG CCGAGGAAAG TGGCTTAATT GTGCCAATTG GTTTTTGGGC TTTAGAAACT GCCTGCCTAC AATTTTTGGC TTGGCAAGCT GATTATTCGC GGCCCCGCTT ACAAGTGCTT TCGGTCAATC TCTCGGCGCG TCAATTGGCC AGCCCAGCCT TAGTTCAAAC CTTGGGCGAG ATTATTCAGC GCACTGGAAT CAATCCGGCT CAGTTGGAGC TAGAAATTAC CGAAAGTATG GTGATTCATG GCTTTGAATT GGCGCGGGCA CAGTTGGGCG CAATCAAAAA TTTGGGGGTA AAGCTGGCGA TTGATGATTT TGGCACAGGG TATTCATCAT TAAGCTATTT GCACCACTTC CCCTTGGATA CGCTCAAAGT TGATCGCTCA TTCGTCAATG CCATGAGCGA TGGCAGCCGC AACGTCGAAA TTGTGCGGGC AATTATCATG CTCTCGCAGC AATTGGCCAT GAATGTAATT GCTGAGGGCA TCGAAACAAT TGAAGAAGCG GCGACCTTGC GCGATTTGGG CTGTGATTAT GGTCAGGGCT ATCATTTTAG CCGCCCCATG ACAGCGCCCG ATATTACCTC GTGGCTGCAT AGCCAACCGT TGAGTCTTTG A
|
Protein sequence | MSVQMPNVRL PDSAAQAAML PTEWLLIGLV CGLGIWLCWR WQRRRTAQRI STQLGLWHWP HSQQTVIIDQ HATIIAISPL LNQQLGYPAL SLNGQSFKHL LHPDSSNYFE QLLHTQTTQI HWQMLRCRHA DGSWRGLEFC RHQVGRYWLL NCRETEPVVT AQRTPPACDW LTGLPNRQSL VIQLQRLIAQ RQLDQQHFAV LFVEVDRFSA INDALGYDVG DQVLQTIAQR LQSTLGPNDL AARLGGDEFV VVLGHVQHAQ QALEHAQRIQ EQFSQPIVLR DKPIYTTLGI GVTLGDGQSQ AETLLREADT AMYQAKTNGI SQIFLFDRAL HAALAERWQL EIDLRGALER HELILHYQPI TALPTGQMVG VEALLRWHHP QRGMILPNEF IHLAEESGLI VPIGFWALET ACLQFLAWQA DYSRPRLQVL SVNLSARQLA SPALVQTLGE IIQRTGINPA QLELEITESM VIHGFELARA QLGAIKNLGV KLAIDDFGTG YSSLSYLHHF PLDTLKVDRS FVNAMSDGSR NVEIVRAIIM LSQQLAMNVI AEGIETIEEA ATLRDLGCDY GQGYHFSRPM TAPDITSWLH SQPLSL
|
| |