Gene Ava_4149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_4149 
Symbol 
ID3681088 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp5175056 
End bp5176831 
Gene Length1776 bp 
Protein Length591 aa 
Translation table11 
GC content40% 
IMG OID637719495 
Productdiguanylate cyclase 
Protein accessionYP_324643 
Protein GI75910347 
COG category[T] Signal transduction mechanisms 
COG ID[COG3706] Response regulator containing a CheY-like receiver domain and a GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0015025 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.169582 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCATG ATCTTTTGGG TAGCATCCGG ACAAAATTGA TCGCCTCGTT TCTCGTTGTT 
GCTTTGATTC CGTTACTGTT ATTGGCATCT ATTAACAAAC AGACAACGGA AACAGCATTA
ACTGACAACG CTCGGCAAGC TTTATCTGCT GCGGCTAACC AAACCAGTAA CAGAATAGAT
GCTTTTATTG ATAGAAATCT TAATGCTGTG CGCGTAGAGG CGCTTTTACC AGGCTTGGCA
GGCTACCTCA GCCTAACTCC AAAAGCGCGA GATGATAGCC CCGAAATGCA ATTGGCAACG
GAAACATTAA ATCGGCTCAG TCGCAAAGAT ATGGTTCATA TTATCTCCTA TGGGTTGCTC
GACTTAAAAG GGAAGAATGT ATTGGATACA TATACATATA CATCAGACAT TAGCGAAGAT
GAATCAGATC AAGATTATTT TCAAAAACCA CTGCAAACTG GATTATCCTT TGCTTCTAGT
ATGAAGCGAT CGCCGATAAT TCCTGAGCTT ATTACTATCT TTTTTAGCAG TCCCGTTCGT
AATGCCCAAG GAGATATATT AGGTGTCTTG CGTGTTTCAT ACAATGCTAC TGTGATTCAG
CAGTTAGTAA ATAGAGAAAC TGAACTGGCT GGAGCTAAAT CCTTTGCTAT TCTTTTAGAT
GAAAATCATA TTTATCTGGC ACATAGTCAT GCACCGCAAC TACTTTTTAA ATCAATTGTG
CCTCTACCTT TCGATATTAT AACTCAACTA CAAAGGGAAG GGCGCTTGTC TAATGACCCT
ATCAGAGAAT TAGCAACTAA TGAGTTGAAA CTTAAACAAG CATTGAATAA TAAAAAGTTA
CATTTAACTA CTACTTTGTC AACAACAGGT AATCAGGTTA ATTTGATAGC GATCGCCAGT
TTAAAATATA AACCTTGGTC TGTTTTGTTC GCACAGCCTC TAGTTGTTGC CCTTGCACCT
GTAGAAAAGC AAATTCATGA CGCAATGTTT CTATTTGTAT TGATCGCTTC AGTCGTGACA
ATTATCGCTT TTGCTATTGG GCAACTGCTA ACAAGACCAA TAATTTACCT GACCAATATA
GTTTTTCAGT TTACAACAGG TAACTTAAAT ATCCGCGCCA AAATTAGCTC AACAGATGAA
ATAGGTCAAC TGGCGAAATC GTTTAATAAT ATGGCATTTC AGTTACAAAC GTCTTTTGAA
ACCTTAGAAC AACGGGTACA AGAAAGAACA GCAGAGTTAG TAATTGCCAA TCAGAAACTA
GAACAACTGG TAAATCTAGA TGGTTTGACT CAGGTGGCTA ACCGTCGTTG CTTCGATGAA
CGACTAAAAG CAGAATGGAA ACGCCTGGCG CGAGAACAAC AACCCCTGTC ACTGATTTTA
TTCGATGTTG ATAAATTCAA ATCTTACAAC GACTACTATG GCCATCTTGG AGGCGATGAT
TGTCTAATCA CCATAGCGCA AGCTGTGCAA CAGAAGCTTC ATCGTCCTGC TGACTTACTA
GCGCGTTACG GAGGAGAAGA ATTCTCGATA CTCCTCCCCA ATACTGACTT ACTAGGAGCG
ATCAAAGTAG CACAAATTAT TCAACAAGCA ATTTACGATC AAGCCATTCC CCATGCACAG
TCTGATATAA AGGATATCGT TACACTTAGT TTGGGTATTA CTTCTATTAT ACCTGCTGGA
GATATTAATC CTGATACACT CATCGCTTCA GCCGATAAAG CACTGTACAA TGCCAAACAA
CAGGGGCGCG ATCGCTATTG TACTCATGAG ACATAG
 
Protein sequence
MQHDLLGSIR TKLIASFLVV ALIPLLLLAS INKQTTETAL TDNARQALSA AANQTSNRID 
AFIDRNLNAV RVEALLPGLA GYLSLTPKAR DDSPEMQLAT ETLNRLSRKD MVHIISYGLL
DLKGKNVLDT YTYTSDISED ESDQDYFQKP LQTGLSFASS MKRSPIIPEL ITIFFSSPVR
NAQGDILGVL RVSYNATVIQ QLVNRETELA GAKSFAILLD ENHIYLAHSH APQLLFKSIV
PLPFDIITQL QREGRLSNDP IRELATNELK LKQALNNKKL HLTTTLSTTG NQVNLIAIAS
LKYKPWSVLF AQPLVVALAP VEKQIHDAMF LFVLIASVVT IIAFAIGQLL TRPIIYLTNI
VFQFTTGNLN IRAKISSTDE IGQLAKSFNN MAFQLQTSFE TLEQRVQERT AELVIANQKL
EQLVNLDGLT QVANRRCFDE RLKAEWKRLA REQQPLSLIL FDVDKFKSYN DYYGHLGGDD
CLITIAQAVQ QKLHRPADLL ARYGGEEFSI LLPNTDLLGA IKVAQIIQQA IYDQAIPHAQ
SDIKDIVTLS LGITSIIPAG DINPDTLIAS ADKALYNAKQ QGRDRYCTHE T