Gene Ava_3868 details


Gene Information

Locus tag: Ava_3868
Symbol:
ID: 3678508
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 4817864
End bp: 4820929
Gene Length: 3066 bp
Protein Length: 1021 aa
Translation table: 11
GC content: 40%
IMG OID: 637719220
Product: diguanylate cyclase/phosphodiesterase with PAS/PAC sensor(s)
Protein accession: YP_324368
Protein GI: 75910072
COG category: [T] Signal transduction mechanisms
COG ID: [COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain
TIGRFAM ID: [TIGR00229] PAS domain S-box
            [TIGR00254] diguanylate cyclase (GGDEF) domain
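
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 4817864
END_BP = 4820929
GENE_LENGTH_BP = 3066
PROTEIN_LENGTH_AA = 1021


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count, assuming the gene length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: the coordinate span is 3066 bp, which corresponds to 1022 codons, or 1021 residues plus a stop codon.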


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTTTTA CGCGTGGCAA AGTGTACAAA AATCATCTTG ATCTTCAGAA AAACAAAGGT 
TTTCCAACGC CAACAAAAGA TTCTATCAGT GAATGCGAGG CTGGGAATTC AGACTTTACA
CCAGAATTTC AGTCTTATGA AAAGTCCATA TTAGACTTGC AAGAAATAAT TAATTTATTA
GCTGAAATCA ACCCATTTCC TGTAGCAATT ATTGGTTTAG AAAGCCATCA AATTTTATTT
AAAAATAAGC TGGTTGATAA CCCGGTATTG CAGCAACTAA TTCCCGATTT TTTAGCTGAT
TCAGATTTAT GGACTCAGTT AAATAGCAAA TTAGTAAATG GCAATTTTAT TAGAAAATTG
GCGCTAGAAA TCAAAAGCAG CAATGAAAAC CGTTTTTTAG CTATAATTTC TGGAAAATTG
GTTAATTATG AGCATGAACG GGCAGCATTC TTGGTATTTA CAGATGTAAA TACCGTCATA
GGAGCCAGTG AGGAACAACA AAAAGATACA TTAGAAAAAC CAGAACTTAG CAATTTAATC
ACTGATACGA GTGATAGCCA GCGACTACCT ACGATAAAAC ATAGTCCATG GCCGGCTCCC
TCACATTTGA TGGAACAGGC GTTAGCTGCT ATCAGTAATG GCATTGTGTT GACAGATGCC
AATCAACCAG ACAATCCAAT TATCTATGTA AATCAAGCTT TTGAAGCCAT GACTGGCTAC
TCGGCTGGTG AAGTTATTGG ACAAAACTGC CGATTTTTAC AAGCTAATGA AACAGACCAG
CTCAGCCTTA GCGAGTTACG TTCTGCCCTC CAAGAAAAAA AAGAATGCCA TGTCGTCATC
AAAAATTTCC GCAAAGATGG CACCGAATTT TGGAATGAAC TTTATATTGC ACCTGTATTT
GATAGTTGTG GGCAGTTAAC TCATTTTATC GGTGTTCAAA ATGATATAAC TCATCACCTA
CAGGCATTAG AGACATTACA AGAGCAAAAA GAACAGTATC GTCGCATTGT AGAAACGGCC
AGTGAGGGAA TTTGGCTACT CAATGAAAAT AACGAAACTA CCTTCGTCAA TCAACAGATG
GCGAGTATGT TGGGTTACAC CATTGAAGAA ATGCTCGGTG CCAGCTTGTT TTCCTTTATG
GATGCCGAAG GTTTGGATAT AGCTCAAGAC TTACTATTAG GCCGTCAGCA AGACATTCAG
GAGAAACACA ACTTTAAATT CCGTTGCAAA GATGGCTCAG ACTTATGGGC AATTCTTTCC
TGTACGCCAT TCTTAGATGA GCAAGGAAAC TATACTGGCG CGTTGGGAAT GTTAACTAAT
ATTAGCGATG TCTACGACGA ACTGCGCTTA CGCAAACAAG CAGAAGCTTC ATTGCAAGAA
AGCAAAGAAC GTTTAGACGG AATTTTAAAT TCCCTGGAAG TTGTAATCTG GTCAATTGCC
GCCGATACTT TTGAGATGCT TTATCTCAAT TCTGCCGTAG TTCAAGTTTA CGGTCGTTCT
GTTTGTGAAT TTTACGATAA TTCAAAGCTG TGGTTTGAGT TAATTCACCC AGAAGACCAG
CAAAGGGTGA GTCAATCAAT TAAACCATTA CTGGCAAATG GTAGCCACGA ATTAGAATAC
AGAATCCTCC GGCAAGATGG ACAGGTACGC TGGCTTTATA ACCACAGTCA TGTGATTTAT
GACGCTGTTG GTCAGCCAAT TCGTATTGAA GGTGTTGCTA CGGATATTAC TGAGCGGAAA
AACATGGAAG AGCGGCTAGT CTACCATGCT TTTTACGATG ATCTGACTGG ATTGCCCAAC
CGAGTTTTAT TTATGGATCG GTTAGCGCAA ACCATCAACC AAGCCAAAGA ATCCCCGAAT
GATCTATTTG CCGTACTATT TTTGGATCTT GACCGTTTTA AAGTTGTGAA TGATAGTTTG
AGTCACTTAG TGGGTGACCA GCTACTAGTG AGTTTTGCCC AACGCTTACA AAGTTGCTTA
CAGCCAGAAG ATACTCTGGC TCGTTTGGGT GGAGATGAGT TCACAATTTT ACTATCTCAC
ATTCAATCTA TCGACGATGC AACCCGTATA GCCGAAAAGA TTCATCAGGC ACTCAAGTTA
CCATTTAACT TGAGCGGATA TGAAGTATTT ACAACTGTCA GCATTGGCAT TGCTTTAAGC
TCAAATGATT ATGTCCAAGC CGCAGATTTG CTGCGAGACG CTGATACTGC GCTTTACTGC
GCTAAGGAGC AAGGTAAAGC TTGGCACATA GTATTTGATT CTACTATGTA TGACCGAGCT
GTAGCTCTGT TGCAGTTAGA AACTGATTTG CGTTGGGCGA TCGCCAGGCA AGAGTTGTAT
GTTGTTTACC AACCCATTGT CTCAGTAGCC ACAGGTAAAA TTACCGGATT TGAAGCACTA
GTTCGCTGGG AACACCCAGA AAGAGGACTA ATTTCACCAG TCGAGTTTAT TCCTGTAGCC
GAAGAAACAG GACTGATTAT TCAGATTGGT CAGTTTGTTT TGCGGGAATC TTGCCAACAA
TTAAAACAAT GGCATTTAGA GTTTCCTGAG TTTCAGCATT TGAGTATCAA CGTCAATCTT
TCCGGTAAAC AGTTTTCCCA ACCATATTTA GTTGAAGAGA TTGAGCAACT GTTACAAGAA
TTTGAGCTAG ACGCTAACAG CATCAAGTTA GAAATTACCG AAAGCGCCAT TATGGCTAGT
CCTGAACAAG CTGCTACCAT TCTTCAACAA CTAAAAACTT TAGGGATTCA GTTGTGTATT
GATGACTTTG GTACAGGTTA TTCATCTTTA GCTTATTTAC ATTGTTTCCC CATTGATATT
TTAAAAATTG ACCGTTCTTT TACGAAGCGA ATTGATAGCG ATAGTGAGCA ATTGGCAATT
ATCCGTGCTA TTGTCACACT TGCCAGTAAC TTGGAAATGA GTGTAGTGGC TGAAGGTGTA
GAAACAGTCA ACCAGTTGGT ACAATTACAG TTATTAAAAT GTGACCAAGC CCAAGGGTAT
TTGTTCTCAA AACCCTTAAG TAGCGACAAA GTTAGCTTGT TATTAGCTGC AAAAACACAA
TATTAG
 
Protein sequence
MVFTRGKVYK NHLDLQKNKG FPTPTKDSIS ECEAGNSDFT PEFQSYEKSI LDLQEIINLL 
AEINPFPVAI IGLESHQILF KNKLVDNPVL QQLIPDFLAD SDLWTQLNSK LVNGNFIRKL
ALEIKSSNEN RFLAIISGKL VNYEHERAAF LVFTDVNTVI GASEEQQKDT LEKPELSNLI
TDTSDSQRLP TIKHSPWPAP SHLMEQALAA ISNGIVLTDA NQPDNPIIYV NQAFEAMTGY
SAGEVIGQNC RFLQANETDQ LSLSELRSAL QEKKECHVVI KNFRKDGTEF WNELYIAPVF
DSCGQLTHFI GVQNDITHHL QALETLQEQK EQYRRIVETA SEGIWLLNEN NETTFVNQQM
ASMLGYTIEE MLGASLFSFM DAEGLDIAQD LLLGRQQDIQ EKHNFKFRCK DGSDLWAILS
CTPFLDEQGN YTGALGMLTN ISDVYDELRL RKQAEASLQE SKERLDGILN SLEVVIWSIA
ADTFEMLYLN SAVVQVYGRS VCEFYDNSKL WFELIHPEDQ QRVSQSIKPL LANGSHELEY
RILRQDGQVR WLYNHSHVIY DAVGQPIRIE GVATDITERK NMEERLVYHA FYDDLTGLPN
RVLFMDRLAQ TINQAKESPN DLFAVLFLDL DRFKVVNDSL SHLVGDQLLV SFAQRLQSCL
QPEDTLARLG GDEFTILLSH IQSIDDATRI AEKIHQALKL PFNLSGYEVF TTVSIGIALS
SNDYVQAADL LRDADTALYC AKEQGKAWHI VFDSTMYDRA VALLQLETDL RWAIARQELY
VVYQPIVSVA TGKITGFEAL VRWEHPERGL ISPVEFIPVA EETGLIIQIG QFVLRESCQQ
LKQWHLEFPE FQHLSINVNL SGKQFSQPYL VEEIEQLLQE FELDANSIKL EITESAIMAS
PEQAATILQQ LKTLGIQLCI DDFGTGYSSL AYLHCFPIDI LKIDRSFTKR IDSDSEQLAI
IRAIVTLASN LEMSVVAEGV ETVNQLVQLQ LLKCDQAQGY LFSKPLSSDK VSLLLAAKTQ
Y
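
The relationship between the gene sequence and the protein sequence can be illustrated by translating a short stretch of the gene. The following is a minimal sketch, not the pipeline used to produce this record: it builds the codon table by hand (translation table 11 assigns the same amino acids to codons as the standard code and differs in permitted start codons, which are not modelled here), and the `translate` helper is a hypothetical name. Only the first and last lines of the gene sequence shown above are used.

```python
# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
FIRST_LINE = "ATGGTTTTTA CGCGTGGCAA AGTGTACAAA AATCATCTTG ATCTTCAGAA AAACAAAGGT"
LAST_LINE = "TATTAG"

if __name__ == "__main__":
    print(translate(FIRST_LINE))  # MVFTRGKVYKNHLDLQKNKG
    print(translate(LAST_LINE))   # Y
```

The first 60 bases translate to the first 20 residues of the protein sequence (MVFTRGKVYK NHLDLQKNKG), and the final six bases give the terminal Y followed by a TAG stop codon.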