Gene GM21_3072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3072 
Symbol 
ID8138422 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3561119 
End bp3562189 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content61% 
IMG OID644870676 
Productdiguanylate cyclase 
Protein accessionYP_003022858 
Protein GI253701669 
COG category[T] Signal transduction mechanisms 
COG ID[COG3605] Signal transduction protein containing GAF and PtsI domains
[COG3706] Response regulator containing a CheY-like receiver domain and a GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.000000000986718 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCGAGT GTGGTTGCGG TACCCAAAAA GAATCGCTCG AATCGCAGGT TAAGGCGCTC 
AAGGACCTGA TAGAGGTCGC CAAGGCGGTC GTCTCGACGC TCGATCTGGA TACGGTGCTG
CAGGCTATCC TCAACAGCGC CATGGGATTT GCCGAGACCC CCGCCGGGAG CGTCGCCCTT
TATTACGACG CCAAGCGTGA GTTGAGCCTG CATGCCCACT CGGGGCTTAC GGCGGACTTC
GTAAAGAAGG AGCGATGGGA GGTGGCCCCG GGCGGGCTTA CCGAACAGGT GCTTTCTGCG
GGGGAGATCT TCCTGATCGA GGACACGGAG AAGACCCCGT TCTTCAAGAA CCCGATAGCG
CTAAACGAAG GTATCCGCTC TCTGGTCTGC GTGCCGCTCA TCTTCCAGTC GCGCATCGTG
GGGATACTCT ACCTTGACGA CTTCAAGCCG AGGGAGTTCG ACCGGGAGAA GATGAACATG
CTCTCGATCC TCGCCTCGTT CGCCGCCATG GCGATACACA ACGCGACGCT GCACAAGCGG
ACCAAGCTCC TGGCGATCAC CGACTCGCTC ACCGGGCTGC ACAACCACCG CTACTTCAAG
CAGTACTTCA GGCAGGAGAT GGGGCGCGCC AAGCGCTACC ACAAGCCCTT CTCCATCATC
ATGATGGACG TGGACGACTT CAAGTCCTAC AACGACAGCT TCGGCCACGC CACCGGCGAC
AGGCTGCTGG CCTTCATGGG CGAGATTATC CTGCAGACCA TCCGCGGCGT GGACGTCGCC
TTCCGCTACG GCGGGGAAGA ATTCATCGTG CTGCTCCCCG AGACCAAGCT CGACAAGGCT
ATTCTCGCCG CCGAGCGTCT GCGCGAGAGC GTGCAGGCCG GAACTGCTAA CCGGCCGGTG
GACGGGTCGG GTCGCGGCGT GACCGTCAGC ATCGGCGTGG CGAGCTACCC CGACAATGCC
GACAAGATGG ACGAACTCTT CAACATCGTC GATTCTCTCC TTTACCTTGC CAAGCGCTGC
GGCAAGAACA AGGTATATCA CCAGGAAAGC CTACAGATCC CCGCGCCATG A
 
Protein sequence
MSECGCGTQK ESLESQVKAL KDLIEVAKAV VSTLDLDTVL QAILNSAMGF AETPAGSVAL 
YYDAKRELSL HAHSGLTADF VKKERWEVAP GGLTEQVLSA GEIFLIEDTE KTPFFKNPIA
LNEGIRSLVC VPLIFQSRIV GILYLDDFKP REFDREKMNM LSILASFAAM AIHNATLHKR
TKLLAITDSL TGLHNHRYFK QYFRQEMGRA KRYHKPFSII MMDVDDFKSY NDSFGHATGD
RLLAFMGEII LQTIRGVDVA FRYGGEEFIV LLPETKLDKA ILAAERLRES VQAGTANRPV
DGSGRGVTVS IGVASYPDNA DKMDELFNIV DSLLYLAKRC GKNKVYHQES LQIPAP