Gene GM21_3597 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3597 
Symbol 
ID8138970 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4175860 
End bp4177017 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content61% 
IMG OID644871217 
Productdiguanylate cyclase 
Protein accessionYP_003023376 
Protein GI253702187 
COG category[T] Signal transduction mechanisms 
COG ID[COG2199] FOG: GGDEF domain 
TIGRFAM ID[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones134 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGAAG AAAAGGAGTG GGTTAGCTCT GATGCTATGG CCAGATACGC CGGCCACGAC 
CGGGTCCTGG GCGTCATAAC ATGGCTCTTG GTAGCGCTGG TGCTGCTCGA CATATCGCTT
TTGCACGTCG GTCACCGCAG CACCATCGTT GTGCTCTTCT GCTCCCTGGG GCTTGTCTGC
TACAAGGCTT CGGCGCGGTT CCTCGTGCGC CGGGGCGAAA CGAAAAGTTT GCTCGATCTG
GTCCTGCTCC TTTTGTACGC CATCGCCGTC AGTTGGTTTA CCGGCAAAAC TTCCAGCCCC
TTCATCTCGG TTTTGTACCT GATCCTGATG GCGACCTCGC TCACCCTCGG GCGCTGGATC
GCCTTCATCA TGACCGGCCT GACCATTGCC CTCTATACCT TGCTCGCCTC CTTCCAGTCC
CCTAGCTTCT GGTACGACAT CGGCGGACAC CTCGTGAAGA TATTCCCCTT CATCCTGATC
GCGCACCTGG GTGCGCTGCT GCGAGGCGAG GCGGAGAGCG CGCGCGCCGA GGTGGAGCGG
CTTTCACTCA CCGACGACCT CACCGATCTC AACAACATGC GCAGCTTCGA GGCCCTTGCG
CTGCAGCAGG AGAAGATCTC GAAACGCTAC GGCACCCCGT TTTCCATCTG CATGCTGGAC
GCAGACAATC TGAAGCAGAT AAACGACCGG CACGGGCATC TGGCGGGGAC GGCGCTGATC
AAGTGGACCG CGCGCATCAT AGCCTCCAAT ATCAGGGAGA GCGACGTCGC CGCCAGGTTT
GGCGGCGACG AATTCATCAT CATGTTCGCC GGCCGGGAGC AGCAAAATAT CCTCGCTGCC
GTGGAGAGGA TCGTTCGCGC CATGAACGAC TCTCCTTTCT CCTTCGAGGG TGAGTTGGTC
CAGGGGACGC TGTCGGCCGG GGTGGCGTCG TTTCCGGCTG CCGGCGAGGA CCTGCGCAGC
ATTGTGAAGA AGGCGGACCT GGCGATGTAC CGAAGCAAGA GGCTGGGCAA GAACCGGGTT
TCGCTCTTCG ACGATCAGGA AGGGGAGGCT GTGCCGAGCG GGTTACAGGT AGGGGGAGAA
AAGCTTTGCC GCGGCGTCCC TCAGCTTGAT GGCGAGGGGG CGTCCGTCCA TCTCGGCGAG
GGTGACCTCC CGGGATAG
 
Protein sequence
MIEEKEWVSS DAMARYAGHD RVLGVITWLL VALVLLDISL LHVGHRSTIV VLFCSLGLVC 
YKASARFLVR RGETKSLLDL VLLLLYAIAV SWFTGKTSSP FISVLYLILM ATSLTLGRWI
AFIMTGLTIA LYTLLASFQS PSFWYDIGGH LVKIFPFILI AHLGALLRGE AESARAEVER
LSLTDDLTDL NNMRSFEALA LQQEKISKRY GTPFSICMLD ADNLKQINDR HGHLAGTALI
KWTARIIASN IRESDVAARF GGDEFIIMFA GREQQNILAA VERIVRAMND SPFSFEGELV
QGTLSAGVAS FPAAGEDLRS IVKKADLAMY RSKRLGKNRV SLFDDQEGEA VPSGLQVGGE
KLCRGVPQLD GEGASVHLGE GDLPG