Gene GM21_2096 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2096 
Symbol 
ID8137432 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2433654 
End bp2436515 
Gene Length2862 bp 
Protein Length953 aa 
Translation table11 
GC content64% 
IMG OID644869711 
Productresponse regulator receiver modulated diguanylate cyclase/phosphodiesterase with PAS/PAC sensor(s) 
Protein accessionYP_003021906 
Protein GI253700717 
COG category[T] Signal transduction mechanisms 
COG ID[COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR00254] diguanylate cyclase (GGDEF) domain 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.0000000000903564 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAACCTT TGAACCGGCC GTTGAAGGAT ATCTCGCTCT TGCTGGTCGA GGACGAAGTC 
GAGGCGCGGG ACATGCTGGA ACGGATGCTC GGCCTCAACT ACCAGGGTGT GAAGATATAC
ACCGCCGACA ACGGGGTGAG CGGCCTTGAG GTCTTCCGCC GGTTACGGCC CGAGATCGTG
GTCACCGACA TCAACATGCC GCAGATGAAC GGCATCGCCA TGGCGCGCGG GATCAAGGAA
CTCGAACCCG AGGCTACTAT CGTCGCCGTG ACCGCCCACA GCGAGACCTC TTATCTCTTG
AGCGCCATCG AGATCGGCAT GGACCATTAC GTGCTGAAGC CGGTGAACTA CCCGGAACTC
TTCCACGTGC TGGACCGCAT AGCCGAAAAG ATCATGCTGA AGCGCCTGGT TTCCGAGCAG
GTGGAGCGTA TCCGGCGCAG GGAGCAGCAG CTCTCCCAGG CGGAGAGAAT CACACACCTG
GGAAGCTGGG AGTGGAACCT GGAGAGCGGA GAAACGGTCT GGTCCGACGA ACTGTACCGG
ATCCTGGGGC TTGAGCCGGA GAGCGTTGCG GCGACCTTCG AGTCGCTTTT GCGGATGGTG
CACCCGGCCG ACCGGGAGAC CATGGAAAAA GCGGTTCAGC AACTGCGCGA GACCGGCAGC
GCCCTGGGAA TCCAATACTA CCGGCTGCTG CGCGCCGACG GCGAGACCCG GATCGTGCGC
GGCCAGCTCG AGTCGGTGCG GGACGGCAGC GGCCGGCAGA CGGTGATCGG CGCCTGCCAC
GACGTCACCG AGCTCAAATG GGCCGAGGCA GCCCTGCGCG CCTCGGAGCG GCGCTTTTCC
AAGATATTCC AGGCGACCCC GGACCTTTTG AGCATCACGA GTCTTAAGGA CGGGACCATT
CTGGAGGTGA ACGAGGCCTT CCTGCGCGTC CTCGGTTACC AGCGCCAGGA GGTGATCGGC
ACCTACGCCG GGGAGCTTGG GCTTTGGGCC GACTCGGATC AGCCGGCGGC CCTTTTGCAG
CAGCTTGAGG AGCGGGGCGA GGTGCGGGAC ATCGAGGTGC GGCTCAAGGG GAGAAACGGC
CGGGAGCTCG AGGGGCTTGT GTCGGCCGAG CTGATCGAAA TCAACGGCAA CCCCTACCTC
CTCACGCTCT TCAAGGACAT CAGCGAAAGA AAGAGGCTAG AGGAGGACCG GGCGCGGCTT
GCCGCAATAG TGGAATCCTC CGACGACGCC ATCATGGCCG TGGACCGCGA GGGGGCCATC
ACCAGCTGGA ACGCCGGGGC CGAAAAGATG TTCGGCTATT GCGCCCAGGA CATGAAAGGG
GTGCACCTCT CGTTCCTGGT TTCCCCTGAG AAAAAGGAGC AGGTCACCTG CCTCAACGAC
GGCATCCGCA GCGACGGCGC GGTCACCCAC TGCGAGGTGG TGCACGTGAC CCGCGGGGGG
AGCCAGATCT ACGTCTCCCT CACAATGTCC CCCATCAAAA GCGCCGACGG CGGCATCGTC
GGGACCTCCT GCATCGCGCG CGACGTCACC GAGCGCACCC GCATGGAAGA GATCATCAAG
CACCAGGCCC AGCACGACAC CCTGACCGAT CTCCCCAACC GCAAGCTCTT CGGCGATTTC
CTCAACCTGG AGTTGGCCCA GGCGCGCCGC AACAGAAAGA GCCTCGCCGT GCTCTTCATG
GACCTGGACC ACTTCAAAAA GGTGAACGAC TCCCTGGGGC ACGCGGCGGG GGACCGTCTG
CTGCAGGCGG TGGCGCAAAG GTTGAAGCGC TGCGTCCGCG AGTCGGACAC CGTGGCGCGC
ATAAGCGGCG ACGAGTTCAA CGTCCTCATG CCGGACCTGA CCCAGACCGA CGACGTCGGC
ATCGTGGTCG GCAAGATCAT GGGGGTGTTC CGGACCCCGT TCGTCCTTGA GGGGGCGGAG
CTGAAGGTGA CCACCAGCGT CGGCGTGAGC ATGTTCCCGG ACGACGGCGA CTCGGCCCAG
GAACTTCTGC AAAAGGCCGA CGGCGCCATG TACGTCGCGA AACAGACCCC CGGCAACAGC
TACCAGTTCT ACAACGGCGA GATCAACGCC CGTACCGTGG CCAGGCAGAA CATGGAGCGC
CGCCTGCGCG AGGCGGTGTC CAAAGACGAG CTGGATCTGG TCTACCAGCC GCTATTGAGC
CTTGAGTCGG GGGAGATCGT CGGGGCGGAG GCCCTTTTGC GCTGGCATCA TCCCGAGCAT
GGGCTATTGC TCCCTGCGCA GTTTCTTCCG GTCGCCGAGG AGACCGGTGC CATCGTCCCC
ATCGGGGAGT GGGTCCTCTT CAACGCCTGC CGTCAGATGC GCCTGTGGCA GAGCCAGGGG
TTCGACTTAA GCGTGGCGGT GAACCTCTCC AACCGCGAGT TCCACCAGCC GAACTTCATC
GACCTGGCCA TGCGCGCCCT CTCCCAAAGC GGCCTCGATC CCGCCTCCCT GGAGCTGGAC
GTCACCGAGA AGGCGATCAT GGACAACCCC GCGTTCTCGC TGCGCAACAT GCGCCGGCTG
ACCGACATCG GCGTCGCCTT CTCGGTGGAC GACTTCGGAG TCGGCGCCTC CTCCCTTTCC
CGGATCAAGG AACTCCCCAT CGCCAAGGTC AAGATCGACC GCAGCTTCAT CAGGGACATC
CTCACCGAGC CAAACGACCT CGACGTGGTC TCCGCGGTCA TCTGCATGTC GCACAGCCTC
AAGATGCGGG TCAACGCGGT CGGGGTCGAG TCCGCCGAGC AACTCGCCTT GGTCGGCAGT
TTCGGCTGCG ACGAGGTGCA GGGGGACCTG ATCAGCAGGC CGCTTCCCCC GCACGAGTTC
GAGCGCCTGT TGGTGGAGGG AGGCGTTCAT GCCGAATTCT GA
 
Protein sequence
MEPLNRPLKD ISLLLVEDEV EARDMLERML GLNYQGVKIY TADNGVSGLE VFRRLRPEIV 
VTDINMPQMN GIAMARGIKE LEPEATIVAV TAHSETSYLL SAIEIGMDHY VLKPVNYPEL
FHVLDRIAEK IMLKRLVSEQ VERIRRREQQ LSQAERITHL GSWEWNLESG ETVWSDELYR
ILGLEPESVA ATFESLLRMV HPADRETMEK AVQQLRETGS ALGIQYYRLL RADGETRIVR
GQLESVRDGS GRQTVIGACH DVTELKWAEA ALRASERRFS KIFQATPDLL SITSLKDGTI
LEVNEAFLRV LGYQRQEVIG TYAGELGLWA DSDQPAALLQ QLEERGEVRD IEVRLKGRNG
RELEGLVSAE LIEINGNPYL LTLFKDISER KRLEEDRARL AAIVESSDDA IMAVDREGAI
TSWNAGAEKM FGYCAQDMKG VHLSFLVSPE KKEQVTCLND GIRSDGAVTH CEVVHVTRGG
SQIYVSLTMS PIKSADGGIV GTSCIARDVT ERTRMEEIIK HQAQHDTLTD LPNRKLFGDF
LNLELAQARR NRKSLAVLFM DLDHFKKVND SLGHAAGDRL LQAVAQRLKR CVRESDTVAR
ISGDEFNVLM PDLTQTDDVG IVVGKIMGVF RTPFVLEGAE LKVTTSVGVS MFPDDGDSAQ
ELLQKADGAM YVAKQTPGNS YQFYNGEINA RTVARQNMER RLREAVSKDE LDLVYQPLLS
LESGEIVGAE ALLRWHHPEH GLLLPAQFLP VAEETGAIVP IGEWVLFNAC RQMRLWQSQG
FDLSVAVNLS NREFHQPNFI DLAMRALSQS GLDPASLELD VTEKAIMDNP AFSLRNMRRL
TDIGVAFSVD DFGVGASSLS RIKELPIAKV KIDRSFIRDI LTEPNDLDVV SAVICMSHSL
KMRVNAVGVE SAEQLALVGS FGCDEVQGDL ISRPLPPHEF ERLLVEGGVH AEF