Gene GM21_0106 details

Gene Information

Locus tag: GM21_0106
Symbol:
ID: 8135409
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 128572
End bp: 131472
Gene Length: 2901 bp
Protein Length: 966 aa
Translation table: 11
GC content: 68%
IMG OID: 644867726
Product: molybdopterin oxidoreductase Fe4S4 region
Protein accession: YP_003019950
Protein GI: 253698761
COG category: [C] Energy production and conversion
COG ID: [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
TIGRFAM ID:
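
The start, end, gene length, and protein length fields above agree with one another if the coordinates are read as 1-based and inclusive and the final codon is a stop codon. The following is a minimal sketch of that arithmetic. The function names are illustrative and do not belong to any database API, and the coordinate convention is an assumption that happens to match the numbers in this record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(128572, 131472)
print(length)                  # 2901
print(protein_length(length))  # 966
```

Both results match the Gene Length (2901 bp) and Protein Length (966 aa) fields of this record.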


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value: 0.00863778
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCCGGATA TGTCGAGACG GACATTTTTG TGGGTCACCG GAGGGAGCAG CATCGCCCTT 
GCCACCGATC CCCCGCGAAA GCTGGTCAAC AAGCTGATCC CAAAGGTGAT CCCACCGGAG
AACATCCGTC CGGGTTCCTG GACCATCTTC GCCACAACCT GCCGCGAGTG CCCGGCCGGA
TGCGGGATGC ACCTCTCCCA CCGCGACGGA CGGGTGACCA AGGCGGAGGG GAATCCTGCG
CATCCGGTGA ACCGGGGCGC CCTCTGCCCG CGGGGGCAGT CGGCGCCGCA GGGACTCTAC
GACCCGGACC GCCTCCGGCA GGTCCTCTAC CGCGGCGGCG GCGCCTCCCG GCCGAGCGAT
TGGCAGGACG CCCTCTCCGC CATCTCCACG CGCCTTATCT CCGGCGGGCG CGCCGTCATT
CTTTCCAGCC TCCAGACCGG CGCGCTGGCC GAGGTGATGG CGGGCTTCGC CTCCGCTTTC
CGCGGGGAAC TCCTGTTTTA CGAGGCGTTC AACTACGAGC CGATGCGGGC CGCACACCAA
GAGCTCTTCG GCCTGCCGGT GGTGCCGCAC CACGACCTAG AAAATAGCGA CTACATCCTG
AGCTTCGCCG CCGACTTCCT GGAAACCTGG GTGTCGCCGG TTTCCTACGC GCGCCAGTTC
GCCGACATGC ACGGTTTCCG GCAAAGCGAG GCGCAGAATC GCATGGTCTA CGTCGGGCCC
AGGCTTTCCA TGACCGCCTC AAACGCCGAC AGCTTCATCC AGGTCCCTCC CGGGCAGGAA
CGGCTGGTGG CGGCGACGCT TTTGAAGCTC GTCATCGAGC GCGGCTGGCA GAAAAACGAC
CTCACCAAAT TCAGCGGGGC GCTCGATCGG ATGCTGATGG CAGCGGGCCA GGTTCCCGCC
ATAGCCCAGG CGACGCTCTT GCAGGTGGCG CAGTCGTTCG CCAACGCCCG GGCCCCCCTG
GCGCTCGCCG GCCCGTCGGC GGCTACAGGA GCTGTTGCGA CCGACACGGC GCTCTGCGTG
GCCCTTTTGA ATTATGCGGT CGGAGCCGTC GGGAAGACGG TGGATTTTTC CCGCCCGCAT
GCCTTGAGCC GCACCGCGCG CGAAGCGGAA GTTTTCTCCC TCCTCGGGTC GCTGGGAGCC
AACGACGTCC TGTTCGTGCA CGACACCAAC CCCGCCTACA GCCGAAACGG AGCGGCGGCG
CAGCTTAAGC GTGCCGGGAC CGTGGTCTAC CTTGGAACCA TGCCGGACGA GACGGCGCAA
CTCTCCGACT GGGTGCTCCC CATCGACTCC CCGCTGGAGT CATGGGGGGA ATACGAACCC
GAACCGGGAG TCCGCGGCCT GATGCAGCCC GGGATGGGAC GCATCCACGA CACCCGCGGC
GCGGGAGACC TCTTCCTGAA GCTGGCCCGA CTGGCGCGAC GCCCCCTTTC CCGCGAGGGG
AGCGCCGAGC CCCCAGCCGA TTTCGCCTCC TGGCTCAAAG CCGGATGGAG CGCCCCCGGT
GGGGAAGCAT CCTGGACCGG TGCGCTCAGA ACGGGGGGAG ACTGGAGTAC CGGCCCGAAG
TCCGCGGCGG CGCAACCGGC CCTGCAGGTG AAGGGCGGAC TTCTTTTCGC GGCGGCGGGG
GTGACGCCGC TTCCCAAGCC CGATCAGGCC GAACTCTGGG CCTGGCCTTC CATCATGCTC
TACGACGGCC GCTTAGCCAA CCGCGGCTGG CTCCAGGAGG CGCCCGACCC GGTCTCCTTC
GTGGTCTGGG GCAACTGGGT GGACGTAAAC CCGCGCCAGG CCGAATCCCT CGGCATCGAG
GAGGGGGAGA TGGTCCAGAT CTCCACTGCG ACCGGCTCCC TGCGCGCCCC CGCCCGGATC
ACCGAGGAGG TCGGCCCGCA GACGGTGGCC GTAGGGCTGG GACATGGGCA CACGGCGCTG
GGGAAGACCG CCAAGGGGAT CGGGGCCAAC GCCTTCGTGC TTCTTGGGGG GGTGTACAGC
GGCTCGACCT TCGCTTCCTG CCGCATCGCG AAAGTCCCCG GCGGCGCCGG CGATCTCATG
ACCGCCACCG CCCCCACCCG CGACCAGTTG CACCGCGAGC TGCTGCAGGC GGTGCCGGCC
TCCGAGCTGC GCGTCATGAA GCCGGGGGAG GGGGACCGCC TCGACCTTCC CCTTGCCGAG
GGGTACCGCC CGGAGGAGGA CATGTACCCG GCGCACGAGC ACAAGAAGCA CCGCTGGGGG
ATGGCGATCG ACCTGCAGCG CTGCATCGGC TGCGGGGCCT GCGCCGTCGC CTGCTACGCC
GAGAACAACA TCCCCGTGAT CGGCAAGGAG CAGGTCGGGG GGGGGCGCGA GATGGCCTGG
CTCAGGGTCC CCCCCTACCG GATGCCCGGG GACCGGCTTC GTTACGCCTG GCTCCCACTG
CACTGCCAGC ACTGCGACGC CGCCCCCTGC GAACCGGTCT GCCCGGTCTT CGCCGCCGTC
CACAGCGAGG AGGGGCTGAA CGCCCAGATC TACAACCGCT GCATCGGCAC CCGCTACTGT
TCCAACAACT GCCCCTACAA GGTGCGTCGG TTCAACTGGC TCAACGTGCA GTGGCGCAAG
CCGCTCGACC TGCAGCTGAA CCCGGAGGTG ACGGTCAGGA CGCGCGGCGT GATGGAGAAA
TGCACCTTCT GCGTGCAGCG CATCCGCCAG GCTGAGTACC GCGCCTCGCG GGAGAGGCGT
CAGCTTCAGG ACGGGGAGAT CGTCCCAGCC TGCGCCCAGA CCTGCCCCAC CGGGGTCTTC
ACCTTCGGCG ACCTCCTGGA CCCCGACTCG CGGGTGTCGA GGATCGCCGC GACTGAGCCG
CGCCGCTACC AACTGCTGCA CGAGCTGCAC ACCAAACCGG CGGTGACCTT CCTGCGCAGG
GTGGAGGTGG AGCGTGGCTG A
 
Protein sequence
MPDMSRRTFL WVTGGSSIAL ATDPPRKLVN KLIPKVIPPE NIRPGSWTIF ATTCRECPAG 
CGMHLSHRDG RVTKAEGNPA HPVNRGALCP RGQSAPQGLY DPDRLRQVLY RGGGASRPSD
WQDALSAIST RLISGGRAVI LSSLQTGALA EVMAGFASAF RGELLFYEAF NYEPMRAAHQ
ELFGLPVVPH HDLENSDYIL SFAADFLETW VSPVSYARQF ADMHGFRQSE AQNRMVYVGP
RLSMTASNAD SFIQVPPGQE RLVAATLLKL VIERGWQKND LTKFSGALDR MLMAAGQVPA
IAQATLLQVA QSFANARAPL ALAGPSAATG AVATDTALCV ALLNYAVGAV GKTVDFSRPH
ALSRTAREAE VFSLLGSLGA NDVLFVHDTN PAYSRNGAAA QLKRAGTVVY LGTMPDETAQ
LSDWVLPIDS PLESWGEYEP EPGVRGLMQP GMGRIHDTRG AGDLFLKLAR LARRPLSREG
SAEPPADFAS WLKAGWSAPG GEASWTGALR TGGDWSTGPK SAAAQPALQV KGGLLFAAAG
VTPLPKPDQA ELWAWPSIML YDGRLANRGW LQEAPDPVSF VVWGNWVDVN PRQAESLGIE
EGEMVQISTA TGSLRAPARI TEEVGPQTVA VGLGHGHTAL GKTAKGIGAN AFVLLGGVYS
GSTFASCRIA KVPGGAGDLM TATAPTRDQL HRELLQAVPA SELRVMKPGE GDRLDLPLAE
GYRPEEDMYP AHEHKKHRWG MAIDLQRCIG CGACAVACYA ENNIPVIGKE QVGGGREMAW
LRVPPYRMPG DRLRYAWLPL HCQHCDAAPC EPVCPVFAAV HSEEGLNAQI YNRCIGTRYC
SNNCPYKVRR FNWLNVQWRK PLDLQLNPEV TVRTRGVMEK CTFCVQRIRQ AEYRASRERR
QLQDGEIVPA CAQTCPTGVF TFGDLLDPDS RVSRIAATEP RRYQLLHELH TKPAVTFLRR
VEVERG
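
Both sequences above are displayed in space-separated blocks of 10 characters, so the whitespace has to be removed before any length or composition check. The sketch below joins the blocks and computes a GC fraction that could be compared against the GC content field. The helper names `join_blocks` and `gc_fraction` are hypothetical, and the example runs only on the first two blocks of the gene sequence, not on the full record.

```python
def join_blocks(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence shown above.
first_blocks = "ATGCCGGATA TGTCGAGACG"
seq = join_blocks(first_blocks)
print(seq)               # ATGCCGGATATGTCGAGACG
print(gc_fraction(seq))  # 0.55
```

Applying `join_blocks` to the complete gene sequence and taking `len()` of the result gives a value that can be checked against the Gene Length field.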