Gene Information |
Locus tag | GM21_0106 |
Symbol | |
ID | 8135409 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 128572 |
End bp | 131472 |
Gene Length | 2901 bp |
Protein Length | 966 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644867726 |
Product | molybdopterin oxidoreductase Fe4S4 region |
Protein accession | YP_003019950 |
Protein GI | 253698761 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | |
| |
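The Start bp, End bp, Gene Length, and Protein Length rows above are tied together by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information rows above.
START_BP = 128572
END_BP = 131472


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the terminal stop codon


gene_len = gene_length_bp(START_BP, END_BP)
print(gene_len, protein_length_aa(gene_len))  # 2901 966
```

Under these assumptions the result agrees with the listed Gene Length (2901 bp) and Protein Length (966 aa).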
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 0.00863778 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCGGATA TGTCGAGACG GACATTTTTG TGGGTCACCG GAGGGAGCAG CATCGCCCTT GCCACCGATC CCCCGCGAAA GCTGGTCAAC AAGCTGATCC CAAAGGTGAT CCCACCGGAG AACATCCGTC CGGGTTCCTG GACCATCTTC GCCACAACCT GCCGCGAGTG CCCGGCCGGA TGCGGGATGC ACCTCTCCCA CCGCGACGGA CGGGTGACCA AGGCGGAGGG GAATCCTGCG CATCCGGTGA ACCGGGGCGC CCTCTGCCCG CGGGGGCAGT CGGCGCCGCA GGGACTCTAC GACCCGGACC GCCTCCGGCA GGTCCTCTAC CGCGGCGGCG GCGCCTCCCG GCCGAGCGAT TGGCAGGACG CCCTCTCCGC CATCTCCACG CGCCTTATCT CCGGCGGGCG CGCCGTCATT CTTTCCAGCC TCCAGACCGG CGCGCTGGCC GAGGTGATGG CGGGCTTCGC CTCCGCTTTC CGCGGGGAAC TCCTGTTTTA CGAGGCGTTC AACTACGAGC CGATGCGGGC CGCACACCAA GAGCTCTTCG GCCTGCCGGT GGTGCCGCAC CACGACCTAG AAAATAGCGA CTACATCCTG AGCTTCGCCG CCGACTTCCT GGAAACCTGG GTGTCGCCGG TTTCCTACGC GCGCCAGTTC GCCGACATGC ACGGTTTCCG GCAAAGCGAG GCGCAGAATC GCATGGTCTA CGTCGGGCCC AGGCTTTCCA TGACCGCCTC AAACGCCGAC AGCTTCATCC AGGTCCCTCC CGGGCAGGAA CGGCTGGTGG CGGCGACGCT TTTGAAGCTC GTCATCGAGC GCGGCTGGCA GAAAAACGAC CTCACCAAAT TCAGCGGGGC GCTCGATCGG ATGCTGATGG CAGCGGGCCA GGTTCCCGCC ATAGCCCAGG CGACGCTCTT GCAGGTGGCG CAGTCGTTCG CCAACGCCCG GGCCCCCCTG GCGCTCGCCG GCCCGTCGGC GGCTACAGGA GCTGTTGCGA CCGACACGGC GCTCTGCGTG GCCCTTTTGA ATTATGCGGT CGGAGCCGTC GGGAAGACGG TGGATTTTTC CCGCCCGCAT GCCTTGAGCC GCACCGCGCG CGAAGCGGAA GTTTTCTCCC TCCTCGGGTC GCTGGGAGCC AACGACGTCC TGTTCGTGCA CGACACCAAC CCCGCCTACA GCCGAAACGG AGCGGCGGCG CAGCTTAAGC GTGCCGGGAC CGTGGTCTAC CTTGGAACCA TGCCGGACGA GACGGCGCAA CTCTCCGACT GGGTGCTCCC CATCGACTCC CCGCTGGAGT CATGGGGGGA ATACGAACCC GAACCGGGAG TCCGCGGCCT GATGCAGCCC GGGATGGGAC GCATCCACGA CACCCGCGGC GCGGGAGACC TCTTCCTGAA GCTGGCCCGA CTGGCGCGAC GCCCCCTTTC CCGCGAGGGG AGCGCCGAGC CCCCAGCCGA TTTCGCCTCC TGGCTCAAAG CCGGATGGAG CGCCCCCGGT GGGGAAGCAT CCTGGACCGG TGCGCTCAGA ACGGGGGGAG ACTGGAGTAC CGGCCCGAAG TCCGCGGCGG CGCAACCGGC CCTGCAGGTG AAGGGCGGAC TTCTTTTCGC GGCGGCGGGG GTGACGCCGC TTCCCAAGCC CGATCAGGCC GAACTCTGGG CCTGGCCTTC CATCATGCTC TACGACGGCC GCTTAGCCAA CCGCGGCTGG CTCCAGGAGG CGCCCGACCC GGTCTCCTTC GTGGTCTGGG GCAACTGGGT GGACGTAAAC CCGCGCCAGG CCGAATCCCT CGGCATCGAG 
GAGGGGGAGA TGGTCCAGAT CTCCACTGCG ACCGGCTCCC TGCGCGCCCC CGCCCGGATC ACCGAGGAGG TCGGCCCGCA GACGGTGGCC GTAGGGCTGG GACATGGGCA CACGGCGCTG GGGAAGACCG CCAAGGGGAT CGGGGCCAAC GCCTTCGTGC TTCTTGGGGG GGTGTACAGC GGCTCGACCT TCGCTTCCTG CCGCATCGCG AAAGTCCCCG GCGGCGCCGG CGATCTCATG ACCGCCACCG CCCCCACCCG CGACCAGTTG CACCGCGAGC TGCTGCAGGC GGTGCCGGCC TCCGAGCTGC GCGTCATGAA GCCGGGGGAG GGGGACCGCC TCGACCTTCC CCTTGCCGAG GGGTACCGCC CGGAGGAGGA CATGTACCCG GCGCACGAGC ACAAGAAGCA CCGCTGGGGG ATGGCGATCG ACCTGCAGCG CTGCATCGGC TGCGGGGCCT GCGCCGTCGC CTGCTACGCC GAGAACAACA TCCCCGTGAT CGGCAAGGAG CAGGTCGGGG GGGGGCGCGA GATGGCCTGG CTCAGGGTCC CCCCCTACCG GATGCCCGGG GACCGGCTTC GTTACGCCTG GCTCCCACTG CACTGCCAGC ACTGCGACGC CGCCCCCTGC GAACCGGTCT GCCCGGTCTT CGCCGCCGTC CACAGCGAGG AGGGGCTGAA CGCCCAGATC TACAACCGCT GCATCGGCAC CCGCTACTGT TCCAACAACT GCCCCTACAA GGTGCGTCGG TTCAACTGGC TCAACGTGCA GTGGCGCAAG CCGCTCGACC TGCAGCTGAA CCCGGAGGTG ACGGTCAGGA CGCGCGGCGT GATGGAGAAA TGCACCTTCT GCGTGCAGCG CATCCGCCAG GCTGAGTACC GCGCCTCGCG GGAGAGGCGT CAGCTTCAGG ACGGGGAGAT CGTCCCAGCC TGCGCCCAGA CCTGCCCCAC CGGGGTCTTC ACCTTCGGCG ACCTCCTGGA CCCCGACTCG CGGGTGTCGA GGATCGCCGC GACTGAGCCG CGCCGCTACC AACTGCTGCA CGAGCTGCAC ACCAAACCGG CGGTGACCTT CCTGCGCAGG GTGGAGGTGG AGCGTGGCTG A
Protein sequence | MPDMSRRTFL WVTGGSSIAL ATDPPRKLVN KLIPKVIPPE NIRPGSWTIF ATTCRECPAG CGMHLSHRDG RVTKAEGNPA HPVNRGALCP RGQSAPQGLY DPDRLRQVLY RGGGASRPSD WQDALSAIST RLISGGRAVI LSSLQTGALA EVMAGFASAF RGELLFYEAF NYEPMRAAHQ ELFGLPVVPH HDLENSDYIL SFAADFLETW VSPVSYARQF ADMHGFRQSE AQNRMVYVGP RLSMTASNAD SFIQVPPGQE RLVAATLLKL VIERGWQKND LTKFSGALDR MLMAAGQVPA IAQATLLQVA QSFANARAPL ALAGPSAATG AVATDTALCV ALLNYAVGAV GKTVDFSRPH ALSRTAREAE VFSLLGSLGA NDVLFVHDTN PAYSRNGAAA QLKRAGTVVY LGTMPDETAQ LSDWVLPIDS PLESWGEYEP EPGVRGLMQP GMGRIHDTRG AGDLFLKLAR LARRPLSREG SAEPPADFAS WLKAGWSAPG GEASWTGALR TGGDWSTGPK SAAAQPALQV KGGLLFAAAG VTPLPKPDQA ELWAWPSIML YDGRLANRGW LQEAPDPVSF VVWGNWVDVN PRQAESLGIE EGEMVQISTA TGSLRAPARI TEEVGPQTVA VGLGHGHTAL GKTAKGIGAN AFVLLGGVYS GSTFASCRIA KVPGGAGDLM TATAPTRDQL HRELLQAVPA SELRVMKPGE GDRLDLPLAE GYRPEEDMYP AHEHKKHRWG MAIDLQRCIG CGACAVACYA ENNIPVIGKE QVGGGREMAW LRVPPYRMPG DRLRYAWLPL HCQHCDAAPC EPVCPVFAAV HSEEGLNAQI YNRCIGTRYC SNNCPYKVRR FNWLNVQWRK PLDLQLNPEV TVRTRGVMEK CTFCVQRIRQ AEYRASRERR QLQDGEIVPA CAQTCPTGVF TFGDLLDPDS RVSRIAATEP RRYQLLHELH TKPAVTFLRR VEVERG
| |
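The Gene sequence and Protein sequence rows can be cross-checked against each other. The sketch below is one way to do that in plain Python: it strips the 10-character grouping, computes a GC fraction, and translates codons. It assumes the codon assignments of translation table 11 match the standard code (the two tables differ in permitted start codons, which this sketch ignores) and that the Gene sequence row is already in coding orientation, which seems likely since it begins with ATG even though the strand is "-". All function names are hypothetical helpers.

```python
# Standard codon assignments, ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace grouping used in the record and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons in frame 0; '*' marks a stop codon."""
    s = clean(seq)
    usable = len(s) - len(s) % 3
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))


# First 30 and last 21 nucleotides copied from the Gene sequence row.
first_30 = "ATGCCGGATA TGTCGAGACG GACATTTTTG"
last_21 = "GTGGAGGTGG AGCGTGGCTG A"

print(translate(first_30))  # MPDMSRRTFL
print(translate(last_21))   # VEVERG*
```

The two printed fragments match the first ten and last six residues of the Protein sequence row. Running `gc_fraction` on the full Gene sequence would give a value to compare with the GC content row.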