Gene Information |
Locus tag | GM21_1806 |
Symbol | |
ID | 8137137 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 2103883 |
End bp | 2106897 |
Gene Length | 3015 bp |
Protein Length | 1004 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644869418 |
Product | delta-1-pyrroline-5-carboxylate dehydrogenase |
Protein accession | YP_003021618 |
Protein GI | 253700429 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR01237] delta-1-pyrroline-5-carboxylate dehydrogenase, group 2, putative |
|
|
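The coordinates, gene length, and protein length listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (the gene sequence in this record ends in TAA); the function names `span_length` and `expected_protein_length` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table of this record.
START_BP = 2103883
END_BP = 2106897
GENE_LENGTH_BP = 3015
PROTEIN_LENGTH_AA = 1004


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon.

    Assumes a complete, unspliced CDS whose length is a multiple of 3.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # compare with Gene Length
    print(expected_protein_length(GENE_LENGTH_BP))  # compare with Protein Length
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields of the record.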
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 0.00285483 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAATAACA GCGAACTGAA TAATCGAGTC GTGGCTAGGG GCAAGGAGTT CTTCTCGACC ATCTCCGGCG AAAAGCCGTC CCTCTTTAAT AAGGGCGCCT GGATGGGCAA GGTCATGGAT TGGAGCATGC AGAACGAGAC CTTCAAGATC CAGATGTTCC GCTTCGTCGA CGTCTTCCCG TCCCTCACCA CGGGGAAGCT GCTGACGGAC CATATCCGCG AATACTTCGG CGAGGAGAAG GACATGCCTC CGGTCCTATC CACCGGCGCC AAGGTCGCCG GCATGCTCGG CTCCTTCGGC GGCGCGGTTC TCAACAAGTT CCTCACCACG AACATCCAGG AGATGGCGAG ACAGTTCATC GTCGGGGAGA ACACCAAGGA AGCGATCAAG AACATGGAGC GCCTGAGAAA GGACGGCTTC GCCTTCGTCG TGGACGTGCT GGGTGAGGCG ACGCTTTCGG AGAAGGAAGC GGACATCTAC ATGAACACGT ACCTTGAGCT TCTGGACTCG CTCAAGAAGG AATACAAGGA TTGGAAATCG CTGCCGGGCA GGGGGGGGGA GGCGAGCCTC GATTGGGGGC ATGCTCCCAA GGTGAACGTC GCGGTGAAGC CGACGGCGCT TTTCTGCCTC GCCAACCCGC AGGACTTCGA GGGTTCGGTC GTCGCCATCC TGGAACGCGT GCGCAAGATC GCGGTGAAGG TCATGGAGCT TAACGGCTTC CTCTGCATCG ACATGGAGAC CTACCGCCAC AAGGACATCA TCATCGAGGT CTACAAACGG CTGAAGCTCG AGTACCCCGC ATACCACCAC TTCGGTCTAG TGCTCCAGGC CTACCTGGTG GACACCGACA AGGATCTCCC GGAACTCCTC TCCTGGGCCC GCCAGAACAA GGTGCAGATC TCGATCCGTC TCGTCAAAGG GGCGTACTGG GACTACGAGA CCGTGAAGGC CAAGCAGAAC GATTGGAAGG TGCCGGTTTG GACCATCAAG GCGGAGTCGG ACGCGGCTTA CGAGCGTCAG TCCAGGATGA TCCTGGAAAA CGCCGATATC TGCCACTTCG CCTGCGCCTC GCACAACATC AGGACCATCT CGGCGGTGAT GGAGATGGCC AAAGAGTTGA ACGTGCCGGA CGAGCGGTAC GAGTTCCAGG TGCTTTACGG GATGGCGGAG CCGGTCCGCA AGGGGATCCT GAAGGTTGCC GGCCGCATCC GCCTCTACTG CCCCTACGGC GACATGGTCC CGGGGATGGG ATACCTGGTC CGGCGCCTGC TGGAGAACAC GGCGAACGAA TCCTTCCTGC GCCAGAGCTT CGCCGAGGAC GCTCAGATCG AGAAGCTCCT GGAGGATCCG GCGGTGACCG CCGAGCGCGA GCGCGCAGCG CGCGCCGCGA AGCACAAGGC TGAGGCTAAG GGGCCCGGCA ACTTGGCGCC GTTCAACAAC GAGGCGATGG TCGACTTCAC CAGGGCCGAC CACCGCGCCG CGTTCCCGAA ACAGATCGCG GAGGTGAGGA AGCAGTTGGG CAGGACCTAC CCGCTCTTCG TCAACGGGAA GGAAGTGAAG ACCGGCGACA CCATCGCCTC GGTCAACCCG AACAACCCTT CGGAAACGGT CGGCATCGTC TGCCAGGCGG GGACCAAGGA AGTGGGCGAC GCCATCGCCG CCGCCAAGGG TGCCTTCCCT GCATGGCGCG ACACCGACAT CAAGGTCCGC GCCGAGTACC TCGTCAAGGC TGCCGAGGTC GCCCGCCGCA GGATCTTCGA GCTCTCCGCA TGGCAGGTGC TCGAGATCGG GAAGCAGTGG 
GATCAGGCCT ACGCCGACGT CTGCGAGGCG ATCGACTTCC TCGAGTACTA CGCCCGCGAG ATGATCGCCC TGGGAACCCC CAAGCGCATC GGCCACGCAC CGGGCGAATT GAACCACTAT TTCTACGAGC CCAAAGGGGT GGCCGCGGTC ATCTCCCCCT GGAACTTCCC GCTGGCGATC AGCATGGGGA TGGTGTCGGC GGCGATCGTC GCCGGCAACT GCGTGGTCTT CAAGCCTTCC GGCCTCACCT CGGTGATCGG CTACCATATC GTCGAGCTCT TCCGCGAAGT CGGCCTTCCA GACGGTGTCT TCAACTACAC CCCTGGCCGC GGCTCGGTGA TGGGAGACTA CCTGGTCGAC AGCCCGGACA TCAGCCTGAT CGCCTTCACC GGATCCATGG AAGTGGGCCT TCGCATCATC GAGCGCGCCG CTAAGGTGCA CCCAGGCCAG GAGAACGTGA AGAAGATCGT CTGCGAGATG GGGGGCAAGA ACGGCATCAT CATCGACGAC GACGCCGACC TGGACGAGGC GGTACCGCAC GTGCTCTACT CCGCATTCGG CTTCCAGGGG CAGAAGTGCT CCGCATGCTC GCGCGTCATC GTCCTCGATG CGGTCTACGA CAAGTTCGTT GAGCGCCTGG TCTCCATGGC GCAGGCGACC CTGGTCGGCC CCTCCGAGGA CCCGGCGCAC TACATGGGTG CCGTCGCCGA CGACAAGGCG ATGAAGACCA TCAAGGAGTA CGCGGAGATC GGCAAGAAGG AGGGGCAACT CCTTTACGAG AGCAAGGTTC CGGGCGACGG CGGCTACTAC GTCCCGATGA CCATCATCGG CGGCATCAAG CCCGAGCACA GGATCGCGCA GGAGGAGATC TTCGGACCGG TCCTGGCCGT GATGCGCGCC AAGGACTTCG ACCAGGCCAT CGCCTGGGCC AATTCCACCA AGTTCGCCCT GACCGGCGGC GTCTTCTCCA GAAGCCCCGA GCATCTGGCG CAGGCCCGGA AAGAGTTCCG GGTCGGCAAC CTGTACTTAA ACCGCAACAA CACCGGCGCC CTCGTCGGCC GCCAGCCCTT CGGCGGCTCC AAGATGTCCG GCGTCGGCAC CAAGGCGGGG GGGCACGACT ACCTGCTGCA CTTCATGGAC CCCAGGGTCG TCACCGAGAA CACCATGCGC CGCGGCTTCG CGCCGGTCGA GGAGGACGAC GACTGGGTTG CGTAA
|
Protein sequence | MNNSELNNRV VARGKEFFST ISGEKPSLFN KGAWMGKVMD WSMQNETFKI QMFRFVDVFP SLTTGKLLTD HIREYFGEEK DMPPVLSTGA KVAGMLGSFG GAVLNKFLTT NIQEMARQFI VGENTKEAIK NMERLRKDGF AFVVDVLGEA TLSEKEADIY MNTYLELLDS LKKEYKDWKS LPGRGGEASL DWGHAPKVNV AVKPTALFCL ANPQDFEGSV VAILERVRKI AVKVMELNGF LCIDMETYRH KDIIIEVYKR LKLEYPAYHH FGLVLQAYLV DTDKDLPELL SWARQNKVQI SIRLVKGAYW DYETVKAKQN DWKVPVWTIK AESDAAYERQ SRMILENADI CHFACASHNI RTISAVMEMA KELNVPDERY EFQVLYGMAE PVRKGILKVA GRIRLYCPYG DMVPGMGYLV RRLLENTANE SFLRQSFAED AQIEKLLEDP AVTAERERAA RAAKHKAEAK GPGNLAPFNN EAMVDFTRAD HRAAFPKQIA EVRKQLGRTY PLFVNGKEVK TGDTIASVNP NNPSETVGIV CQAGTKEVGD AIAAAKGAFP AWRDTDIKVR AEYLVKAAEV ARRRIFELSA WQVLEIGKQW DQAYADVCEA IDFLEYYARE MIALGTPKRI GHAPGELNHY FYEPKGVAAV ISPWNFPLAI SMGMVSAAIV AGNCVVFKPS GLTSVIGYHI VELFREVGLP DGVFNYTPGR GSVMGDYLVD SPDISLIAFT GSMEVGLRII ERAAKVHPGQ ENVKKIVCEM GGKNGIIIDD DADLDEAVPH VLYSAFGFQG QKCSACSRVI VLDAVYDKFV ERLVSMAQAT LVGPSEDPAH YMGAVADDKA MKTIKEYAEI GKKEGQLLYE SKVPGDGGYY VPMTIIGGIK PEHRIAQEEI FGPVLAVMRA KDFDQAIAWA NSTKFALTGG VFSRSPEHLA QARKEFRVGN LYLNRNNTGA LVGRQPFGGS KMSGVGTKAG GHDYLLHFMD PRVVTENTMR RGFAPVEEDD DWVA