Gene GM21_1806 details


Gene Information

Locus tag: GM21_1806
Symbol:
ID: 8137137
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2103883
End bp: 2106897
Gene Length: 3015 bp
Protein Length: 1004 aa
Translation table: 11
GC content: 63%
IMG OID: 644869418
Product: delta-1-pyrroline-5-carboxylate dehydrogenase
Protein accession: YP_003021618
Protein GI: 253700429
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR01237] delta-1-pyrroline-5-carboxylate dehydrogenase, group 2, putative
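
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2103883
end_bp = 2106897
gene_length_bp = 3015
protein_length_aa = 1004

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # span matches gene length
print(gene_length_bp % 3 == 0)                   # whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop
```

Under these assumptions all three checks agree with the listed values.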


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value: 0.00285483
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAATAACA GCGAACTGAA TAATCGAGTC GTGGCTAGGG GCAAGGAGTT CTTCTCGACC 
ATCTCCGGCG AAAAGCCGTC CCTCTTTAAT AAGGGCGCCT GGATGGGCAA GGTCATGGAT
TGGAGCATGC AGAACGAGAC CTTCAAGATC CAGATGTTCC GCTTCGTCGA CGTCTTCCCG
TCCCTCACCA CGGGGAAGCT GCTGACGGAC CATATCCGCG AATACTTCGG CGAGGAGAAG
GACATGCCTC CGGTCCTATC CACCGGCGCC AAGGTCGCCG GCATGCTCGG CTCCTTCGGC
GGCGCGGTTC TCAACAAGTT CCTCACCACG AACATCCAGG AGATGGCGAG ACAGTTCATC
GTCGGGGAGA ACACCAAGGA AGCGATCAAG AACATGGAGC GCCTGAGAAA GGACGGCTTC
GCCTTCGTCG TGGACGTGCT GGGTGAGGCG ACGCTTTCGG AGAAGGAAGC GGACATCTAC
ATGAACACGT ACCTTGAGCT TCTGGACTCG CTCAAGAAGG AATACAAGGA TTGGAAATCG
CTGCCGGGCA GGGGGGGGGA GGCGAGCCTC GATTGGGGGC ATGCTCCCAA GGTGAACGTC
GCGGTGAAGC CGACGGCGCT TTTCTGCCTC GCCAACCCGC AGGACTTCGA GGGTTCGGTC
GTCGCCATCC TGGAACGCGT GCGCAAGATC GCGGTGAAGG TCATGGAGCT TAACGGCTTC
CTCTGCATCG ACATGGAGAC CTACCGCCAC AAGGACATCA TCATCGAGGT CTACAAACGG
CTGAAGCTCG AGTACCCCGC ATACCACCAC TTCGGTCTAG TGCTCCAGGC CTACCTGGTG
GACACCGACA AGGATCTCCC GGAACTCCTC TCCTGGGCCC GCCAGAACAA GGTGCAGATC
TCGATCCGTC TCGTCAAAGG GGCGTACTGG GACTACGAGA CCGTGAAGGC CAAGCAGAAC
GATTGGAAGG TGCCGGTTTG GACCATCAAG GCGGAGTCGG ACGCGGCTTA CGAGCGTCAG
TCCAGGATGA TCCTGGAAAA CGCCGATATC TGCCACTTCG CCTGCGCCTC GCACAACATC
AGGACCATCT CGGCGGTGAT GGAGATGGCC AAAGAGTTGA ACGTGCCGGA CGAGCGGTAC
GAGTTCCAGG TGCTTTACGG GATGGCGGAG CCGGTCCGCA AGGGGATCCT GAAGGTTGCC
GGCCGCATCC GCCTCTACTG CCCCTACGGC GACATGGTCC CGGGGATGGG ATACCTGGTC
CGGCGCCTGC TGGAGAACAC GGCGAACGAA TCCTTCCTGC GCCAGAGCTT CGCCGAGGAC
GCTCAGATCG AGAAGCTCCT GGAGGATCCG GCGGTGACCG CCGAGCGCGA GCGCGCAGCG
CGCGCCGCGA AGCACAAGGC TGAGGCTAAG GGGCCCGGCA ACTTGGCGCC GTTCAACAAC
GAGGCGATGG TCGACTTCAC CAGGGCCGAC CACCGCGCCG CGTTCCCGAA ACAGATCGCG
GAGGTGAGGA AGCAGTTGGG CAGGACCTAC CCGCTCTTCG TCAACGGGAA GGAAGTGAAG
ACCGGCGACA CCATCGCCTC GGTCAACCCG AACAACCCTT CGGAAACGGT CGGCATCGTC
TGCCAGGCGG GGACCAAGGA AGTGGGCGAC GCCATCGCCG CCGCCAAGGG TGCCTTCCCT
GCATGGCGCG ACACCGACAT CAAGGTCCGC GCCGAGTACC TCGTCAAGGC TGCCGAGGTC
GCCCGCCGCA GGATCTTCGA GCTCTCCGCA TGGCAGGTGC TCGAGATCGG GAAGCAGTGG
GATCAGGCCT ACGCCGACGT CTGCGAGGCG ATCGACTTCC TCGAGTACTA CGCCCGCGAG
ATGATCGCCC TGGGAACCCC CAAGCGCATC GGCCACGCAC CGGGCGAATT GAACCACTAT
TTCTACGAGC CCAAAGGGGT GGCCGCGGTC ATCTCCCCCT GGAACTTCCC GCTGGCGATC
AGCATGGGGA TGGTGTCGGC GGCGATCGTC GCCGGCAACT GCGTGGTCTT CAAGCCTTCC
GGCCTCACCT CGGTGATCGG CTACCATATC GTCGAGCTCT TCCGCGAAGT CGGCCTTCCA
GACGGTGTCT TCAACTACAC CCCTGGCCGC GGCTCGGTGA TGGGAGACTA CCTGGTCGAC
AGCCCGGACA TCAGCCTGAT CGCCTTCACC GGATCCATGG AAGTGGGCCT TCGCATCATC
GAGCGCGCCG CTAAGGTGCA CCCAGGCCAG GAGAACGTGA AGAAGATCGT CTGCGAGATG
GGGGGCAAGA ACGGCATCAT CATCGACGAC GACGCCGACC TGGACGAGGC GGTACCGCAC
GTGCTCTACT CCGCATTCGG CTTCCAGGGG CAGAAGTGCT CCGCATGCTC GCGCGTCATC
GTCCTCGATG CGGTCTACGA CAAGTTCGTT GAGCGCCTGG TCTCCATGGC GCAGGCGACC
CTGGTCGGCC CCTCCGAGGA CCCGGCGCAC TACATGGGTG CCGTCGCCGA CGACAAGGCG
ATGAAGACCA TCAAGGAGTA CGCGGAGATC GGCAAGAAGG AGGGGCAACT CCTTTACGAG
AGCAAGGTTC CGGGCGACGG CGGCTACTAC GTCCCGATGA CCATCATCGG CGGCATCAAG
CCCGAGCACA GGATCGCGCA GGAGGAGATC TTCGGACCGG TCCTGGCCGT GATGCGCGCC
AAGGACTTCG ACCAGGCCAT CGCCTGGGCC AATTCCACCA AGTTCGCCCT GACCGGCGGC
GTCTTCTCCA GAAGCCCCGA GCATCTGGCG CAGGCCCGGA AAGAGTTCCG GGTCGGCAAC
CTGTACTTAA ACCGCAACAA CACCGGCGCC CTCGTCGGCC GCCAGCCCTT CGGCGGCTCC
AAGATGTCCG GCGTCGGCAC CAAGGCGGGG GGGCACGACT ACCTGCTGCA CTTCATGGAC
CCCAGGGTCG TCACCGAGAA CACCATGCGC CGCGGCTTCG CGCCGGTCGA GGAGGACGAC
GACTGGGTTG CGTAA
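
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any calculation such as GC content. The sketch below is one way to do that in Python; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row of the sequence is used as input, so the printed fraction describes that row and need not match the 63% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among all bases (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence, copied as printed above.
first_row = "ATGAATAACA GCGAACTGAA TAATCGAGTC GTGGCTAGGG GCAAGGAGTT CTTCTCGACC"

seq = clean_sequence(first_row)
print(len(seq), round(gc_fraction(seq), 3))
```

Passing the full sequence text to `clean_sequence` instead of `first_row` would give the gene-wide value.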
 
Protein sequence
MNNSELNNRV VARGKEFFST ISGEKPSLFN KGAWMGKVMD WSMQNETFKI QMFRFVDVFP 
SLTTGKLLTD HIREYFGEEK DMPPVLSTGA KVAGMLGSFG GAVLNKFLTT NIQEMARQFI
VGENTKEAIK NMERLRKDGF AFVVDVLGEA TLSEKEADIY MNTYLELLDS LKKEYKDWKS
LPGRGGEASL DWGHAPKVNV AVKPTALFCL ANPQDFEGSV VAILERVRKI AVKVMELNGF
LCIDMETYRH KDIIIEVYKR LKLEYPAYHH FGLVLQAYLV DTDKDLPELL SWARQNKVQI
SIRLVKGAYW DYETVKAKQN DWKVPVWTIK AESDAAYERQ SRMILENADI CHFACASHNI
RTISAVMEMA KELNVPDERY EFQVLYGMAE PVRKGILKVA GRIRLYCPYG DMVPGMGYLV
RRLLENTANE SFLRQSFAED AQIEKLLEDP AVTAERERAA RAAKHKAEAK GPGNLAPFNN
EAMVDFTRAD HRAAFPKQIA EVRKQLGRTY PLFVNGKEVK TGDTIASVNP NNPSETVGIV
CQAGTKEVGD AIAAAKGAFP AWRDTDIKVR AEYLVKAAEV ARRRIFELSA WQVLEIGKQW
DQAYADVCEA IDFLEYYARE MIALGTPKRI GHAPGELNHY FYEPKGVAAV ISPWNFPLAI
SMGMVSAAIV AGNCVVFKPS GLTSVIGYHI VELFREVGLP DGVFNYTPGR GSVMGDYLVD
SPDISLIAFT GSMEVGLRII ERAAKVHPGQ ENVKKIVCEM GGKNGIIIDD DADLDEAVPH
VLYSAFGFQG QKCSACSRVI VLDAVYDKFV ERLVSMAQAT LVGPSEDPAH YMGAVADDKA
MKTIKEYAEI GKKEGQLLYE SKVPGDGGYY VPMTIIGGIK PEHRIAQEEI FGPVLAVMRA
KDFDQAIAWA NSTKFALTGG VFSRSPEHLA QARKEFRVGN LYLNRNNTGA LVGRQPFGGS
KMSGVGTKAG GHDYLLHFMD PRVVTENTMR RGFAPVEEDD DWVA
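
The protein sequence can be compared with a translation of the gene sequence under translation table 11. The following Python sketch builds the codon table from the amino-acid line of NCBI table 11 (its amino-acid assignments match the standard code; only the permitted start codons differ). The function name `translate` is illustrative, and only the first and last blocks of the gene are translated here rather than the full sequence.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon.

    Whitespace is ignored; a trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and the final five codons of the gene sequence.
print(translate("ATGAATAACA GCGAACTGAA TAATCGAGTC"))
print(translate("GACTGGGTTG CGTAA"))
```

The two fragments translate to the first ten residues and the last four residues of the protein sequence listed above.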