Gene Dgeo_0850 details


Gene Information

Locus tag: Dgeo_0850
Symbol:
ID: 4057969
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Deinococcus geothermalis DSM 11300
Kingdom: Bacteria
Replicon accession: NC_008025
Strand:
Start bp: 908882
End bp: 910453
Gene Length: 1572 bp
Protein Length: 523 aa
Translation table: 11
GC content: 65%
IMG OID: 641229870
Product: 1-pyrroline-5-carboxylate dehydrogenase
Protein accession: YP_604321
Protein GI: 94984957
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR01237] delta-1-pyrroline-5-carboxylate dehydrogenase, group 2, putative
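
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(908882, 910453)
print(length_bp)                  # 1572
print(protein_length(length_bp))  # 523
```

Both results match the Gene Length and Protein Length fields of this record.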


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.154073
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0551454
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCAAAG TCCAGGACTA CCGCCCGCAG CCCTTCACCG ACTTCACCAA CCCGGAAAAT 
GTCGCTGCTT ACCAAGCCGC GCTTGAGAAG GTTCGCGCCG AGCTGGTCGG CAAGCATTAC
CCCCTCATCA TCAACGGCGA GCGGGTGGAT ACGGCGGAGC GGCTCACCTC CATCAACCCC
TGCGACACCT CGGAAGTCAT CGGCACAACG GCAAAGGCCA CCATTGAGGA CGCGCAGCGC
GCCCTGGAAG GCGCGTGGGA AGCTTTCGAG ACGTGGAAAG CGTGGGATAT GGACGCCCGC
GCCCGCATTC TGCTCAAGGC CGCCGCGATC CTCAAGCGCC GCCGCCTAGA AGCCTGCGCG
CTGATGACGC TGGAGGTCGG CAAAAACTAT GCCGAGGCCG ACGTGGAGGT CGCGGAGGCG
ATTGACTTCC TGGAGTACTA CGCCCGCTCC GCCATGAAGT ACGCAGGCTT CGGCGCCGCC
GAGACGACCT GGTTTGAGGG CGAGGAAAAC GGCCTGCTGT ACCTGCCGCT GGGGGTCGGC
GTCTCCATCT CGCCCTGGAA CTTCCCCTGC GCGATCTTCA CGGGGATGCT GGCCGCGCCG
CTGGTGGTGG GCAACTGCGT CCTGGCCAAG CCTGCCGAGG ACTCCGGCAT GATCGCGGGC
TTTATGGTGG ACATCCTGCT GGAAGCCGGG CTGCCCGCCG GCGTGCTGCA ATTCCTGCCC
GGCATCGGCT CGGAGGTGGG CGAGTACCTC ACCACGCACC CCAAGACGCG CTTCATCACC
TTCACCGGAA GCCGCGCGGT GGGCCTACAC ATCAACGAGG TGGCGGCCAA AATCCAGCCC
GGCCAGAAGT GGATCAAGAA GGTCGTGCTG GAACTGGGTG GCAAGGACGC GCTGATCGTG
GACGAGACGG CTGATCTGGA CGTGGCTGTG ACCGCCGCCA CCCAGAGCGC CTTTGGTTTC
AACGGGCAAA AGTGCTCGGC GATGAGCCGC CTGATCGTGC TGGACGAGGT ATACGACACG
GTGGTAAATG CCTTTGTCGA ACGCGCCAAG AGCCTCAAGG TCGGCACGGG AGAAGAGAAC
GCGGCCGTGA CGGCGGTTGT GAACGAGGAG AGCTTCGAGA AGATCCGCCA GTACCTCGCC
CTCGGCAAGC AGGAAGGTCA GGTGCTGCTG GGCGGCGAGG CCCCCGGCGA GTGGGGGGGC
AAGAAGGGCT ATTACGTCCA GCCCACCATC ATTGGAGACG TGAAACCAGA AGCCCGCATC
GCCCAAGAGG AGATCTTCGG GCCCGTGGTG GCGGTGCTGC GGGCGCGCGA CTGGCAGGAC
GCGCTGCGGA TCGCCAACTC GACCGAATAC GGCCTGACCG GCGGCGTGTG CAGCCAAGAC
CGCCAGCGCC TGGAACAGGC CCGTGCCGAG TTCGAGGTCG GCAACCTCTA CTTCAACCGC
AAGATCACCG GCGCGATCGT GGGTGTGCAG CCCTTTGGGG GGTACAACAT GAGTGGCACC
GACTCCAAGG CGGGCGGCCC CGACTATCTG GCCAACTTCC TCCAGCTCAA GGCCGTGACC
GAGCGCTGGT AG
 
Protein sequence
MLKVQDYRPQ PFTDFTNPEN VAAYQAALEK VRAELVGKHY PLIINGERVD TAERLTSINP 
CDTSEVIGTT AKATIEDAQR ALEGAWEAFE TWKAWDMDAR ARILLKAAAI LKRRRLEACA
LMTLEVGKNY AEADVEVAEA IDFLEYYARS AMKYAGFGAA ETTWFEGEEN GLLYLPLGVG
VSISPWNFPC AIFTGMLAAP LVVGNCVLAK PAEDSGMIAG FMVDILLEAG LPAGVLQFLP
GIGSEVGEYL TTHPKTRFIT FTGSRAVGLH INEVAAKIQP GQKWIKKVVL ELGGKDALIV
DETADLDVAV TAATQSAFGF NGQKCSAMSR LIVLDEVYDT VVNAFVERAK SLKVGTGEEN
AAVTAVVNEE SFEKIRQYLA LGKQEGQVLL GGEAPGEWGG KKGYYVQPTI IGDVKPEARI
AQEEIFGPVV AVLRARDWQD ALRIANSTEY GLTGGVCSQD RQRLEQARAE FEVGNLYFNR
KITGAIVGVQ PFGGYNMSGT DSKAGGPDYL ANFLQLKAVT ERW
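
The gene and protein sequences above are related through translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: it assumes the sequence is pasted in the 10-base blocks shown here, that the reading frame starts at the first base, and that the start codon is ATG (true for this gene). The amino-acid assignments of table 11 are the same as in the standard code; alternative start codons, which table 11 also permits, are not handled. All names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGCTCAAAG TCCAGGACTA CCGCCCGCAG"))  # MLKVQDYRPQ
```

Passing the full gene sequence to `translate` and `gc_fraction` allows the result to be compared with the protein sequence and the GC content field of this record.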