Gene Dret_1959 details


Gene Information

Locus tag: Dret_1959
Symbol:
ID: 8419804
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 2241236
End bp: 2244241
Gene Length: 3006 bp
Protein Length: 1001 aa
Translation table: 11
GC content: 60%
IMG OID: 645038547
Product: delta-1-pyrroline-5-carboxylate dehydrogenase
Protein accession: YP_003198821
Protein GI: 258406079
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR01237] delta-1-pyrroline-5-carboxylate dehydrogenase, group 2, putative
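
The length fields above agree with the coordinates if Start bp and End bp are read as 1-based, inclusive positions and the last codon of the CDS is a stop codon that is not counted in the protein length. Both readings are assumptions; the record does not state them. A minimal sketch of that check follows, with illustrative function names that are not part of any database API:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
start_bp, end_bp = 2241236, 2244241
gene_length_bp, protein_length_aa = 3006, 1001

print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Both comparisons hold for this record: 2244241 - 2241236 + 1 = 3006, and 3006 / 3 - 1 = 1001.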


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.028462
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACATGC AGACCCTGAA CCCGAAAATC AAGGCCCGTG GCCGGGAATT CTTTCAAAGT 
ATCAGTGGGG AAGCTCCGAC TGTCTTTAAT AAGGGGTGGT GGACCGGCAA GGTCATGGAC
TGGGCGATGC GCAATGAGCA GTTCAAGATC CAATTGTTCC GCTTCGTCGA TGTTCTGCCC
TATCTCAATA CCACCGAGTC CCTGAACCGG CATATCCAGG AATACTTCGT GGCCGAGGAC
CAGGAGATCC CGGCTGTGCT CAAGTGGGGG GCCAAAGGCG CCGGAATGGG CAAGGGATTG
GCCGGCAAGG TCATGACCAA GACCATCCGC AGCAATATCG AGGGCATGGC CAAACAGTTC
ATCATCGGCG AGAACACCAA ACAGGCCATC AAAAACCTCA ACAAACTCCG CAAAGACGGC
TTCGCCTTCA CTGTGGATAT CCTGGGAGAA GCAACGGTCA GTGAAGAGGA GGGGGTAGAA
TACCAGACCA ACTATCTCCA CCTCATGGAC GCCCTGGCCC AGGAAGCCAA AAGCTGGAAA
GGCCTGGGCG GGAACAGCGG GGATTGGGGA CACGCCCCGC TGACCAATAT CTCCATCAAG
CCCTCGGCCC TGTATTCCCA GGCCAGACCC TCGGATTTTG AGAACACGGT ACAGGCCATT
TTCGACCGGC TCATGCCGGT ACTGGACAAG GCCATGGCCA TGGGGTGTCA CGTCTGCATC
GACATGGAGC AGTACAAGTA CAAGGACATC ACCCTGGAAG TCTTCAAGCG GTTGCGCTCC
CACGAAAAAT TCCGGGAGTA CCCGCACATC GCCATTGTGC TGCAATCCTA TCTTCTGGAC
ACGGATCAGG ACCTTGATGC CTTGTTGCAT TGGGCACGGA CAGAACACCT CCCCATTTCC
ATCAGATTGG TCAAAGGCGC CTACTGGGAT TACGAGACGG TCCTGGCCAA GCAGCACGGC
TGGGACATTC CGGTCTACAT CAACAAGCAC GAGACCGACG CCGCCTTCGA ACGCCAGACA
GCGACCATCC TCCGCAATTC CGACATCTGC CACTACGCTT GCGGTTCGCA CAACATCCGG
TCCATCGCCG CGGCCCAGGA GATGGCTGAG GAACTCGGCG TTCCGGAAGA ACAGTATGAA
TTTCAGGTCC TCTACGGAAT GGCCGAACCA GTGCGCAAAG GATTGCGCAA CGTCGCCAAG
CGCGTCCGGC TCTATTGTCC CTACGGCGAA CTCATCCCGG GCATGGCCTA TCTCGTGCGC
CGACTGCTGG AAAACACGGC CAACGAATCC TTCCTCAAGC AGAGTTTCGC TGACCACGAA
GATGTCGACC ACCTGCTGGA AGATCCCGCG CTTTTGGCCC AGTCCGGCCC TCTGAGCCGG
GAAAAGACCC CGGCACCACC GCCACTGTCG CCCTCAGAAC CGTTTCGCAA TGAGCCGGCT
CCGGATTTCA CCAAGGAATC CGAACGGGTG GCCTACCCTG AGGCGCTTGC CGAGGTCCGC
CGGCAATTGG GCCGGACCTA TCCTCTGTAC ATAAACGGCC AGGAAGTGAC CACCGAAGAC
CTGCTGCCTT CCCTCAATCC AGCCGACCCC GGGGAAGTCG TGGGATCCGT ATGCCAGGCC
TCGACCCGGG AGATCGACGA GGCCATCGCC GCGGCCAACG AAGCCCTGCC GGCCTGGCGG
GATCTGGCCC CTGAAGACCG GGCCCAGTAC ATCTTTCGCG CCGCCGACAT TGCTCGCAAG
AATATCCACA CGCTGAGCGC CTGGCAGATC CTGGAGGTCG GCAAGCAATG GAATCAGGCC
CACGGCGACG TGGCCGAAGC CATCGACTTC ATGGAGTACT ACGCCCGGGA CATGATCCGG
CTCGGTCGCC CCCGGCGCAC CGGCAAGGCT CCAGGTGAAT TGACCCACTA TTTCTACCAA
TCCAAGGGTA TCGCCGCAGT CATCGGCCCC TGGAATTTCC CTTTGGCTAT CAGTTGCGGC
ATGAGTGCCG CCTCACTGGT GACCGGCAAT TGTGTGCTCT ATAAACCCGC CGGCCTGTCC
TCTGTCGTCG GACACACCTT GTGCCAGATT TTCCGCGACG CGGGACTCCC CCCCGGGGTC
TTCAATTTCG TACCCGGCCG GGGCTCTGTT ATCGGCGACT ATCTGGTCGA CCATCCGGAC
ATCGCCCTGG TGGCCTTCAC CGGCTCCCTG GACGTTGGGC TGCGGATCAT CGAGCGGGCC
GCCAGGCTCC AGCCCGGCCA GGAACACGTC AAAAAAGTCA TCGCCGAGAT GGGCGGAAAA
AATGCGATCA TTGTCGACGA CGATGCCGAC CTAGACGAAG CTGTAGCCGA TATCATCTAC
TCCGCCTTCG GATACCAGGG CCAGAAATGC TCAGCCTGTT CGCGGGTCAT TGTCCTGGAT
TCCATTTACG ACAAATTCAT GCACCGGCTG ACCAAGGCCG CACAATCCTT ACCCATCGGA
CCGGCCGAGG ATCCCGGCAA TTTTATGGGA CCGGTGGTGG ACAAAGGCCA GCAGGACAAG
GTCAATGCGG CCGTGGCCCT GGCCGAAGAA GAGGGCAATG TCGTGCTCAA ACGGGACGAC
CATCCTGACA CCGGCTCCTA TGCCCCACTG ACCATTGTCG AAAACATCAC CCCGGAACAC
CGTCTGGCCC AGGAAGAGAT CTTCGGGCCT GTCCTGGCGG TCATGCGGGT CAAGGACTTC
GATCAGGCCC TGCAATGGGC CAATTCGACC CCCTACGCCT TGACCGGGGC CGTCTTCTCC
CGCAGTCCCC AGCACCTGGA CCGGGCAAGA ACCGAATTCC GCGTCGGCAA TCTCTACCTC
AACCGCGGCA GCACCGGCTC GATGGTCGAG CGCCACCCCT TCGGCGGCTT CAAGCTTTCC
GGCGTGGGCT CCAAAACGGG CGGTCCGGAT TATCTGTTGC AGTTCATGGA CCCGCGCACG
GCCACGGAGA ACACCATGCG CCGCGGTTTT GCCCCGATCA TCGAAGGCGA CGACTGGCTC
GATTAA
 
Protein sequence
MDMQTLNPKI KARGREFFQS ISGEAPTVFN KGWWTGKVMD WAMRNEQFKI QLFRFVDVLP 
YLNTTESLNR HIQEYFVAED QEIPAVLKWG AKGAGMGKGL AGKVMTKTIR SNIEGMAKQF
IIGENTKQAI KNLNKLRKDG FAFTVDILGE ATVSEEEGVE YQTNYLHLMD ALAQEAKSWK
GLGGNSGDWG HAPLTNISIK PSALYSQARP SDFENTVQAI FDRLMPVLDK AMAMGCHVCI
DMEQYKYKDI TLEVFKRLRS HEKFREYPHI AIVLQSYLLD TDQDLDALLH WARTEHLPIS
IRLVKGAYWD YETVLAKQHG WDIPVYINKH ETDAAFERQT ATILRNSDIC HYACGSHNIR
SIAAAQEMAE ELGVPEEQYE FQVLYGMAEP VRKGLRNVAK RVRLYCPYGE LIPGMAYLVR
RLLENTANES FLKQSFADHE DVDHLLEDPA LLAQSGPLSR EKTPAPPPLS PSEPFRNEPA
PDFTKESERV AYPEALAEVR RQLGRTYPLY INGQEVTTED LLPSLNPADP GEVVGSVCQA
STREIDEAIA AANEALPAWR DLAPEDRAQY IFRAADIARK NIHTLSAWQI LEVGKQWNQA
HGDVAEAIDF MEYYARDMIR LGRPRRTGKA PGELTHYFYQ SKGIAAVIGP WNFPLAISCG
MSAASLVTGN CVLYKPAGLS SVVGHTLCQI FRDAGLPPGV FNFVPGRGSV IGDYLVDHPD
IALVAFTGSL DVGLRIIERA ARLQPGQEHV KKVIAEMGGK NAIIVDDDAD LDEAVADIIY
SAFGYQGQKC SACSRVIVLD SIYDKFMHRL TKAAQSLPIG PAEDPGNFMG PVVDKGQQDK
VNAAVALAEE EGNVVLKRDD HPDTGSYAPL TIVENITPEH RLAQEEIFGP VLAVMRVKDF
DQALQWANST PYALTGAVFS RSPQHLDRAR TEFRVGNLYL NRGSTGSMVE RHPFGGFKLS
GVGSKTGGPD YLLQFMDPRT ATENTMRRGF APIIEGDDWL D
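
The sequences above are printed in blocks of ten residues, six blocks per line. To compare them with the Gene Length, Protein Length, and GC content fields, the blocks first have to be joined into one string. The sketch below shows one way to do that; the helper names are illustrative, and the demo input is only the first line of the gene sequence, so it does not reproduce the whole-gene GC content figure.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks that group residues into blocks."""
    return "".join(text.split())


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (case-insensitive)."""
    dna = dna.upper()
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence in this record, used as a short demo input.
first_line = "ATGGACATGC AGACCCTGAA CCCGAAAATC AAGGCCCGTG GCCGGGAATT CTTTCAAAGT"
dna = join_blocks(first_line)

print(len(dna))                    # number of bases on the first line
print(round(gc_fraction(dna), 2))  # GC fraction of that line only
```

Passing the full gene sequence through `join_blocks` and `gc_fraction` would let the result be compared against the rounded GC content reported in the Gene Information section.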