Gene Caul_1044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1044 
Symbol 
ID5898499 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1103095 
End bp1106187 
Gene Length3093 bp 
Protein Length1030 aa 
Translation table11 
GC content71% 
IMG OID641561526 
Productbifunctional proline dehydrogenase/pyrroline-5-carboxylate dehydrogenase 
Protein accessionYP_001682672 
Protein GI167645009 
COG category[C] Energy production and conversion
[E] Amino acid transport and metabolism 
COG ID[COG0506] Proline dehydrogenase
[COG4230] Delta 1-pyrroline-5-carboxylate dehydrogenase 
TIGRFAM ID[TIGR01238] delta-1-pyrroline-5-carboxylate dehydrogenase (PutA C-terminal domain) 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.566511 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGATT GGGACTCCCT GGACGACGGC AAGTATCGCG ACGAAGCCGC CGTGATCGCC 
GATCTGCTGG CCGCCCAGGC CCTGGGTTCG GAAGATCGCG CCGCCGTCCG CGCCGAGGCC
GAAGCCCTGG TGCGCGGCGC CCGCCGCAGC GTCCGCAAGC AGGGTGTGGT CGAAAGCTTC
CTGCAGGAGT TCAGCCTGGG CACCCGCGAG GGCCTGGCCC TGATGTGCCT GGCCGAGGCC
CTGCTGCGCA CCCCCGACGA CGACACCCGC GACAAGCTGA TCGCCGAGAA GATCGGCTCG
GCTGACTGGG CGTCCCACCT GGGCGGCAGC GACAGCCTGT TCGTCAACGC CTCGACCTGG
GGCCTGATGC TGACCGGCAA GATCGTCGAG CCCGACGACC AGGCCCAGAA GGACCTGCCA
GGGTTCATCA AGAAGATCGC CGGCCGCCTG GGCGAGCCGG TGATCCGCGC CGCCGTCGGC
CAGGCCATCC GCATCATGGG CGAGCAGTTC GTGCTGGGCC GCACCATCGA GGCGGCCATC
AAGCGCGCCG CCCATGACGG CGACATCTGC AGCTTCGACA TGCTGGGCGA AGGCGCCCGC
ACCGCCGCCG ACGCCGCCCG CTACGAAAAG TCCTATGCCG ATGCGATCGA AACCGTCGGC
AAGCTGTCGA ACAAGGCCGG CCCCGAGGCC GGCCACGGCG TGTCGGTCAA GCTGTCGGCC
CTGACCCCGC GCTATGAGGC CACGCAAGAG GCCCGCGTCT GGGACGAGCT CTATCCCCGC
ATCCTGCGCC TGGCCCTGGT GGCGGCCAAA TACGACATCA ACTTCACCAT CGACGCCGAA
GAGGCGGACC GCCTGGCGCT GTCGCTGAAG CTGCTGGATC GCCTGTGCCG CGAGCCTGAG
CTGGGCGCGT GGACGGGTCT GGGTCTGGCC GTCCAGGCCT ACCAGAAGCG CTGCCCGGAG
GTGATCGCCC GCCTGACCGC CCTGTCCAAG GAAACCGGCC GCCGCCTGAT GGTCCGCCTG
GTCAAGGGCG CCTATTGGGA CAGCGAGATC AAGCGCGCCC AGGTCGCCGG CCGCCCGGAC
TACCCGGTCT ACACCACCAA GCCGGCCACC GACCTGTCGT ACCTGGTCTG CGCCAAGGCC
CTGATCGCGG CCGCCCCGCA TCTCTACGCC CAGTTCGCCA CCCACAACGC CCACACCCTG
GCCGCCGTGG TCCGGATGTC GAAGAACGCC GGCGTCAAGA TCGAACACCA GCGCCTGCAC
GGCATGGGCG AGGCGCTCTA CAAGGCCGCC GACGACCTCT ATGACGGCGT CACCCTGCGG
GCCTACGCCC CGGTGGGCGG CCACGAGGAC CTGCTGCCCT ACCTGGTCCG CCGCCTGCTG
GAGAACGGCG CCAACACATC GTTCGTCCAC GCCCTGCTCG ACGAGCGGGT GCCGGTGGAG
AAGGTGGTGG TCGATCCCAT CACCGCCGTC GAAGCCCATC CAGGCCCCCA CGCCAAGATC
CCGACCCCGG TCAACGTCTA TGGCCCGCGC CGCCAGAACA GCAAGGGCCT GGACCTGTCG
GTGAAAGCCG ACCGCGACCG CCTGGCCGCC CATGTCGCCG AACTGGACAA GCTGACCCTG
TCGGCCGGTC CGCTGATCGG CGGCAAGCTG ACGGCCGGCG CCCCGCCGAT GCCGGTGCAG
AGCCCCACCG ACCACGACCG CGTGGTCGGG GTGGTGTCCG AAGCCCAGCT GCCGCAGATC
GACCAGGCCT TCAAGCTGGC CCGCGCCGCC CAGCCCGCCT GGGACCAGGC CGGCGGCCCG
GTTCGGGCCG AAATCCTGCG CGCCATGGGC GACGCCTTGG AAGCCAATCT CGAGCGCCTG
TGCGCGATCC TCTCGCGCGA AGCCGGCAAG ACCCAGCCCG ACGCCATCGC CGAGGTGCGC
GAGGCCGTCG ACTTCTGCCG CTACTACGCC AAGCTGGCCG AGGATCAGTT CGGCCTGGCC
GGCCAAGTCC TGACCGGTCC AGTCGGCGAG ACCAACACCC TGCGCCTGGC CGGGCGCGGC
GTCTTCGTCT GCATCAGCCC GTGGAACTTC CCACTGGCCA TCTTCACCGG CCAGATCGCC
GCCGCCCTGG CGGCCGGCAA CGCCGTTCTG GCCAAGCCGG CCGAACAGAC TCCGCTGATC
GCCTATGAGG CGGTAAAGCT CTATCACGCC GCCGGCCTGG ACCACCGCCT GTTGGCCCTG
CTGCCGGGAC GCGGCGAGAC CGTCGGCGCG GCCCTGACCG CCCACGAGGG CCTGGACGGC
GTGGCCTTCA CCGGCGGCAC GGACACCGCC TGGCGGATCA ACCAGACCCT GGCGCAACGG
CAAGGCCCGA TCGTGCCGTT CATCGCCGAG ACCGGCGGCC TCAACGGCAT GTTTGTCGAC
ACCACCGCCC AGCGCGAACA GGTGATCGAC GACGTGATCC TGTCGGCCTT CGGCAGCGCC
GGCCAGCGCT GCTCGGCCCT GCGCCTGCTG TTCCTGCCCG AAGACACCGC CGACCACATC
ATCGAGGGCC TGAAGGGCGC GATGGACGCC CTGGTGCTGG GCGACCCGGC CCTGGCCGTC
ACCGACGTCG GCCCGGTGAT CGACGCCGAG GCCAAGGCGG CGCTCGACAA GCACGTGGTC
CGCCTCAAGC ATGAGGCCAA GGTTGTGCAC ACCCTGGCCG CGCCGCGGAC CGGCACGTTC
TTCGCCCCGG TCCTGGCCGA GATCCCGGCG GCCGACTTCC TGGAGCGCGA GGTGTTCGGC
CCGGTGCTGC ACGTGGTGCG CTACCGTCCC GAAGACCTGG AGCAGGTCGC CGGCGCCCTG
GCGGCCCGCC GCTATGGCCT GACCCTGGGC GTCCATTCCC GGATCGAGAG CTTCGCGGCC
GACGTCCAGC GCCTGGTCCC GGCCGGCAAC TGTTACGTCA ACCGCTCAAT GACCGGCGCC
GTGGTCGGCG TCCAGCCGTT CGGCGGCGAG GGCCTGTCGG GCACCGGCCC CAAGGCCGGC
GGCCCCCACG CCCTGCTGCG CTACGCGGTC GAGCGAGCGC TGAGCATCAA CATCACGGCC
CAGGGCGGGG ATCCGACGCT GCTGAATCTC TGA
 
Protein sequence
MTDWDSLDDG KYRDEAAVIA DLLAAQALGS EDRAAVRAEA EALVRGARRS VRKQGVVESF 
LQEFSLGTRE GLALMCLAEA LLRTPDDDTR DKLIAEKIGS ADWASHLGGS DSLFVNASTW
GLMLTGKIVE PDDQAQKDLP GFIKKIAGRL GEPVIRAAVG QAIRIMGEQF VLGRTIEAAI
KRAAHDGDIC SFDMLGEGAR TAADAARYEK SYADAIETVG KLSNKAGPEA GHGVSVKLSA
LTPRYEATQE ARVWDELYPR ILRLALVAAK YDINFTIDAE EADRLALSLK LLDRLCREPE
LGAWTGLGLA VQAYQKRCPE VIARLTALSK ETGRRLMVRL VKGAYWDSEI KRAQVAGRPD
YPVYTTKPAT DLSYLVCAKA LIAAAPHLYA QFATHNAHTL AAVVRMSKNA GVKIEHQRLH
GMGEALYKAA DDLYDGVTLR AYAPVGGHED LLPYLVRRLL ENGANTSFVH ALLDERVPVE
KVVVDPITAV EAHPGPHAKI PTPVNVYGPR RQNSKGLDLS VKADRDRLAA HVAELDKLTL
SAGPLIGGKL TAGAPPMPVQ SPTDHDRVVG VVSEAQLPQI DQAFKLARAA QPAWDQAGGP
VRAEILRAMG DALEANLERL CAILSREAGK TQPDAIAEVR EAVDFCRYYA KLAEDQFGLA
GQVLTGPVGE TNTLRLAGRG VFVCISPWNF PLAIFTGQIA AALAAGNAVL AKPAEQTPLI
AYEAVKLYHA AGLDHRLLAL LPGRGETVGA ALTAHEGLDG VAFTGGTDTA WRINQTLAQR
QGPIVPFIAE TGGLNGMFVD TTAQREQVID DVILSAFGSA GQRCSALRLL FLPEDTADHI
IEGLKGAMDA LVLGDPALAV TDVGPVIDAE AKAALDKHVV RLKHEAKVVH TLAAPRTGTF
FAPVLAEIPA ADFLEREVFG PVLHVVRYRP EDLEQVAGAL AARRYGLTLG VHSRIESFAA
DVQRLVPAGN CYVNRSMTGA VVGVQPFGGE GLSGTGPKAG GPHALLRYAV ERALSINITA
QGGDPTLLNL