Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1044 |
Symbol | |
ID | 5898499 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 1103095 |
End bp | 1106187 |
Gene Length | 3093 bp |
Protein Length | 1030 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641561526 |
Product | bifunctional proline dehydrogenase/pyrroline-5-carboxylate dehydrogenase |
Protein accession | YP_001682672 |
Protein GI | 167645009 |
COG category | [C] Energy production and conversion [E] Amino acid transport and metabolism |
COG ID | [COG0506] Proline dehydrogenase [COG4230] Delta 1-pyrroline-5-carboxylate dehydrogenase |
TIGRFAM ID | [TIGR01238] delta-1-pyrroline-5-carboxylate dehydrogenase (PutA C-terminal domain) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.566511 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGATT GGGACTCCCT GGACGACGGC AAGTATCGCG ACGAAGCCGC CGTGATCGCC GATCTGCTGG CCGCCCAGGC CCTGGGTTCG GAAGATCGCG CCGCCGTCCG CGCCGAGGCC GAAGCCCTGG TGCGCGGCGC CCGCCGCAGC GTCCGCAAGC AGGGTGTGGT CGAAAGCTTC CTGCAGGAGT TCAGCCTGGG CACCCGCGAG GGCCTGGCCC TGATGTGCCT GGCCGAGGCC CTGCTGCGCA CCCCCGACGA CGACACCCGC GACAAGCTGA TCGCCGAGAA GATCGGCTCG GCTGACTGGG CGTCCCACCT GGGCGGCAGC GACAGCCTGT TCGTCAACGC CTCGACCTGG GGCCTGATGC TGACCGGCAA GATCGTCGAG CCCGACGACC AGGCCCAGAA GGACCTGCCA GGGTTCATCA AGAAGATCGC CGGCCGCCTG GGCGAGCCGG TGATCCGCGC CGCCGTCGGC CAGGCCATCC GCATCATGGG CGAGCAGTTC GTGCTGGGCC GCACCATCGA GGCGGCCATC AAGCGCGCCG CCCATGACGG CGACATCTGC AGCTTCGACA TGCTGGGCGA AGGCGCCCGC ACCGCCGCCG ACGCCGCCCG CTACGAAAAG TCCTATGCCG ATGCGATCGA AACCGTCGGC AAGCTGTCGA ACAAGGCCGG CCCCGAGGCC GGCCACGGCG TGTCGGTCAA GCTGTCGGCC CTGACCCCGC GCTATGAGGC CACGCAAGAG GCCCGCGTCT GGGACGAGCT CTATCCCCGC ATCCTGCGCC TGGCCCTGGT GGCGGCCAAA TACGACATCA ACTTCACCAT CGACGCCGAA GAGGCGGACC GCCTGGCGCT GTCGCTGAAG CTGCTGGATC GCCTGTGCCG CGAGCCTGAG CTGGGCGCGT GGACGGGTCT GGGTCTGGCC GTCCAGGCCT ACCAGAAGCG CTGCCCGGAG GTGATCGCCC GCCTGACCGC CCTGTCCAAG GAAACCGGCC GCCGCCTGAT GGTCCGCCTG GTCAAGGGCG CCTATTGGGA CAGCGAGATC AAGCGCGCCC AGGTCGCCGG CCGCCCGGAC TACCCGGTCT ACACCACCAA GCCGGCCACC GACCTGTCGT ACCTGGTCTG CGCCAAGGCC CTGATCGCGG CCGCCCCGCA TCTCTACGCC CAGTTCGCCA CCCACAACGC CCACACCCTG GCCGCCGTGG TCCGGATGTC GAAGAACGCC GGCGTCAAGA TCGAACACCA GCGCCTGCAC GGCATGGGCG AGGCGCTCTA CAAGGCCGCC GACGACCTCT ATGACGGCGT CACCCTGCGG GCCTACGCCC CGGTGGGCGG CCACGAGGAC CTGCTGCCCT ACCTGGTCCG CCGCCTGCTG GAGAACGGCG CCAACACATC GTTCGTCCAC GCCCTGCTCG ACGAGCGGGT GCCGGTGGAG AAGGTGGTGG TCGATCCCAT CACCGCCGTC GAAGCCCATC CAGGCCCCCA CGCCAAGATC CCGACCCCGG TCAACGTCTA TGGCCCGCGC CGCCAGAACA GCAAGGGCCT GGACCTGTCG GTGAAAGCCG ACCGCGACCG CCTGGCCGCC CATGTCGCCG AACTGGACAA GCTGACCCTG TCGGCCGGTC CGCTGATCGG CGGCAAGCTG ACGGCCGGCG CCCCGCCGAT GCCGGTGCAG AGCCCCACCG ACCACGACCG CGTGGTCGGG GTGGTGTCCG AAGCCCAGCT GCCGCAGATC GACCAGGCCT TCAAGCTGGC CCGCGCCGCC CAGCCCGCCT GGGACCAGGC CGGCGGCCCG GTTCGGGCCG AAATCCTGCG CGCCATGGGC GACGCCTTGG AAGCCAATCT CGAGCGCCTG TGCGCGATCC TCTCGCGCGA AGCCGGCAAG ACCCAGCCCG ACGCCATCGC CGAGGTGCGC GAGGCCGTCG ACTTCTGCCG CTACTACGCC AAGCTGGCCG AGGATCAGTT CGGCCTGGCC GGCCAAGTCC TGACCGGTCC AGTCGGCGAG ACCAACACCC TGCGCCTGGC CGGGCGCGGC GTCTTCGTCT GCATCAGCCC GTGGAACTTC CCACTGGCCA TCTTCACCGG CCAGATCGCC GCCGCCCTGG CGGCCGGCAA CGCCGTTCTG GCCAAGCCGG CCGAACAGAC TCCGCTGATC GCCTATGAGG CGGTAAAGCT CTATCACGCC GCCGGCCTGG ACCACCGCCT GTTGGCCCTG CTGCCGGGAC GCGGCGAGAC CGTCGGCGCG GCCCTGACCG CCCACGAGGG CCTGGACGGC GTGGCCTTCA CCGGCGGCAC GGACACCGCC TGGCGGATCA ACCAGACCCT GGCGCAACGG CAAGGCCCGA TCGTGCCGTT CATCGCCGAG ACCGGCGGCC TCAACGGCAT GTTTGTCGAC ACCACCGCCC AGCGCGAACA GGTGATCGAC GACGTGATCC TGTCGGCCTT CGGCAGCGCC GGCCAGCGCT GCTCGGCCCT GCGCCTGCTG TTCCTGCCCG AAGACACCGC CGACCACATC ATCGAGGGCC TGAAGGGCGC GATGGACGCC CTGGTGCTGG GCGACCCGGC CCTGGCCGTC ACCGACGTCG GCCCGGTGAT CGACGCCGAG GCCAAGGCGG CGCTCGACAA GCACGTGGTC CGCCTCAAGC ATGAGGCCAA GGTTGTGCAC ACCCTGGCCG CGCCGCGGAC CGGCACGTTC TTCGCCCCGG TCCTGGCCGA GATCCCGGCG GCCGACTTCC TGGAGCGCGA GGTGTTCGGC CCGGTGCTGC ACGTGGTGCG CTACCGTCCC GAAGACCTGG AGCAGGTCGC CGGCGCCCTG GCGGCCCGCC GCTATGGCCT GACCCTGGGC GTCCATTCCC GGATCGAGAG CTTCGCGGCC GACGTCCAGC GCCTGGTCCC GGCCGGCAAC TGTTACGTCA ACCGCTCAAT GACCGGCGCC GTGGTCGGCG TCCAGCCGTT CGGCGGCGAG GGCCTGTCGG GCACCGGCCC CAAGGCCGGC GGCCCCCACG CCCTGCTGCG CTACGCGGTC GAGCGAGCGC TGAGCATCAA CATCACGGCC CAGGGCGGGG ATCCGACGCT GCTGAATCTC TGA
|
Protein sequence | MTDWDSLDDG KYRDEAAVIA DLLAAQALGS EDRAAVRAEA EALVRGARRS VRKQGVVESF LQEFSLGTRE GLALMCLAEA LLRTPDDDTR DKLIAEKIGS ADWASHLGGS DSLFVNASTW GLMLTGKIVE PDDQAQKDLP GFIKKIAGRL GEPVIRAAVG QAIRIMGEQF VLGRTIEAAI KRAAHDGDIC SFDMLGEGAR TAADAARYEK SYADAIETVG KLSNKAGPEA GHGVSVKLSA LTPRYEATQE ARVWDELYPR ILRLALVAAK YDINFTIDAE EADRLALSLK LLDRLCREPE LGAWTGLGLA VQAYQKRCPE VIARLTALSK ETGRRLMVRL VKGAYWDSEI KRAQVAGRPD YPVYTTKPAT DLSYLVCAKA LIAAAPHLYA QFATHNAHTL AAVVRMSKNA GVKIEHQRLH GMGEALYKAA DDLYDGVTLR AYAPVGGHED LLPYLVRRLL ENGANTSFVH ALLDERVPVE KVVVDPITAV EAHPGPHAKI PTPVNVYGPR RQNSKGLDLS VKADRDRLAA HVAELDKLTL SAGPLIGGKL TAGAPPMPVQ SPTDHDRVVG VVSEAQLPQI DQAFKLARAA QPAWDQAGGP VRAEILRAMG DALEANLERL CAILSREAGK TQPDAIAEVR EAVDFCRYYA KLAEDQFGLA GQVLTGPVGE TNTLRLAGRG VFVCISPWNF PLAIFTGQIA AALAAGNAVL AKPAEQTPLI AYEAVKLYHA AGLDHRLLAL LPGRGETVGA ALTAHEGLDG VAFTGGTDTA WRINQTLAQR QGPIVPFIAE TGGLNGMFVD TTAQREQVID DVILSAFGSA GQRCSALRLL FLPEDTADHI IEGLKGAMDA LVLGDPALAV TDVGPVIDAE AKAALDKHVV RLKHEAKVVH TLAAPRTGTF FAPVLAEIPA ADFLEREVFG PVLHVVRYRP EDLEQVAGAL AARRYGLTLG VHSRIESFAA DVQRLVPAGN CYVNRSMTGA VVGVQPFGGE GLSGTGPKAG GPHALLRYAV ERALSINITA QGGDPTLLNL
|
| |