Gene Information |
Locus tag | Caul_1987 |
Symbol | |
ID | 5899442 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 2133707 |
End bp | 2136148 |
Gene Length | 2442 bp |
Protein Length | 813 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641562476 |
Product | peptidase S9 prolyl oligopeptidase |
Protein accession | YP_001683613 |
Protein GI | 167645950 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
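The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based inclusive positions on the replicon, and that the CDS ends in a stop codon that is not counted in the protein length. Both assumptions agree with the reported 2442 bp and 813 aa, but neither is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 2133707
END_BP = 2136148


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon (assumption).
    return length_bp // 3 - 1


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    print(length_bp, protein_length(length_bp))
```

Under these assumptions the result matches the Gene Length and Protein Length fields of the record.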
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCGCCGCT TCGCGCCGCG CCTGCTCGCC TGCGCCCTGA TGGGCGCGGG CGCCGCCCAG GCCTCGCCCT TCACGGTCGA TCAGCTGCTG GCCCAGCAAC GCCTGGGTCC GGTCAGCGTC GATCCCTCGC AACGCTGGCT GGTCGTGCCC ACCACGGGGC CTTACGATTC CGCCCCGCGG TGGGACCTGG AGGACCTGAC GCCGATGACC ATCACTCGGT TGTCGGTGTT CGACCTGCGC AAGGGCGGCG CGCCCCGAGT CTTTCCGTCC AACGCGGGGT CGCCGGAGGC CTGGGGGTAC AACCCCGGGC CCTACGCGCC CTCGGGCGCG AAGATGGCGG TGACGCGGGC GCGGAGCGGG AGCCTGGAGG TGGGGATCTT GAACCTCACC GACGGTGCGG TAGTCTGGAC AGGGCTTTTC CCGGTCACCG ACGTGATGGG ACGCTCGCTG CAATGGCGCT CGGACGACGT GCTGCTGGTC CTGGCCAAGG ACCCGGCGAC CCCGACCACG GCCGGGGTGG TCGGTCCGCC CGCCGAGCAA CGCCTGATCG ACCGATGGCG GGCGACCCGC GAGAACCGTG TGGGCGTGAC CGTGATGGGC AGCGGCCGGT ATCTGGACCA GACGCCGGCG CTGCCTCCCA GCCGACTGGT GGCGGTCGAT GCGACCCGGG GCCTTGGCCA GGTGCTGGCC CAAGGTGATT TCTTCGACCT GGAGGTCGCG CCGGGCGGGC GGTTCGTGGG CCTGCTGTCG AACGGCCCGC CGCTGGCGCC GGACCCTTCG GTTCCCGCCA TGACCACCGA CCCTTTCCGC CGTCACCGTC TGACCGTGGT GGATCTGGTG GCCGGCGGCG CCTGGTCGCC CTGCCCCAGC TGCGACGTCG CCGTGGATCT CCTGGCCTGG TCGCCTGGCG GCGCGGGCCT GCTGGTCTAC GCGCGGGGCG ACAACGATCC ATGGTCGGCG GCCCGCTATT GGCGGATCGA CGCCCTGACG CGCCGCGCCG CGCCGCTCCA TGACCGCGAC ATCACCCCCA TGGCCGAAAC AACCGGCTAT GGCAGCCGTG TCCCCCGCGC CGACTGGATG AGCGAAACCC CCGTGGTTCT GGGACGGCGC GGCGACGCGT CGCAAGACGG GGCAGACTGG TTCGCATGGG GCGCCAAGGC CCCCCTCAAT CTCACCGCGG CCCTGCCACC AGGGCCGCGC CGCCTCGAAG CCACGGGACC GGCGGGAATT GTCGCCTCGC AAGGCGGCCG TCTCTGGCGC ATCGACCCGC TGGGCCGCGC CACGCTGCTG GGGGCCGGCC GAAGCCTGCA GGGCGCGGGC CTGCCCGGCG GCGAACGGCT GGTCTTCAAC AATCGGCCCC GCCCCGAAGA CTTGGCGCTG ATGCTCGACG ACGGCCCGCG CCCCACCCCC GTCGTTCTCG ACAAGGCCCG GCTTCGCCCG CTCGGTCTGG ACGCTCCCAC CGACGAAACC CTTCTCTTGG TCGCGGGTCA GGCGCGGGCG GCGGTCACCC TCAAACAGGA CGCGCACGGC GTGGAGACGG TTCTGCTGCG TCGCGCCGGC CAGGCGCCGC AAGCCTTGGC GACGCTGAAT GCGCATCTGG CGCAGGTGGA CTTTTCGGCG CCGCGCGCGG TCAGCCACCT CGGCCCTCGG GGCGAGACAC TGACCAGTTG GCTCTACATG CCCACCACGC CGCCTTCCGG CGCCAAGGTC CCGCTGGTCG TCATTCCCTA TCCAGGCAAG GTCTATCCCA CCGCCCCGAC CAGCCAGGGT CCGCCCGCGC GCCAGCTTTA TCTCAACCTC 
CAGATCCTGG CCGGGGCCGG CTACGCCGTC TTGCTGCCCA GCTTGCCCGT CGATACGCGC CGCGAGCCGG CCGAAGGTCT CGCCGACCGC ATTCTGGCGG CGGCCGACGC GGCCGCCACT GTCGAGCCGC GGCTGGATCT CAACCGCATG GGCTTGTGGG GCCATAGCTA TGGGGGATAC GCCGTGCTCA GCGCCGCCGC CCAAAGCCGC CAGTTCAAGG CGGTGATCGC CGGGGCCTTC GCCGCCGATC TGGCCAGCCA CTATACCCGC AAGAGCCTGC TGGCGACGGT CGCCCCCGAC GCCGCTGTCG AGATCATGGT CGGGGCGGGC TGGATGGAAC AGGGCCAGGG ACGGATGGGC GCGCCGCCCT GGGTCGATCC TGATCGCTAT GTCCGCAACA GCCCCCTGCT CCATGCCGAC AGGATTACCG CCCCGGTCAT GCTGGTGATG GGCGACCTGG ACAGCGATCC CGGCCAGGCC CTGACCATGT TCGGCGCCCT GTTCCGCCAG AACAAGGACG CGGTCTCCTT GCAGTATCAT GGCGAGACCC ACGTGATCAT GACGGCCGCC AATGTCGCCG ATTTCCACCG CCGGCTTCTG GCTTTCCTGC GCGATAATCT CGGGACCTCC CCAGGATCAT GA
Protein sequence | MRRFAPRLLA CALMGAGAAQ ASPFTVDQLL AQQRLGPVSV DPSQRWLVVP TTGPYDSAPR WDLEDLTPMT ITRLSVFDLR KGGAPRVFPS NAGSPEAWGY NPGPYAPSGA KMAVTRARSG SLEVGILNLT DGAVVWTGLF PVTDVMGRSL QWRSDDVLLV LAKDPATPTT AGVVGPPAEQ RLIDRWRATR ENRVGVTVMG SGRYLDQTPA LPPSRLVAVD ATRGLGQVLA QGDFFDLEVA PGGRFVGLLS NGPPLAPDPS VPAMTTDPFR RHRLTVVDLV AGGAWSPCPS CDVAVDLLAW SPGGAGLLVY ARGDNDPWSA ARYWRIDALT RRAAPLHDRD ITPMAETTGY GSRVPRADWM SETPVVLGRR GDASQDGADW FAWGAKAPLN LTAALPPGPR RLEATGPAGI VASQGGRLWR IDPLGRATLL GAGRSLQGAG LPGGERLVFN NRPRPEDLAL MLDDGPRPTP VVLDKARLRP LGLDAPTDET LLLVAGQARA AVTLKQDAHG VETVLLRRAG QAPQALATLN AHLAQVDFSA PRAVSHLGPR GETLTSWLYM PTTPPSGAKV PLVVIPYPGK VYPTAPTSQG PPARQLYLNL QILAGAGYAV LLPSLPVDTR REPAEGLADR ILAAADAAAT VEPRLDLNRM GLWGHSYGGY AVLSAAAQSR QFKAVIAGAF AADLASHYTR KSLLATVAPD AAVEIMVGAG WMEQGQGRMG APPWVDPDRY VRNSPLLHAD RITAPVMLVM GDLDSDPGQA LTMFGALFRQ NKDAVSLQYH GETHVIMTAA NVADFHRRLL AFLRDNLGTS PGS
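The gene sequence as listed begins with ATG and ends with TGA, so it appears to be given in coding orientation even though the record places the gene on the minus strand. The following is a minimal sketch, not the database's own method, of how the GC content and the translation could be recomputed from the listed sequence. It assumes the blocks of ten bases are separated only by whitespace. The codon assignments are those of the standard genetic code, which translation table 11 shares. Table 11 differs in the set of permitted start codons, which this sketch does not handle (the gene here starts with ATG, so that does not matter). All function names are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence above.
    print(translate("ATGCGCCGCT TCGCGCCGCG CCTGCTCGCC"))
```

Applied to the first three blocks of the gene sequence, `translate` gives the first ten residues of the protein sequence listed above. Running `gc_fraction` on the full gene sequence would allow a comparison with the reported GC content.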