Gene Information |
Locus tag | Caul_1195 |
Symbol | |
ID | 5898650 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 1257199 |
End bp | 1259136 |
Gene Length | 1938 bp |
Protein Length | 645 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641561678 |
Product | peptidase S9 prolyl oligopeptidase |
Protein accession | YP_001682823 |
Protein GI | 167645160 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
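The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes the Start bp and End bp values are inclusive positions on the replicon and that the reported gene length includes the stop codon. The record itself states neither convention, and the variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 1257199
end_bp = 1259136
gene_length_bp = 1938
protein_length_aa = 645

# Assumption: coordinates are inclusive, so both end points count as bases.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the final codon
# is a stop codon, which contributes no amino acid.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions the three fields agree: 1938 bases give 646 codons, and 646 minus the stop codon gives 645 residues.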
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.430413 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGTTGAGAA CCATTCTGGC CGCCGTGGCG GCGAGTTTTG CCATGGCCGG CGCCGCATTG GCCGAGGTCA CGCCGCTGTC GGTCTATGGC GGCTTGCCCA ACCTTGAGCA GGTCGAAATC TCGCCGGACG GCAAGTTCCT GGCGATCGCG GTCACCGACG GCGAAAAGCG GATGCTGGTC GTGCGCGAGG CCGGCGAGAA CGGTAAGCTG CTGAGCGCCA TGAACTTCGG CGACACCAAA CTTCGCGCCG TGCAATGGGC GGGTCCAGAG CACGTGCTGA TCACCACCTC AAGCACCGCC GAGGTCTATG GCCTGAGCGG ACCCAAGCGC GAATACCTGA TGGGCTTCGA CTTCAACCTG GTCACCAAGA AGCAGATTCC GCTGCTCAAG AACCAGGAAG ACGCGATGAA CGTCATCCTG GAGACCCCGG ATGTGCGGTT CATTGAAGGC GTGCCTCACG CTTTCGTCGA GGGGATCCAT TTTCACGACG GCCGGGGCTG GAACACCCTT TACCGGATCA ATCTGAAAAC CGGCGCCACC CGGATGCTGG ACAAGGGCGG CACGGAGAAC ACCGACGACT GGCTGGTCTC GCCTGAAGGT CAGCCTCTGG CCCAGTCGAT CTACGACGAA AAGCAGGGCG CCTGGAGCCT GAAGATCAGG GGGGCGGACG GCTGGACCAC GGTCGAGAAG ACCGTCTCGA AAATGGGCTC CTTCGGGCTG AGGGGCATGG GGCGCGACGG CCAAAGCGTG GTGGTCTGGG CCTATGACGA AGACACTGAC AAGACCCTGC TACGCGAATA CGCCCTGGAC GGCAGCCACG TCGACGTGCC GGGCAGCGGC GACTACGACC GCCCGATCCA TGCGCCGGAC GGGTCGCGCC TGCTGGGCGG CTACAGCCTG GTGGGCGACG AGAACCGCTA CGCCTTCTTC GACGCCAAGA CCCAGGCCAG CTGGAACGCC GTGCGCAAGG CCTTCCCAGG CGACCAGGTG TCGCTGGCGT CCTGGTCGGA TGATCGGCGC AAGGTGGTCG TCCAAGTCGA CTCGCCGACC CTGGGTCCGG CCTTCGCCCT GGTCGATCTC GACGCCAAGA GCGCGCGCTG GTTGGGCGAG ATCTACCGCG CCCTGACCGC CGACGGCGTC TCCGAGGTCC GCCCGATCAG GTACAAGGCC GCCGACGGTC TGGAGATCAC CGGCTACCTG ACCGTGCCGC GCGGCAAGGA CGCCAAGAAC CTGCCGCTGG TGGTGCTGCC GCACGGCGGT CCGGCGGCCC GCGACAAGCC GGGCTTCGAC TGGTGGTCCC AGGCGCTCGC CTCGCGCGGC TACGCGGTGC TGCAACCCAA TTTCCGTGGC TCCGACGGCT TTGGCCAAGC CTTCCTCGAA AAGGGCTATG GCCAGTGGGG CAAGGCCATG CAGACCGACC TGTCGGACGG TGTGCGCCAC CTGGCCAAGC AGGGCGTGAT CGATCCCAAA AGGGTCTGCA TCGTCGGCGC CAGCTATGGC GGCTATGCCG CCCTGGCCGG GGCGACGCTG GATCACGGCG TCTATCGCTG CGCCGTCTCG GTCGCCGGCC CCTCGGAGCT CAAGCGGTTC GTGTTCGACA GCAGCAAGCG CTACGAGACG GGCCGCAACT CGGCCCAGCG CTACTGGCTG CAGTTCATGG GCGCCGACGG CCTTAAGGAC CCCGACCTGG CCCTGATCTC GCCGGCCAAG CTGGCCGACA AGGTCGAGAT CCCGATCCTG TTGATCCATG GCAAGGACGA CACCGTCGTC CCCTACGTCC AGAGCACCCT GATGGCCGAC 
GCCCTGAAGA AAGCCGGCAA ACCGGTGGAG TTGGTCAGCC TGGACGGCGA GGATCACTTC CTGTCGCGCG GCGCCACCCG TCTGCGGATG CTGACCTCGG TGGTCGGCTT CCTCGAAAAG AACAACCCGC CGAACTGA
Protein sequence | MLRTILAAVA ASFAMAGAAL AEVTPLSVYG GLPNLEQVEI SPDGKFLAIA VTDGEKRMLV VREAGENGKL LSAMNFGDTK LRAVQWAGPE HVLITTSSTA EVYGLSGPKR EYLMGFDFNL VTKKQIPLLK NQEDAMNVIL ETPDVRFIEG VPHAFVEGIH FHDGRGWNTL YRINLKTGAT RMLDKGGTEN TDDWLVSPEG QPLAQSIYDE KQGAWSLKIR GADGWTTVEK TVSKMGSFGL RGMGRDGQSV VVWAYDEDTD KTLLREYALD GSHVDVPGSG DYDRPIHAPD GSRLLGGYSL VGDENRYAFF DAKTQASWNA VRKAFPGDQV SLASWSDDRR KVVVQVDSPT LGPAFALVDL DAKSARWLGE IYRALTADGV SEVRPIRYKA ADGLEITGYL TVPRGKDAKN LPLVVLPHGG PAARDKPGFD WWSQALASRG YAVLQPNFRG SDGFGQAFLE KGYGQWGKAM QTDLSDGVRH LAKQGVIDPK RVCIVGASYG GYAALAGATL DHGVYRCAVS VAGPSELKRF VFDSSKRYET GRNSAQRYWL QFMGADGLKD PDLALISPAK LADKVEIPIL LIHGKDDTVV PYVQSTLMAD ALKKAGKPVE LVSLDGEDHF LSRGATRLRM LTSVVGFLEK NNPPN
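The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons by hand. The following is a minimal sketch, not the database's own method. It assumes the gene sequence is listed in coding orientation (it begins with GTG and ends with TGA even though the strand is "-"), and it hard-codes the codon assignments and alternative start codons of translation table 11 as I understand them. The `translate_cds` helper and its `initiator` parameter are hypothetical names introduced for illustration.

```python
from itertools import product

# Codon table in TCAG order; table 11 uses the standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: start codons permitted by table 11; any of them at the first
# position is read as methionine.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str, initiator: bool = True) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Set initiator=False when the fragment does not begin at the start codon.
    """
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    protein = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        aa = CODON_TABLE[codon]
        if index == 0 and initiator and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 18 bases of the gene sequence, copied from the record.
head = "GTGTTGAGAA CCATTCTGGC CGCCGTGGCG"
tail = "AACAACCCGC CGAACTGA"

print(translate_cds(head))                   # compare with the first 10 residues
print(translate_cds(tail, initiator=False))  # compare with the last 5 residues
```

The two fragments translate to MLRTILAAVA and NNPPN, which match the first ten and last five residues of the protein sequence listed above.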