Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4870 |
Symbol | |
ID | 5902332 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 5265521 |
End bp | 5267689 |
Gene Length | 2169 bp |
Protein Length | 722 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641565390 |
Product | prolyl oligopeptidase |
Protein accession | YP_001686488 |
Protein GI | 167648825 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1505] Serine proteases of the peptidase family S9A |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTAGCC TGCTTCTCGT CCTCCTCGCC TCCACGAGCC TCATGACCAC AGCCGCCCAC GCCGCCGATC CCGCCCACGC CAACACCCCC ATCCCTGACA AGGAAGCCCG CACGCCGCTG GCCGACCTCG GCAAGGACGA TCCCTACAGG TGGATGGAGG AGATCGAGGG CGAGCGGCCG CTGGCCTGGG CCAAGGCGCA GAACACCCGC AGCCTGGCCG TGCTGCAGCG CGACACGCGC TATGCCGAGC TGGAAAGTCA GGCCCTGGCC ATCCTCAACG CCAAGGACCG GGTGCCGGGG GTGTCGTTCG CCGGCGACGG TAATTTGCGC AACTTCTGGC AGGACGCCGA CCACGTCCGC GGCCTATGGC GCGCGACGAC GCTGGAAAGC TACCGCACGG CCGAGCCGGC CTGGGAGACG CTCCTCGACA TCGACGCCCT GTCCAAGGCC GAGAACGCCA ACTGGGTGTT CAAGGGCGCC GACTGCCTGC CGCCCGAGGA CACCCGCTGC CTGGTCACCC TGTCGGACGG TGGCAAGGAC GCGGTGTCGA TCCGCGAGTT CGACACCGTG ACCAGGGCCT TCGTCGACCC CGTCCATGGG GGCGGCTTCG ACCTGCCCGA GGGCAAGCAG AGCGTCTCGT GGCTGGACAA GGACACCCTG CTGGTCGCCC GCGAATGGGA GCCGGGCCAG GTGACCAAGT CCGGCTACGC CTATGTGGTC AAGGCATGGA AGCGCGGCGC GCCCCTGGCC TCGGCCAAGG AAGTCTTCCG GGGCACGCCG GACGATGTCG CGGCCTCGGC CTACGCCCTG ACCGACGCCG ACGGCCGGGT CGTGGCGACC CTGGCCTCCC GCGCGGTCAG CTTCTTCGAG AGCGAGAGCT ATTTCCTGAC GGCTCAGGGG CCGGTGAAGC TGCCCCTGCC GCTGAAGCAT TCGATCCAGG GTTATGTCGC AGGTCAGTTG GCGGTTTCGC TGGAACAGGA CTGGCCCGAG AAGGGCTTCA AGACCGGCGA CCTGGTCAGC TTCGACCTGG CGGCCCTGAA GGCCGACCCC GCCCAGGCCG GGGCGACCCT GGTCCTGCGC CCCACCGCCA AGCAGTCGGT CGAGTCGGTG ACCGCCACCC GTGACAAGCT GGTGGTCGGC CTGCTCGACA ACGTCACCGG CGTCGCCTTC GCCTACAGCC ACGGCCCCAA GGGCTGGACG TCCCAGAAGC TGGCCCTGCC GGCCAATTCG ACCATCGGCC TGGGCTCGGC CTCGCGGAAG GACGACCGCC TGTTCGTCAG CGTCACCGGC TATCTGACGC CCTCGACCTA TTGGCTGGCC GACGCCGCCT CGCTGAAGCT CGAGCAGGTC AAGGCCTCGC CGGCCCGGTT CGACGCCTCC ACCCACGTGG TCGAGCAGTT CGAGGCTGTC AGCAGCGACG GCGTGAAGAT CCCCTACTTC GTCGTGCGGC CCAGGGGCGT CGAATACGAC GGGACGGCCC CGACCCTGCT CTACGCCTAT GGCGGCTTCC AGGTGTCGAT GACCCCGGCC TATTCGGGCG TGATGGGCAA GCTGTGGCTG GAGCGCGGCG GGACCTATGT GGTGGCCAAT ATCCGCGGCG GCGGCGAGTT CGGCCCCGCC TGGCACGAGG CGGCCCTGAA GGCCAATCGC CAGAAGGCCT ATGACGACTT CTTCGCCGTC TCCCAGGACC TGATCGACCG CAAGATAACC TCGCCGCGCC ATCTGGGGAT CATGGGCGGC AGCAATGGCG GCTTGCTGAT GGGCGTGGCC CTGACCCAGC GGCCCGAGCT CTACAACGCC GTCGTCGTGC AGGTGCCGCT GTTCGACATG ATCCGCTACA GCCAGATCGG GGCCGGGGCC TCGTGGGTGG GCGAATATGG CGACCCGGCC ATTCCGTCGG AACGGGCGGT GATCGCCAGG TACGATCCCT ATTCCAACCT CAAGGCCGGC CAGAACTATC CCGAGGTGTT CATCGAGACC TCGACCAAGG ACGACCGCGT CCACCCCGCC CACGCCCGCA AGGCCGCCGC GCGGCTGGAG GCGCTGGGCT ATCCGGTGCT GTACTACGAG AACATCGACG GCGGCCACGC CGCCAGCGCC AACCTGGCCG AGACCGCCCG CCGCCAGGCC CTGGAATATG TCTACCTGTC GAAGAAGCTG ATGGATTGA
|
Protein sequence | MRSLLLVLLA STSLMTTAAH AADPAHANTP IPDKEARTPL ADLGKDDPYR WMEEIEGERP LAWAKAQNTR SLAVLQRDTR YAELESQALA ILNAKDRVPG VSFAGDGNLR NFWQDADHVR GLWRATTLES YRTAEPAWET LLDIDALSKA ENANWVFKGA DCLPPEDTRC LVTLSDGGKD AVSIREFDTV TRAFVDPVHG GGFDLPEGKQ SVSWLDKDTL LVAREWEPGQ VTKSGYAYVV KAWKRGAPLA SAKEVFRGTP DDVAASAYAL TDADGRVVAT LASRAVSFFE SESYFLTAQG PVKLPLPLKH SIQGYVAGQL AVSLEQDWPE KGFKTGDLVS FDLAALKADP AQAGATLVLR PTAKQSVESV TATRDKLVVG LLDNVTGVAF AYSHGPKGWT SQKLALPANS TIGLGSASRK DDRLFVSVTG YLTPSTYWLA DAASLKLEQV KASPARFDAS THVVEQFEAV SSDGVKIPYF VVRPRGVEYD GTAPTLLYAY GGFQVSMTPA YSGVMGKLWL ERGGTYVVAN IRGGGEFGPA WHEAALKANR QKAYDDFFAV SQDLIDRKIT SPRHLGIMGG SNGGLLMGVA LTQRPELYNA VVVQVPLFDM IRYSQIGAGA SWVGEYGDPA IPSERAVIAR YDPYSNLKAG QNYPEVFIET STKDDRVHPA HARKAAARLE ALGYPVLYYE NIDGGHAASA NLAETARRQA LEYVYLSKKL MD
|
| |