Gene Caul_4870 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4870 
Symbol 
ID5902332 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5265521 
End bp5267689 
Gene Length2169 bp 
Protein Length722 aa 
Translation table11 
GC content68% 
IMG OID641565390 
Productprolyl oligopeptidase 
Protein accessionYP_001686488 
Protein GI167648825 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1505] Serine proteases of the peptidase family S9A 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTAGCC TGCTTCTCGT CCTCCTCGCC TCCACGAGCC TCATGACCAC AGCCGCCCAC 
GCCGCCGATC CCGCCCACGC CAACACCCCC ATCCCTGACA AGGAAGCCCG CACGCCGCTG
GCCGACCTCG GCAAGGACGA TCCCTACAGG TGGATGGAGG AGATCGAGGG CGAGCGGCCG
CTGGCCTGGG CCAAGGCGCA GAACACCCGC AGCCTGGCCG TGCTGCAGCG CGACACGCGC
TATGCCGAGC TGGAAAGTCA GGCCCTGGCC ATCCTCAACG CCAAGGACCG GGTGCCGGGG
GTGTCGTTCG CCGGCGACGG TAATTTGCGC AACTTCTGGC AGGACGCCGA CCACGTCCGC
GGCCTATGGC GCGCGACGAC GCTGGAAAGC TACCGCACGG CCGAGCCGGC CTGGGAGACG
CTCCTCGACA TCGACGCCCT GTCCAAGGCC GAGAACGCCA ACTGGGTGTT CAAGGGCGCC
GACTGCCTGC CGCCCGAGGA CACCCGCTGC CTGGTCACCC TGTCGGACGG TGGCAAGGAC
GCGGTGTCGA TCCGCGAGTT CGACACCGTG ACCAGGGCCT TCGTCGACCC CGTCCATGGG
GGCGGCTTCG ACCTGCCCGA GGGCAAGCAG AGCGTCTCGT GGCTGGACAA GGACACCCTG
CTGGTCGCCC GCGAATGGGA GCCGGGCCAG GTGACCAAGT CCGGCTACGC CTATGTGGTC
AAGGCATGGA AGCGCGGCGC GCCCCTGGCC TCGGCCAAGG AAGTCTTCCG GGGCACGCCG
GACGATGTCG CGGCCTCGGC CTACGCCCTG ACCGACGCCG ACGGCCGGGT CGTGGCGACC
CTGGCCTCCC GCGCGGTCAG CTTCTTCGAG AGCGAGAGCT ATTTCCTGAC GGCTCAGGGG
CCGGTGAAGC TGCCCCTGCC GCTGAAGCAT TCGATCCAGG GTTATGTCGC AGGTCAGTTG
GCGGTTTCGC TGGAACAGGA CTGGCCCGAG AAGGGCTTCA AGACCGGCGA CCTGGTCAGC
TTCGACCTGG CGGCCCTGAA GGCCGACCCC GCCCAGGCCG GGGCGACCCT GGTCCTGCGC
CCCACCGCCA AGCAGTCGGT CGAGTCGGTG ACCGCCACCC GTGACAAGCT GGTGGTCGGC
CTGCTCGACA ACGTCACCGG CGTCGCCTTC GCCTACAGCC ACGGCCCCAA GGGCTGGACG
TCCCAGAAGC TGGCCCTGCC GGCCAATTCG ACCATCGGCC TGGGCTCGGC CTCGCGGAAG
GACGACCGCC TGTTCGTCAG CGTCACCGGC TATCTGACGC CCTCGACCTA TTGGCTGGCC
GACGCCGCCT CGCTGAAGCT CGAGCAGGTC AAGGCCTCGC CGGCCCGGTT CGACGCCTCC
ACCCACGTGG TCGAGCAGTT CGAGGCTGTC AGCAGCGACG GCGTGAAGAT CCCCTACTTC
GTCGTGCGGC CCAGGGGCGT CGAATACGAC GGGACGGCCC CGACCCTGCT CTACGCCTAT
GGCGGCTTCC AGGTGTCGAT GACCCCGGCC TATTCGGGCG TGATGGGCAA GCTGTGGCTG
GAGCGCGGCG GGACCTATGT GGTGGCCAAT ATCCGCGGCG GCGGCGAGTT CGGCCCCGCC
TGGCACGAGG CGGCCCTGAA GGCCAATCGC CAGAAGGCCT ATGACGACTT CTTCGCCGTC
TCCCAGGACC TGATCGACCG CAAGATAACC TCGCCGCGCC ATCTGGGGAT CATGGGCGGC
AGCAATGGCG GCTTGCTGAT GGGCGTGGCC CTGACCCAGC GGCCCGAGCT CTACAACGCC
GTCGTCGTGC AGGTGCCGCT GTTCGACATG ATCCGCTACA GCCAGATCGG GGCCGGGGCC
TCGTGGGTGG GCGAATATGG CGACCCGGCC ATTCCGTCGG AACGGGCGGT GATCGCCAGG
TACGATCCCT ATTCCAACCT CAAGGCCGGC CAGAACTATC CCGAGGTGTT CATCGAGACC
TCGACCAAGG ACGACCGCGT CCACCCCGCC CACGCCCGCA AGGCCGCCGC GCGGCTGGAG
GCGCTGGGCT ATCCGGTGCT GTACTACGAG AACATCGACG GCGGCCACGC CGCCAGCGCC
AACCTGGCCG AGACCGCCCG CCGCCAGGCC CTGGAATATG TCTACCTGTC GAAGAAGCTG
ATGGATTGA
 
Protein sequence
MRSLLLVLLA STSLMTTAAH AADPAHANTP IPDKEARTPL ADLGKDDPYR WMEEIEGERP 
LAWAKAQNTR SLAVLQRDTR YAELESQALA ILNAKDRVPG VSFAGDGNLR NFWQDADHVR
GLWRATTLES YRTAEPAWET LLDIDALSKA ENANWVFKGA DCLPPEDTRC LVTLSDGGKD
AVSIREFDTV TRAFVDPVHG GGFDLPEGKQ SVSWLDKDTL LVAREWEPGQ VTKSGYAYVV
KAWKRGAPLA SAKEVFRGTP DDVAASAYAL TDADGRVVAT LASRAVSFFE SESYFLTAQG
PVKLPLPLKH SIQGYVAGQL AVSLEQDWPE KGFKTGDLVS FDLAALKADP AQAGATLVLR
PTAKQSVESV TATRDKLVVG LLDNVTGVAF AYSHGPKGWT SQKLALPANS TIGLGSASRK
DDRLFVSVTG YLTPSTYWLA DAASLKLEQV KASPARFDAS THVVEQFEAV SSDGVKIPYF
VVRPRGVEYD GTAPTLLYAY GGFQVSMTPA YSGVMGKLWL ERGGTYVVAN IRGGGEFGPA
WHEAALKANR QKAYDDFFAV SQDLIDRKIT SPRHLGIMGG SNGGLLMGVA LTQRPELYNA
VVVQVPLFDM IRYSQIGAGA SWVGEYGDPA IPSERAVIAR YDPYSNLKAG QNYPEVFIET
STKDDRVHPA HARKAAARLE ALGYPVLYYE NIDGGHAASA NLAETARRQA LEYVYLSKKL
MD