Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2242 |
Symbol | |
ID | 5899697 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 2435158 |
End bp | 2437581 |
Gene Length | 2424 bp |
Protein Length | 807 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641562733 |
Product | peptidase S9 prolyl oligopeptidase |
Protein accession | YP_001683867 |
Protein GI | 167646204 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.140093 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCAAT TGCTCAAGCG CCGCGCCGCG GCGATCGCCC TGGCATGCCT GGTCGCTGCC GCGCCTCCCG CGCTGGCGAC CACCAGACCC TACGCTGTCG AGGATCTCCT TGAATTACGC ACCCTGGGTC CGGCGATGAT CGACCCCAGC CAGCGATGGC TGGTCGTCGG CACAACCGCG CCGTGGTCAC AGGCCCCGCG CTACGATCTG GACTGGGCGA CTTTCGAAGC GCTCGGTGAG CTGACGGTAG TGGATTTGGA GGCGCCCGGT TCCCCTCGGC CGCTGCTGCC GACTGAGCGG GGGGTCGGCT ACATCGCCGG CCCCTTCTCG CCCTCGGGAG CGAAGATGGT TGTGTTCCGC CTTCGGGGTC ACAGCCGAGA GATCGGCGTG GTCGTGGTGG CGACCGGGGC GGTTCGGTGG CTCGGCCTGG AGGTCGAGCC CGCGCCTTGG GGCCGATCGG TGCAATGGCG CGACGATGAG TCGGTGGTCG CCATCGTAAG GCCTCCAGGC GCGGCCTCCC GAGGCGTCGG CTACGGATGG CAGGTCCAGG CCCGCCTGGA GGCGCTCTGG GCGCGAGCGG CGCGGGGCGA GGCGGCCGTG ACCGCCTTGG GCGCGGGCCG TTACGCCACA ATCAACCCGC GCCTGCAGGG CGAGCGGCTG GTGGCGGTCG CGGTCTCCAC TGGCGCGGTC CGCCCCCTCG TCGAGGGGGA GATTGTCGAT CTGGAGATCG CCCCAGGCGG CCGCACGGCG GCGATCGTGG TCGAGGGCGA GCCCATCGCC GTCACGGCGA CCGACACCGT CACGCCCTCC ACGTCCTGGC GCCGCCGGCG ACTGCGATTG GTTGATTTGG CCAGCGGCGA CGTCCGCGAC CCCTTCGCCC GTGCCGACCT CCTGCCCACC AGCCTCGTCT GGTCGCCCAG CGGCCGGCGG CTGCTTGCGT TCGCCAGGGC GGACGGCGCG GATTGGGCCT CCGGATCGCT GCGCGAGATC GCCGTCGACG GCGCGGTCAA GGACCTGGCC GGCTCGGGTG TCGCGCCCAA GCTGACCACG GCGCGCGATG GCCAGGTTTC GGTGCGGGCG GGCTTCGTTG GCGAGCGGGC GGCGATCTTC GGCGCGCCGA TCGGCGGCGG GACGGACGTA ACCGAAGCCT GGCGCGGGTG GGACGGCCTG CCTTTGCCGG TGTCGCAGAG CGTTCGGCTC GAGGTCAGCG ATCGCGATGG CGCGGTCTTC TCCGGTCCCG AGGGAGTCGT TCGTCTGAAG GCGAGCGGCG GCCTGCAGCG CCTGGCCTTG CCTGGATCGA GATTGCAGCG CCCGGCCGAA CCTTCGGTGG GCGTTCGGCC GCTGGTCACG CCAACTCTGG GCGGGCAGCT CTGCTGCGCC CTGGTCTCTG CCACGAACTT GAGGGTGGGC GGCAAGACCT TGGCCTTGAA GCCGGATGAG ACCGTTTTGG CCTATGCGCC GACCAAGGGC ATGGTGGTCG TCGAGCAGCG CGCCTCCAGC GGGGTCTCGA CGGTAGCGCT TCGGACGTCG GCCGGCGATC GCCTCCTGCT GACGCTCAAC CCAGCGCTGG CGCAGGTCGA TCGGCCTGTA ATCCAGGCGA TCGGGCATCG TGGCCCCCAG GGCGAGACCT TGCCCAGTTG GCTGTTTCGG CCCGCCGATG CGCCCCCTGA CAAACGGTTG CCGGTGATCA TCGTGCCTTA TCCTGGCTCC ATCTATCCCG CGCCGCCGGC GATGACGCAG CCGCAGCATC CGCAATTTTC GGCGAGCATC CAGGCGATGG TGGGTCAGGG CTACGCCGTC ATCGCGCCCA GCCTGCCCTT GTCGGCGCAG TCGGAGCCCG GCGCGTCCCT CGCCCAGTCA ATGCTGGACA TCGTCGACAA GGCCGCCGTG AGCGGCGGCG TGGATCCGGA CCGGGTCGCG ATATGGGGAC AAAGCTTTGG TGGCTACGCC GCTCTGCTGG CGGCCGTCCA GAGCGAGCGC TTCTCCGCCG TGATCGCTTC GGCGCCGGTC TCTGACCTCG CCAGCTTCTG GGCGGCGGTG CCGCCCCAGG TTTCCCTGAT CGCCGAGCCC GGCCTGCCGG TGGGCGCCTT GGCCGGTTGG GCCGAGGCTG GTCAGGGGCG GATGCTGGGT CCGCCCTGGC AGGACCCCGA ACGCTGGCGG CGCAACAGCC CCCTGTGGTC GGCCCAGCGC GTCAAAGCGC CGGTTCTGCT GATCCAGGGC GACATCGACG CCGATCCAAC CCAGTCGGCC ATGATGTTCC AAGCCCTGGC GCGCCAGAAC AAGGATGTCC TCTGGTTGAC CTATCACGGT GAGGGCCACG TGGTGATCGG GCCGGGCAAT CTGCGTGACC TCTATTCGCG CGCGTTCGCG TTCCTGGCCG ACAGCTTCGC CGCGAAGCGG GCGACGATAG AAGACGCGCC TTAG
|
Protein sequence | MTQLLKRRAA AIALACLVAA APPALATTRP YAVEDLLELR TLGPAMIDPS QRWLVVGTTA PWSQAPRYDL DWATFEALGE LTVVDLEAPG SPRPLLPTER GVGYIAGPFS PSGAKMVVFR LRGHSREIGV VVVATGAVRW LGLEVEPAPW GRSVQWRDDE SVVAIVRPPG AASRGVGYGW QVQARLEALW ARAARGEAAV TALGAGRYAT INPRLQGERL VAVAVSTGAV RPLVEGEIVD LEIAPGGRTA AIVVEGEPIA VTATDTVTPS TSWRRRRLRL VDLASGDVRD PFARADLLPT SLVWSPSGRR LLAFARADGA DWASGSLREI AVDGAVKDLA GSGVAPKLTT ARDGQVSVRA GFVGERAAIF GAPIGGGTDV TEAWRGWDGL PLPVSQSVRL EVSDRDGAVF SGPEGVVRLK ASGGLQRLAL PGSRLQRPAE PSVGVRPLVT PTLGGQLCCA LVSATNLRVG GKTLALKPDE TVLAYAPTKG MVVVEQRASS GVSTVALRTS AGDRLLLTLN PALAQVDRPV IQAIGHRGPQ GETLPSWLFR PADAPPDKRL PVIIVPYPGS IYPAPPAMTQ PQHPQFSASI QAMVGQGYAV IAPSLPLSAQ SEPGASLAQS MLDIVDKAAV SGGVDPDRVA IWGQSFGGYA ALLAAVQSER FSAVIASAPV SDLASFWAAV PPQVSLIAEP GLPVGALAGW AEAGQGRMLG PPWQDPERWR RNSPLWSAQR VKAPVLLIQG DIDADPTQSA MMFQALARQN KDVLWLTYHG EGHVVIGPGN LRDLYSRAFA FLADSFAAKR ATIEDAP
|
| |