Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1193 |
Symbol | |
ID | 5898648 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 1253878 |
End bp | 1255806 |
Gene Length | 1929 bp |
Protein Length | 642 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641561676 |
Product | peptidase S9 prolyl oligopeptidase |
Protein accession | YP_001682821 |
Protein GI | 167645158 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGAAGT TTTTCGCCGG CGCCGCGATG GCCGCGCTGA TCGGGTCGAC CACCGCCGCG GCCGCGCCGA TCGAAGCGTA TGGTCGCCTA CCAGCCATGA GCGACGTGTC GATCTCGGCC AACGGCGCCA ATCTCGTCTA CATCCTGAAC GAGGGCGGCA CGGCCACCGT CGTCGCCCAG GGCCTGGACG GCACGGTGCT GCAATCGGCC AATCTGGGCG CGCGCAAGGT GCGGGGGGTG GCCTGGATCG ACGACACCCA CGCGATGATC GAGATCTCGT CGACGGCCGG CATCGAGGGC GTGGCCTATG TCCACGAATG GTTTCAGGCC GTCAGCCTCA ACATCAAGAC CGGCGCGGTC GTGCGGGTGC CCGACAGCGC TGCGTCGAAA GACCTGCTGA ACGTCATGAT GAGCACGCCC CAGGGCGGAA CCTATGGCGG CAAGTCGGTG ATCTGGGCCA GCCTGTACGC CAAGGACGGG ACCAGCATCC AGGACGACGG ACACATGGAT TTCTACCGTG TGGACCCCGA CACCGGCGTG GGCCGCGTCC AGCAACCCGG CGACGGCGAG ACCCAGGAGT TCCTCGCCAA GCCGGACGGC ACGGTGCTGG CGCGCGTCAA GTACAAGCCC AAGGATGGTC ACTGGCGGCT CGAGCTGCGC CATTCCGGCT GGGACGAGGC CTATTCGGTG TTCGCTCCGA TCGATACGCC GGACCTGATC GGCCGGTCGC TGGACGACAA GTCCCTGATC CTGCGCCTGT GGGATGACAA GGAAGAGATG TGGCGGCTGG CGCCCGTCTC CCTGGCCGAC GGCAAGATCG GCGACTATTT CGGCCCGGAC AGGCCGTTCG GCGTCGTCAC CGACGACGAA CAACGTTTGA TCGGCCTGTC CTCCACCGAC GTCTACACGG AATACGAGTT CTTCGAGCCC CGCCTGAAGG CGGTCTGGCC GCAGGTGCGC CAGGTGTTCG CCGGACGCCA GGTGACCCTG ACCTCCAACA CGCCCGACTA CGCCAAGCTG ATCGTCTATG TCGAAGGCAC CGGCGAGCCC GGCGGCTACT ATCTGGTCGA TCTGGGAGCC AAGAAGGTCA AGCGGATCGG CGCGGCCTAT CCCGCCCTGA CCGGCGGCGA CATCGCCCAG GTACAGGCGA TCAAGTACAA GGCCGCCGAC GGCCTGGAGA TTAACGCCTA CCTTACCCTG CCCAACGGCA GGCCCGCCAA GAGCCTACCC CTGATCGTTT TCCCGCACGG CGGGCCGCAG TCGCGCGATG GCGCGGGTTT CGACTGGTGG GCCCAGGCCA TGGCTTCGCG CGGCTATGCC GTGCTGCAGC CCAACTTCCG CGGCTCGTCC GGCTATGGCC GCAAGTTCGT GGAGGCCGCA TACGGTGAGT GGGGCGGCAA GATGCAGACC GACCTGTCCG ACGGCGTCCG CGCCCTGGCC AAGGCAGGCA CGATCGATCC CAAGCGCGTC TGCATCGTCG GGGGCAGCTA CGGCGGCTAC GCGGCCCTGG CTGGCATTAC CCTGGACAAG GGCGTCTACC GCTGCGCCGT GGCCGTGGCC GGCGTGTCCG ACATGGGTAA GATGCTCGAC CGCGAAACGG CTCGGTCGGG CGCCGACAGC AGCACCGTCC GTTACTGGAA ACGCTACATG GGCGTGGAGA AGTCGTCGGA CGCCTTGCTT AACCAGCGCT CGCCGGTGAA CTTTGCCAAC AACGCCGACG GCCCTGTCCT CCTGATCCAT GGCAAGGACG ACACCGTGGT CAACTATGAT CAGAGCGCAG CCATGCGCCA TGCCCTCGAA AAGGCCGGGA AGCCGGTGGA ACTGGTCACG CTGAAGGCCG AAGACCACTG GCTTTCACGT GAAGGCACCC GCCAACAGAT GCTGTCGGAG ACCGTCACCT TCCTGGAAAA GAACAACCCG CCGAACTAG
|
Protein sequence | MLKFFAGAAM AALIGSTTAA AAPIEAYGRL PAMSDVSISA NGANLVYILN EGGTATVVAQ GLDGTVLQSA NLGARKVRGV AWIDDTHAMI EISSTAGIEG VAYVHEWFQA VSLNIKTGAV VRVPDSAASK DLLNVMMSTP QGGTYGGKSV IWASLYAKDG TSIQDDGHMD FYRVDPDTGV GRVQQPGDGE TQEFLAKPDG TVLARVKYKP KDGHWRLELR HSGWDEAYSV FAPIDTPDLI GRSLDDKSLI LRLWDDKEEM WRLAPVSLAD GKIGDYFGPD RPFGVVTDDE QRLIGLSSTD VYTEYEFFEP RLKAVWPQVR QVFAGRQVTL TSNTPDYAKL IVYVEGTGEP GGYYLVDLGA KKVKRIGAAY PALTGGDIAQ VQAIKYKAAD GLEINAYLTL PNGRPAKSLP LIVFPHGGPQ SRDGAGFDWW AQAMASRGYA VLQPNFRGSS GYGRKFVEAA YGEWGGKMQT DLSDGVRALA KAGTIDPKRV CIVGGSYGGY AALAGITLDK GVYRCAVAVA GVSDMGKMLD RETARSGADS STVRYWKRYM GVEKSSDALL NQRSPVNFAN NADGPVLLIH GKDDTVVNYD QSAAMRHALE KAGKPVELVT LKAEDHWLSR EGTRQQMLSE TVTFLEKNNP PN
|
| |