Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0107 |
Symbol | |
ID | 5897819 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 116410 |
End bp | 119274 |
Gene Length | 2865 bp |
Protein Length | 954 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641560591 |
Product | peptidase M16 domain-containing protein |
Protein accession | YP_001681743 |
Protein GI | 167644080 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.224669 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCGTC CCGCCGCCAA GGCCGCCCTC GTGGCGCTGG CCCTCTCGAC CACCGCCCTT TCGTCGATGA CCTTGGCGCC CTTGGCCCAC GCCCAGACGC AAGCCAAGCC GGCCGCCGCC GGGGTCGCCG TGCCGCCGAT CGTCTACAAG GAACGGACCC TGGCCAACGG CATGAAGGTC TACACCTCGC GGGACGCGAC CACGCCCAAC GTGACGGTGC AGGTCTGGTA CGGGGTCGGC AGCAAGGACG ATCCGGAGGG CCGCTCGGGC TTCGCGCACC TGTTCGAACA CCTGATGTTC AAGTCGACGC GCAACATGCC CAACGAGGCC TTCGACCGCC TGACCGAGGA CGTCGGCGGC TTCAACAACG CCTCCACCTA CGACGACTTC ACCAACTATT ACGAGGTCGT GCCCGCCAAC CACCTGCAGC GCCTGCTATG GGCCGAGGCT GAGCGCCTGG GCTCGCTGGT CATCAACGAC GCAGTGTTCA AGTCCGAGCG CGACGTGGTC AAGGAAGAGC TGCGCCAGCG GGTTCTGGCC AATCCCTATG GGCGGTTCTT CAACCTCTAC ATCACCCAGG CCTCGTTCGC GCAGCATCCC TACAAGCGCC CGGGCATCGG CTCGATCGAG GAGCTGGACG CCGCCACGGT CGACGACGTG CGCGCCTTCC ACGCCGCCTA TTACCGCCCC GACAACGCCG CCCTGATCGT GGTCGGCAAC TATGACGAAG CCCAGCTCAA CGCCTGGATC GACCAGTACT TCGCGCCGCT GAAGACCCCG GCCGGGGCGA TCAAGCCGGT GAGCGTGGTC GAGCCGCCCC GCGCCGGACC CAAGACCGTC ACCACCTACG GCCCCAACGT GCCGCTGCCG GGCGTGGCCA TGACCTGGCT GGCTCCGGCC GCCGCCGATC CGGACGCCCC GGCCCTGTCT GTGCTGGACG CCATCCTGTC GGCCGGCAAG TCGTCGCGCC TCTACAACAG CCTGGTCTAC GACCAGCAGA TCGCTCAGCA GATCTTCTCC TCGACCAGCA CCAACGCTCA GCCCGGCATC TTCTACGTCG GCGCGATCAT GGCGGGCGGC AAGACCGTGG AGCAGGGCGA GGCCTCGCTG TCGGCCCAGG TCGCCAAGCT GCGCGACGCC CCGCCGACGC CCGCCGAACT GGCCGAGGCC AAGGCCGGCC TGCTGGCCGA CGCCGTGCGC GGTCGCGAGA CCATCGACGG CCGCGCCTTC GCCATCGGCT ACGCCCTGCG CACCGAGGGC GACGCCCAGC GCGCCAACAC CGACCTGGCC GCCCTGCAGG CGGTCACCGC CGCCGACGTC CAGCGCGTGG CCAGGAAGTA CCTGACCGAC AACGGCCGCA CGACGATCCG CTACCTGCCG GAATCGTCCA AGCCCGCCGG CGCCAAGGAC CCGTTCGAGG CGCCCAAGCC CACCGCCTCG GTCAGCTACG GCGGCCCGGT CACCACCCTG GCCCCCGAGG GCGAACGCAT CGCCCCGCCG GCCCCGGGCA AGCCGGTGTC GCCGGTCCTG CCCTCGCCCG TCGAGAAGAC CCTGGCCAAC GGCCTGCGGG TCATCGTCGC CAAGTCCAGC GACCTGCCGC TGATCACCGC CGACCTGACC GTCAAAGGCG GCTCGGGAGC CGATCCCGCC GGCCTGGCGG GCGTCTCCAG CCTGACCGCC GAGCTGCTGA CCGAGGGCAC CAAGACCCGC ACCGCCACGC AAGTCGCCGC CGCCACCGAG GCCCTGGGCG CCAATCTGGA GGCCGGTTCG GGCTGGGAGG CCTCGTCCCT GACCCTCAGC GTCATCGCCG ACAAGGCTCC GCAAGGCCTG GCGATCATGG CCGACGTGGC CGAGAACCCG GCCTTCAAGG TCGAGGAGCT GGAGCGGGTG AAGACCGAGG CCCTGGACGG GCTGTCCGTC GCGTTCCAGC GCCCGGGCAG CGTCGCCGGC TTCGTCGTGC CGACCGTGAT CTATGGCGGC TCAGGCTTTG GCCACGTCTC CGGCGGCACA CCCGGCTCGC TGCCGAAGAT CCAGCGCGAC GCCCTGGTCA AGACCCACGC CGCCCACTGG CGTCCGGACA ACGCCATCCT GGTGCTGACC GGCGACCTGA CGCCGGAGCA GGGCTTCGCC TTGGCCGAGA AGGCGTTCGG CGGCTGGGCC AAGCCGGCCG GTCCGCCGCC GGCCCCGGTC AAGGGTCCCG CCGGCTATGC GCCCAGGAAC ATCGTCATCG ACCTGCCGGG CACCGGCCAG GCCGCGGTCG TGGTGACCAA GCCGGCGATC CTGCGCGCCG ACCCGCGCTA CTATGCCGGC CTGGTGGCCA ACGGCGTGCT GGGCGGCGGC TATTCCTCGC GCCTGAACCA GGAGATCCGC ATCAAGCGCG GCCTGTCCTA TGGCGCCGGC TCCAGCCTGT CGCCGCGCGC GGCGATCGGC GGTTTCTCGG CCAATGTCCA GACCAAGAAC GAGTCCGCCG CCCAGGTGGT GACCCTGATC AAGGGCGAAC TGAGCCGGCT GGGCGCCGAG CCCACCTCGA CCGCCGAGCT GGCGGCCCGC AAGTCGGTGC TGGTCGGCGA CTTCGGCCGC GACCTGGGCA CGTCGGGCGG CTTGGCCGAC ATCCTGGGCA ACCTGGCCGT CTACGGGGTC CCACTGAACG AGATCCAAGC CTATACCGGC AAGGTCGAGG CGGTGACCGC CGCCGACGTC CAGGACTTCT CCAAGGCGGT GCTCGACCCG GCCCAGACCA GCGTCATCGT GGTCGGCGAC GCCAAGACCT TCGGCGACAC GGTCAAGGCG GTGCTGCCGG GCGCGACCGA GATCCCGATC GATCAGTTGG ATCTCGACAG CCCCACCTTG AAGAAAGCCA AGTAG
|
Protein sequence | MIRPAAKAAL VALALSTTAL SSMTLAPLAH AQTQAKPAAA GVAVPPIVYK ERTLANGMKV YTSRDATTPN VTVQVWYGVG SKDDPEGRSG FAHLFEHLMF KSTRNMPNEA FDRLTEDVGG FNNASTYDDF TNYYEVVPAN HLQRLLWAEA ERLGSLVIND AVFKSERDVV KEELRQRVLA NPYGRFFNLY ITQASFAQHP YKRPGIGSIE ELDAATVDDV RAFHAAYYRP DNAALIVVGN YDEAQLNAWI DQYFAPLKTP AGAIKPVSVV EPPRAGPKTV TTYGPNVPLP GVAMTWLAPA AADPDAPALS VLDAILSAGK SSRLYNSLVY DQQIAQQIFS STSTNAQPGI FYVGAIMAGG KTVEQGEASL SAQVAKLRDA PPTPAELAEA KAGLLADAVR GRETIDGRAF AIGYALRTEG DAQRANTDLA ALQAVTAADV QRVARKYLTD NGRTTIRYLP ESSKPAGAKD PFEAPKPTAS VSYGGPVTTL APEGERIAPP APGKPVSPVL PSPVEKTLAN GLRVIVAKSS DLPLITADLT VKGGSGADPA GLAGVSSLTA ELLTEGTKTR TATQVAAATE ALGANLEAGS GWEASSLTLS VIADKAPQGL AIMADVAENP AFKVEELERV KTEALDGLSV AFQRPGSVAG FVVPTVIYGG SGFGHVSGGT PGSLPKIQRD ALVKTHAAHW RPDNAILVLT GDLTPEQGFA LAEKAFGGWA KPAGPPPAPV KGPAGYAPRN IVIDLPGTGQ AAVVVTKPAI LRADPRYYAG LVANGVLGGG YSSRLNQEIR IKRGLSYGAG SSLSPRAAIG GFSANVQTKN ESAAQVVTLI KGELSRLGAE PTSTAELAAR KSVLVGDFGR DLGTSGGLAD ILGNLAVYGV PLNEIQAYTG KVEAVTAADV QDFSKAVLDP AQTSVIVVGD AKTFGDTVKA VLPGATEIPI DQLDLDSPTL KKAK
|
| |