Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3006 |
Symbol | |
ID | 5900461 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 3272267 |
End bp | 3273175 |
Gene Length | 909 bp |
Protein Length | 302 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641563503 |
Product | peptidase U32 |
Protein accession | YP_001684631 |
Protein GI | 167646968 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0826] Collagenase and related proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.154697 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCCC CCCGGCTGGA GCTGGCCGTC GGGCCGCTGC TGTTCAACTG GTCGCCCGAT CGCGTCGCGG CCTTCTACGA CCAGATCGCC GCTGACCCGG CGATCGATCG CGTCTATCTG GGCGAGGTAG TGTGCGGCAA GCGTGGGCCG TTGCTGGCGC AGACCTTGGC CCAGGCCGCC ATCGGCCTGG AGGCGGCGGG CAAGACCGTC GTCTGGTCGA CCCTGGCCCT GCCCGCCTTG CCACGTGATC GCCAGGCGAT CGCCGCCCTG GCGGCCGACC CGGGCCTGAT CGAGGTCAAC GACCTCAGCG CCCTGGCCCA TCGCCCGCTC GGCGCGCCCT TCGTCGCCGG ACCCATGCTC AACATCTACA ACGAGGCGGC GGCCGGCGAA CTGATCGCCC GCGGCTGCGT GCGCCTGTGC GCCAATGTCG AGCTGTCGCT TCCGACCCTG GCGGCGCTGT CGGCGCGCTG TCCGGGACTG GAGATCGAGC TCTTCGCCTT CGGCCGGTTG CCGCTGGCCC TGTCGGGCCG CTGCTATCAC GCCCGCCATC ACGGCCTGCA CAAGGACAAT TGCCAGTTCG TCTGCGATCG CGACCTCGAC GGCCTGGCGG TCGAGACCCT CGACGCGGTC GGCTTCCTGG CGGTCAATGG CGTCCAGACC CTGTCCCACG GCGTGCAGGT GGCCGACAAT CCGTTGGCCG AGCTCCGCGC CGCCGGCGTC ACCTGCCTGC GCCTGTCACC CCATTCCGGC GACATGGGCA AGGTGATCGG CGGCTTTCGG GCCTATGCCG ACGGCGAGCT GACGCCCGCC GACCTGGCGA CGGCGATCCT CGCCGCCGAT CCGCCCGGTC CCCTGGTCAA TGGCTACCTT CAGGGTCAGG CCGGCGCGCG GTGGACGGCC CGGTCATGA
|
Protein sequence | MSAPRLELAV GPLLFNWSPD RVAAFYDQIA ADPAIDRVYL GEVVCGKRGP LLAQTLAQAA IGLEAAGKTV VWSTLALPAL PRDRQAIAAL AADPGLIEVN DLSALAHRPL GAPFVAGPML NIYNEAAAGE LIARGCVRLC ANVELSLPTL AALSARCPGL EIELFAFGRL PLALSGRCYH ARHHGLHKDN CQFVCDRDLD GLAVETLDAV GFLAVNGVQT LSHGVQVADN PLAELRAAGV TCLRLSPHSG DMGKVIGGFR AYADGELTPA DLATAILAAD PPGPLVNGYL QGQAGARWTA RS
|
| |