Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_0858 |
Symbol | |
ID | 5705123 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 959000 |
End bp | 960193 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641270377 |
Product | peptidase M42 family protein |
Protein accession | YP_001535767 |
Protein GI | 159036514 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | [TIGR03106] hydrolase, peptidase M42 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.967973 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0317113 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCCGA ACAAGCCCGC ACCGCTGCCG CTCGACCTCG ACTACCTACG CCAAGTGCTG GTTGAGCTGC TGGAGATCCC GAGCCCGTCC GGCCGCACCG ATCACGTACA GCAGTACGTG GGCGAGCGGC TGGCGGCGCT CGGGATCCCG TCGACGCTGA CCCGGCGGGG TGCCCTCAGC GCCTGCCTCC CGGGACCGCG TACCACCGGT GCGGACCGGG CGATCGTGGT GCACACCGAC GTCATCGGCG GAATGGTCAA ACGGCTCAAG GAGAACGGCC GGCTGGAGCT CAAACCGATC GGGACACACA GTGCGCGCTT CGCGGAGGGC GCCCACGTAC GCGTCTTCAC CGACCACCTG GATCAGGTGA TCACCGGCCA GGTGCTACCG CTCAAGGCCA GTGGCCACCG CTACCACGAG GCGGTGGACT CCCAGGGCAT CGGCTGGGAG TTGGTCGAGG TCCGGGTGGA CGAGCCGGTG GACGACATCG CCGGCCTGCG CGCACTGGGG ATCGACGCGG GCGACTTCGT GGCGCTCCTG CCCAACCCGC AGGTCACCCC CTCCGGGTAT GTCAAATCCC GCCACCTGGA CGACAAGGCG GGCGTGGCGG CGGTGCTGAC CGCCTGCAAG GCGTTGGTCG ACGCGGGTGT CACCCCGGCG GTCAGCGCAC ACTTGTTGAT CACTGTCACG GAGGAGATCG GCCACGGCGC CTCGCACGGG CTGGATCCGG ATGTGGCCGA GATCGTCTCG GTGGACGCGG CCGTGGTGGC CCCCGGGCAG CAGTCCCGGG AGGATGCGGC AACCCTGGCG ATGGGCGACG GGGTCGGCCC GTTCGACTAC CACCTGACCC GCAACCTGGC GGCGATCGCC CGTGAGCACG ACGTCGACCT GGTCCGCGAT GTCTTCGACT ACTACCGTTC GGACGTCGCG GCGGCGGTCG AGGCCGGTGC GCACGCCCGG GTGGCGCTGC TCGGGTTCGG GGTGGACGCC ACCCACGGCC ATGAACGCAC CCACCTGGAC GGGCTGCGTC ACCTGACCCA ACTGCTGTGC CTCTACCTCC AGAGCGAGTT GGTCTTCCCG GAGTGGGACG CGGAACCGGC GGGCGAACTC GCCGACTTCC CGTCGCTGGC CGTTCAACCC GCCCAGGAGG ACGGGCCGCG GGACGGTCCG ATCGGCATCA CCGCCGTTTC GTGA
|
Protein sequence | MTPNKPAPLP LDLDYLRQVL VELLEIPSPS GRTDHVQQYV GERLAALGIP STLTRRGALS ACLPGPRTTG ADRAIVVHTD VIGGMVKRLK ENGRLELKPI GTHSARFAEG AHVRVFTDHL DQVITGQVLP LKASGHRYHE AVDSQGIGWE LVEVRVDEPV DDIAGLRALG IDAGDFVALL PNPQVTPSGY VKSRHLDDKA GVAAVLTACK ALVDAGVTPA VSAHLLITVT EEIGHGASHG LDPDVAEIVS VDAAVVAPGQ QSREDAATLA MGDGVGPFDY HLTRNLAAIA REHDVDLVRD VFDYYRSDVA AAVEAGAHAR VALLGFGVDA THGHERTHLD GLRHLTQLLC LYLQSELVFP EWDAEPAGEL ADFPSLAVQP AQEDGPRDGP IGITAVS
|
| |