Gene Information |
Locus tag | Sare_3648 |
Symbol | |
ID | 5703342 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 4210382 |
End bp | 4211527 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641273073 |
Product | peptidase M50 |
Protein accession | YP_001538437 |
Protein GI | 159039184 |
COG category | [R] General function prediction only |
COG ID | [COG1994] Zn-dependent proteases |
TIGRFAM ID | |
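The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which matches the stated gene length) and that the CDS is unspliced and ends in a single stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Sare_3648
length = gene_length_bp(4210382, 4211527)
print(length, protein_length_aa(length))
```

With the values in this record, the computed lengths agree with the listed 1146 bp and 381 aa.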
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0203123 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGGCAA GCTTCCGGCT CGGCCGGATC GTGGGCGTAC CGGTCGGGGT CAACTGGAGT GTCCTGGTCA TCTTCCTGTT GATCGCTTGG GCGCTGTCCG CCAACCAGTT CCCCCGCACC TACCCGGACC GGCCGGGATT CGCATACGTC CTCGCCGGTC TCGCCGCCGC CGTGGTCTTC TTCCTCGGCC TGCTCGCCCA CGAGGTCTCC CACGCGGTCG TCGCCAAGCG CAACGGCCTC GAGGTCGGCG GGATCACGCT GTGGCTCTTC GGCGGGGTCG CCGAGCTGAA GGGTGAGGCG CCGAACCCGG GTGCGGAGCT GCGCATCGCG GGAATCGGGC CGCTGGTGAG CCTGCTGATC GGCGCGTTCT TCGGTGTGAT CGCGGCGGGC CTGGCGATGA CCGGGTACAA CGGGCTGTGG CTCGGCGCGC TGGCCTGGCT CGCCGGCATC AACGTGCTGC TGGCCGTCTT CAACGTGCTG CCCGCCGCCC CGTTGGACGG CGGGCGACTG CTACGCGCGG CGGTCTGGAA GGCCACCGGG GACCGGACCA AGGCGTCGGT GGTGGCCGCC CGGGCCGGCT GGGTGCTCGG CGCGCTCCTG ATCGGTCTGG GTTTCTGGCA GTTCTTCGCC GGCGGTGGGT TCGGTGGCCT GTGGTTGATC CTGATCGGGT GGTTCCTGAT CGGCGCCGCC GGGATGGAGG AGCGGCAGGC CCGGATGGGC AGTGCCCTGC ACGGCCTACG GGTCGCCGAC GTGATGACAC CGCAGCCGCA GACCGCCTCC GGTGACATGA CCGTCGCCGA CTTCGTCGGC CACTACCTCT TCGCGTACCG GCACTCGGCA TTGCCACTGG TGGAGGACGA TCGGCCGGTC GGGCTGGTCA CCCTCGACCG GGTTCGCGGC GTGCCGGCCG ACCAGCGGTA CGGCACCACC CTCGCCGAGG TCGCCTGCCG CGCCGACGAC CTGGTACTCG CCTCCCCGGA CGAGTCACTG AACGAACTCC TGCCCCGGCT CAGCGAGTGC GCCGACGGGC GGGCGCTCGT GGTGGTCGAC GGACGGCTGG TCGGAATCGT GTCACCCAGC GACATCAGCC GGGCGGCACA GCGTGGCAGC ATGCGCGAGC AGATGGCCGG CCGGCCTGTC CCCTGA
Protein sequence | MRASFRLGRI VGVPVGVNWS VLVIFLLIAW ALSANQFPRT YPDRPGFAYV LAGLAAAVVF FLGLLAHEVS HAVVAKRNGL EVGGITLWLF GGVAELKGEA PNPGAELRIA GIGPLVSLLI GAFFGVIAAG LAMTGYNGLW LGALAWLAGI NVLLAVFNVL PAAPLDGGRL LRAAVWKATG DRTKASVVAA RAGWVLGALL IGLGFWQFFA GGGFGGLWLI LIGWFLIGAA GMEERQARMG SALHGLRVAD VMTPQPQTAS GDMTVADFVG HYLFAYRHSA LPLVEDDRPV GLVTLDRVRG VPADQRYGTT LAEVACRADD LVLASPDESL NELLPRLSEC ADGRALVVVD GRLVGIVSPS DISRAAQRGS MREQMAGRPV P
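The GC content field can in principle be recomputed from the gene sequence listed above. The following is a minimal sketch, assuming the sequence is pasted as shown here, in space-separated blocks of ten bases. The function name `gc_fraction` is hypothetical, and how the database rounds the reported percentage is not stated in the record.

```python
def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide string, ignoring whitespace."""
    bases = "".join(seq.split()).upper()  # drop the spaces between 10-base blocks
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Example on the first block of the gene sequence only
print(gc_fraction("ATGAGGGCAA"))
```

Passing the full gene sequence instead of the first block gives a value that can be compared with the listed GC content.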