Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_2354 |
Symbol | |
ID | 5706938 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 2708937 |
End bp | 2710097 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641271832 |
Product | peptidase M50 |
Protein accession | YP_001537203 |
Protein GI | 159037950 |
COG category | [R] General function prediction only |
COG ID | [COG1994] Zn-dependent proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.176917 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGTCCA AGCGGCAAGC CCCACGCCCG CCTACCCGGC ACTCCGGTGT GACCGTCGGT CGGGTGGTCG GGGTGCCGCT GCGCCTGGAC TGGTCGATGC TGCTGCTCGC CCTGGCCGTC GCCGTGATGT ACGCCGAATT CGCCCGCCAC CAGCTCGCCC TCTCGCCGGC CGGTGGCTAC GTGATCGGCC TCGGCTTCGT GGTTTCGCTG CTCGGGTCGG TGCTCCTGCA CGAACTCGGG CACGCCCTCA CCGCCCGCCG GTACGGCATC GGGGTCCGCG GCATCACCCT GGAGCTGCTC GGCGGCTACA CCGAGATGGA CCGCGACGCC CCGACTCCCC GCGTCGACCT GCTGGTGTCG CTGGCCGGGC CGGCCGTCTC CGCGGTACTG GGCGGGGCAG CGGTCGCCGT CACGATGGCG CTGCCGGACC GTACGGTGGG TCACCAGCTC GCCTTCCAGC TCGCGGTGAG TAACGTCGTT GTCGCAGCGT TCAACGTGCT ACCCGGGCTG CCGCTCGATG GTGGCCGCGC GCTGCGAGCC GCCCTCTGGG CCGCCACCCG GGACCGGCAC CGGGCCACCG AGGTGGCTGG CTGGGTCGGC CGTGTCGTTG CCATCGGTAC CGTCGGGGCG GCAGTCGTCC TTGCCCTCAC CCGTCCCCCG ACACCTCCGG TACTGCTCGC GCTACCACTG ATGCTGCTGG TCGCGTTCAC CCTCTGGCGG GGCGCCGGGC AGTCGATCCG GCTGGCCCGG GTCACCCGCC GGTTCCCGCT GATCGATCTC TCGCGGTTGG CCCGTCCGGT GTGCGCCGTC CCGGCCGGAA CCCCTCTCGC CGAGGCGCAG CGCCGCGCTG CCGGGACCGA CCCTCCGGCC GCGCTGCTGG TCACCGACTC CGCGGGTGGC CCGCACGCCC TGGTCAATCC GGTCGAGGTG GCGGCGGTAG CGGTGGACCG TCGACCCTGG GTGCCGGTGG ACGCGGTGTC CCGGCCACTG GCCGAGGTGC CGGCCGTGTC GGTCGGCCTC GACGGCGAGC AGGTGATGGA GACGGTGCGG CGCCACCCGG GCGCACAGTA CGTGGTGACC TCAGGCGAAG ATGTCGTCGG CATCCTGTAC CTCGCGGATC TGGCTCAGCT ACTCGAACCT CACCGGAAGA TGAACACGTG A
|
Protein sequence | MESKRQAPRP PTRHSGVTVG RVVGVPLRLD WSMLLLALAV AVMYAEFARH QLALSPAGGY VIGLGFVVSL LGSVLLHELG HALTARRYGI GVRGITLELL GGYTEMDRDA PTPRVDLLVS LAGPAVSAVL GGAAVAVTMA LPDRTVGHQL AFQLAVSNVV VAAFNVLPGL PLDGGRALRA ALWAATRDRH RATEVAGWVG RVVAIGTVGA AVVLALTRPP TPPVLLALPL MLLVAFTLWR GAGQSIRLAR VTRRFPLIDL SRLARPVCAV PAGTPLAEAQ RRAAGTDPPA ALLVTDSAGG PHALVNPVEV AAVAVDRRPW VPVDAVSRPL AEVPAVSVGL DGEQVMETVR RHPGAQYVVT SGEDVVGILY LADLAQLLEP HRKMNT
|
| |