Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4672 |
Symbol | |
ID | 5704851 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 5292116 |
End bp | 5293492 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641274070 |
Product | SCP-like extracellular |
Protein accession | YP_001539416 |
Protein GI | 159040163 |
COG category | [S] Function unknown |
COG ID | [COG2340] Uncharacterized protein with SCP/PR1 domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.988125 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.181044 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTATGGCT GGAGCGAATC GATGGAGCCG GATGACAACC GCCGGCGATC CGAACCAACG GCCGACGGCT GGGACCGGCC CGCGGACGGC TGGGACGGCC CTACCGACGG CTGGGATCGG CCTGCCGACG GCTGGGACCG GCGGCAGCAG AACGCGCACT GGCGGGCCGA GTCCGAGACC CCGCAGGGCT GGGGAACCGC GGGCGGCTCG TACCACGACC AGCCCTCGCA AAGTGGGGAG TCCGTCCCCA CCGGGCAGCA GCGTGGGTCC ACCGACGGCT GGCAGCAGGA ACTATCGGGC GGCTGGCGCC GCCGGCAGCC CACGGACTGG CGGAACGAGT CCTCCGGAGG CTGGCGTCAC GCGGCTGGTG CCACCGGCGG ACCCGAGGTC ACCGGCGGTT GGCGGCCCGA CGAGACGACC GCCAGGCCGA CCAGCGGGCA CGCAGACGAC CTGACGAGCA CGCGGCCGAT CGCTGGCGCC GCGCCGGCCG ACGCTACACC CGCCAGCAAG GGCCATCGGG CCGGCCACCG GAACCGGCGG CCCGTGCTCA TCGGGGCCGC GGCGGCTGCC ACGCTGGTCG TGAGCCTCGG AGTCGGCACC GCCGCACTCT CCGACGGTGG CGACACGATC CCCACCTCGG CCCACAAGGA CATGGCGGCC ACGAGTCCGA CGTCGTACGA GAGGGGCATG ACGTCGTCCT CGACCAACAC CGCGTGGGGC AACGGGTCAC GGTCGCGGTC AGATTCCGCG CACCGAAAGT CAGCCTCGAA GGCGTCGCGC CACTCCGCCA AGTCCCGCTC AGACCAGGAA CGCAAGAGGG CCAGGCACCG CCCCAAGCCG GCACACACCA CCACCGCGCC CAGTCCCACT CAGGTCCCCA CGACCGCTCC TAGTCCCACT CAGGTCCCCA CGACCGCACC CAGTCCCACT CAGGTCCCCA CGACCGCACC CAGCCCCACT CAGGTCCCCA CCACCGTGCC CCCCAACGGT GACGTGAGCA CCGAGGCCAG CGAAGTGGTC CGGCTGGTTA ACGCCGAACG TGCGAAGGCC GGCTGCGAGG CGCTGAGTAT CAACGAGAAG CTGATGACCG CTGCCCAGCA ACACAGCCAG GACCAGGCCG ACCACCAGAA GATGTCACAC ACGGGCAGCA ACGGCAGTAG CCCCGGCGAC CGGATCAACG CTGTCGGCTA CGAGTGGCGC GCCTACGGGG AGAACGTCGC CTGGAACCAG CAGTCACCCG AGGCCGTGAT GGACGCGTGG ATGAATAGCT CCGGCCACCG GGCGAACATC CTGAACTGCT CCTTCACCGA GATCGGAGTC GGTGTCGCGA GCAGCAACGG ACCGTACTGG ACACAGGTCT TCGCCGCGCC TCGCTGA
|
Protein sequence | MYGWSESMEP DDNRRRSEPT ADGWDRPADG WDGPTDGWDR PADGWDRRQQ NAHWRAESET PQGWGTAGGS YHDQPSQSGE SVPTGQQRGS TDGWQQELSG GWRRRQPTDW RNESSGGWRH AAGATGGPEV TGGWRPDETT ARPTSGHADD LTSTRPIAGA APADATPASK GHRAGHRNRR PVLIGAAAAA TLVVSLGVGT AALSDGGDTI PTSAHKDMAA TSPTSYERGM TSSSTNTAWG NGSRSRSDSA HRKSASKASR HSAKSRSDQE RKRARHRPKP AHTTTAPSPT QVPTTAPSPT QVPTTAPSPT QVPTTAPSPT QVPTTVPPNG DVSTEASEVV RLVNAERAKA GCEALSINEK LMTAAQQHSQ DQADHQKMSH TGSNGSSPGD RINAVGYEWR AYGENVAWNQ QSPEAVMDAW MNSSGHRANI LNCSFTEIGV GVASSNGPYW TQVFAAPR
|
| |