Gene Information |
Locus tag | Sare_0509 |
Symbol | |
ID | 5705527 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 578448 |
End bp | 579518 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641270035 |
Product | hypothetical protein |
Protein accession | YP_001535429 |
Protein GI | 159036176 |
COG category | |
COG ID | |
TIGRFAM ID | |
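The length fields in this record agree with its coordinates. The sketch below shows the arithmetic, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state the convention) and that the final codon of the CDS is a stop codon. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(578448, 579518)
print(length, protein_length(length))  # prints "1071 356"
```

Both values match the Gene Length (1071 bp) and Protein Length (356 aa) fields above.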
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.259003 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00742101 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGCCCAG TACGCTTCGT CGCCCTCTCC GAGGACGGCC AGGCACTGGT ACTCACCGAC GAGGTTGGGC GACTTCTCGC GCTACCCATC GACGAGCGCG TCTCGACCGC CCTGCACACC GAGCCCGGGG CCGCGCCTCT GGCCGTGGCC TCGACGTCGG GCGCCGACCC GACCCCGTCC CTGTCCCCGC GAGACATCCA GGCCCGGATC CGCGCCGGCG AGTCCGCCGA GGATGTCGCC CGGATCGCCG GCGTGCCGGT GGACCGCGTG CTGCGCTACG CCGGCCCGGT TCTCCAGGAG CGAGCCATGC TCGCCCAGCA CGCCCGTCGC ACCCGCCTGC GTGGAGCGGA GAAGCCGACC CCGCTCGCCG AGGTGGTCAA CGGTCGACTG GCCCAACACG GCATCGACAC GGAAAAGATC TCGTGGGATG CGTGGCGCCG TGACGACGGT GCCTGGCGGA TCGTCGCCAC CTGGCCCTCC GGCAAGGCCA CCGCCCAAGC AGTCTGGGAT CTGGAGAAGA CCCGGCAGTC GGTCACGCCG CACGACGACA TGGCCCAGTA CCTCTGCGCC GAGCGGCCCA CGCCGATCCT CGGCCAGGAG CCGGCGCCCG AGCGGGGCGG CCACGGGCTG CCCGGCCCGG CGCGGGCCGA ACCCGGTCGC GGTGGGCACG GCCTACCGAG CCCGGCCGAG CCCAACCGGC CGAGCCGTGA TCCGATCCGC GCCGGTCGGG ACGCGCTGCT CGCCTCCCTG GATCGCCCAC TCGGCGGTGC CTCCGGCCGT GGCCTCGAGC CACGGACTCC GGCCAGCCCG GAGGCACCGC GTTCGCGACC AGTCGGCGGC GGCGCGGCGG CGCTGCTCGG CGGCGGCCCG GGATCAGCCT TCGACGACGA CTCGGACGCG CCGAAGGAGG TGCCGGCCGT CCCGTCACTG GCCGTGCTCC GACCACGCCG CACGGGTACC GCCACGGCGG GTGGCACCGA GCAGGGCGAG GGCAGCAAGC CACGCAAGCG GCTACCAAGC TGGGACGACG TGCTCTTCGG GAGCGCGCCG GCGGCCCGCG AGTCCTCCTA G
|
Protein sequence | MRPVRFVALS EDGQALVLTD EVGRLLALPI DERVSTALHT EPGAAPLAVA STSGADPTPS LSPRDIQARI RAGESAEDVA RIAGVPVDRV LRYAGPVLQE RAMLAQHARR TRLRGAEKPT PLAEVVNGRL AQHGIDTEKI SWDAWRRDDG AWRIVATWPS GKATAQAVWD LEKTRQSVTP HDDMAQYLCA ERPTPILGQE PAPERGGHGL PGPARAEPGR GGHGLPSPAE PNRPSRDPIR AGRDALLASL DRPLGGASGR GLEPRTPASP EAPRSRPVGG GAAALLGGGP GSAFDDDSDA PKEVPAVPSL AVLRPRRTGT ATAGGTEQGE GSKPRKRLPS WDDVLFGSAP AARESS
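The sequences above are displayed in space-separated blocks of ten letters, so they need cleaning before any computation. The following is a minimal sketch, using only the Python standard library, of how one might strip the spacing, estimate GC content, and check for a terminal stop codon. The helper names are hypothetical, and the stop codon set is the standard one for translation table 11 (TAA, TAG, TGA).

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove the whitespace that separates the 10-letter display blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned DNA sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Only the first two display blocks of the gene sequence are used here;
# paste the full "Gene sequence" field to analyze the whole CDS.
example = clean_sequence("ATGCGCCCAG TACGCTTCGT")
print(len(example), gc_fraction(example))
```

Applied to the full Gene sequence field, `gc_fraction` can be compared against the GC content reported in the record, and `ends_with_stop` should hold because the sequence ends in TAG.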
|
| |