Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_2501 |
Symbol | |
ID | 5703951 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 2858664 |
End bp | 2859779 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641271965 |
Product | hypothetical protein |
Protein accession | YP_001537335 |
Protein GI | 159038082 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.942803 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0300138 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCATCA ATCGCCGTAC CCTGCTCGGT CGGGTCGCCG TGGTCGGCAC GGGTATTGCG GCCGGTGGGC TGCTTGCTCC CGACGCGGCC CGAGCCGCGT TCTGGAAGAA GCGCCTCACC GGCGCCGACC TGGACACCAA CCGCCGGTGG CAGATCGCCG GGACCGACCT CGGCATCCCC TACGTACTGG AGAACGGCTC CATCGGGTAC CTCCTCGGCG ACACCTTCAA CACCCCGTGG CCTGAGGGCC CGCCGCTGCC CAACGACTGG CGCTCACCGG TGATGCTGCG CTCCCACGCC CACCCTGGCG CCGCCGACGG TGTGGTCTTC GACAACGCCG CCGGGGTGCT CGGCGACGGG CGGGCGCCGG AGCTGATGCA CAACGGCCAC CGGGGCATCG GCATCGACGG CCTCTGGGAG GTGACCGTCA TCCCCAACGA CGGCATCAGC TTCCCGGAGA CCGGCCGGCA GGTGATCTCG TACATGAGCA TCGAGTACTG GACACCGCCC GGGCAACCCG GAGCCCGCTG GCGGTCGCGC TACGCCGGGC TGGCCTTCAG CGACAACGGT AACGACTTCA CTCGCACGTC GCTGACGTGG TGGAACGACA GCACCAACAC CGACCCGTTC CAGATGTGGA CGATGCAGCG TGACGGCGAC TGGGTGTACG TCTTCTCGGT GCGCCCGGGG CGCCAGGACG GTCCGATGAT GCTGCGTCGG GTCTTCTGGG ATCGGATGTT CTATCCCGAG TCGTACGAGG GCTGGGGCTG GAACGGCAGC ACCTGGGGCT GGGGCCGGCC GTGCACGCCG ATCCTGACCG GCTCGTTCGG GGAGCCCTCG GTCCGGCGGC TCGCGGACGG CACCTGGGTG ATGTCCTACC TCAACTGCGT CACCGGGTGC GTCGTCACCC GCACCGCCGG CGGGCCGGAC CAGGCCTGGA CGGCGGAAAA GGTGCAGATC ACGCCGTGGC AGGAGCCGGG GCTCTACGGC GGGTTCATCC ACCCGTGGTC CAGCCGGCAG GTCAACGACC TGCATCTGAT GGTCTCGACG TGGACCACGA CACCCGATAA CCGAAGCACC GCCTACCACG TCAGCCAGTT CGTCGGCACT GCCTGA
|
Protein sequence | MAINRRTLLG RVAVVGTGIA AGGLLAPDAA RAAFWKKRLT GADLDTNRRW QIAGTDLGIP YVLENGSIGY LLGDTFNTPW PEGPPLPNDW RSPVMLRSHA HPGAADGVVF DNAAGVLGDG RAPELMHNGH RGIGIDGLWE VTVIPNDGIS FPETGRQVIS YMSIEYWTPP GQPGARWRSR YAGLAFSDNG NDFTRTSLTW WNDSTNTDPF QMWTMQRDGD WVYVFSVRPG RQDGPMMLRR VFWDRMFYPE SYEGWGWNGS TWGWGRPCTP ILTGSFGEPS VRRLADGTWV MSYLNCVTGC VVTRTAGGPD QAWTAEKVQI TPWQEPGLYG GFIHPWSSRQ VNDLHLMVST WTTTPDNRST AYHVSQFVGT A
|
| |