Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_0956 |
Symbol | |
ID | 5704492 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 1078024 |
End bp | 1079124 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641270471 |
Product | RNA polymerase ECF-subfamily sigma factor |
Protein accession | YP_001535859 |
Protein GI | 159036606 |
COG category | [K] Transcription |
COG ID | [COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.112676 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0742081 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTCGCCG CGACGGTGCG CGTCACCGGC GACCTCGACC TCGCCGAGGA GTGCGTGCAG GACGCGTACG TGATCGCGCT CGACGCCTGG CTGCACGACG GCGTCCCGGA CAACCCCGGC GCTTGGCTGA CGACGACCGC ACGGCGGCGG GCCCTCGACG GCCGCCGCCG CGAGCGCACC CTGCGCGCCA AGCTGCCGCT GCTGGTCGAA CCCGAGGAGT CCACGGTGGA GGACATCACC GACGACCGGC TGCGCCTGCT CTTCACCTGC TGCCACCCGG CCCTCACCCG GGAGGCACAG GTCGCGCTCA CGCTCCGGCT GGTCTGCGGG CTCACCACAG CCGAGATTGC TCATGCGTTT TTGATCTCCG AGGCGACGAT GGCGGCCCGG CTCACCCGGG CCAAGAAGAG GATCGCCGCG GCCCGGATCG CCTACCGCGC GCCGGCTCCC GAGGAGCTGC CGGACCGGCT GGACGCGGTA CTGACCGTGG TGCACCTGCT CTACACCACC GGGCACACCG CCCCGGCCGG GGACCGGCTG GTGCGGGTGG ACCTGGTGGA GAAGGTGTTC GACCTGGCCC GGATGCTGCG GATGCTCATG CCCGATGAGC GGGAGGTACG CGGGCTGCTG GCCCTGCTGC TGCTCACCGA CGCCCGCCGG GCGACCCGAA CGGCTACCGA TGGGCGGCTG CTTCTCCTCG CCGAGCAAGA CCGCGGCCGG TGGGACCGGG CATTGATCGC CGAGGGCGCG GCGCTGGTCC CGGGCGCGCT GCGCGGCGGG GCCGGCCGTT TCGCGCTGCA GGCGGCCATC GCGGCGCTGC ACGCCGAGGC ACCCACCTAC GAAGACACCG ACTGGCGCCA GATCGTCGGC CTGTACGACG TGCTGCTGAC GGTCTGGACG TCACCGGTGG TGGCCCTGAA CCGGGCGGTC GCTGTGTCCA TGGCGGACGG ACCGACCGCC GCCCTGGCAA CCATCGAGGC GCTGGACGCC GACGGCCGGC TCGCCGGTTA CCGGTACCTG CCGGCGACTC GGGCTGACCT GCTGCGGCGG CTGGGCCGGC ACACCGAGGC GGCGGCAGGT ACCGGCAGGC GCTGGAGCTG A
|
Protein sequence | MLAATVRVTG DLDLAEECVQ DAYVIALDAW LHDGVPDNPG AWLTTTARRR ALDGRRRERT LRAKLPLLVE PEESTVEDIT DDRLRLLFTC CHPALTREAQ VALTLRLVCG LTTAEIAHAF LISEATMAAR LTRAKKRIAA ARIAYRAPAP EELPDRLDAV LTVVHLLYTT GHTAPAGDRL VRVDLVEKVF DLARMLRMLM PDEREVRGLL ALLLLTDARR ATRTATDGRL LLLAEQDRGR WDRALIAEGA ALVPGALRGG AGRFALQAAI AALHAEAPTY EDTDWRQIVG LYDVLLTVWT SPVVALNRAV AVSMADGPTA ALATIEALDA DGRLAGYRYL PATRADLLRR LGRHTEAAAG TGRRWS
|
| |