Gene Information |
Locus tag | Sare_4650 |
Symbol | |
ID | 5705706 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 5268851 |
End bp | 5270080 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641274050 |
Product | RNA polymerase ECF-subfamily sigma factor |
Protein accession | YP_001539397 |
Protein GI | 159040144 |
COG category | [K] Transcription |
COG ID | [COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
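Note: the coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, which is consistent with the values in this record, and that the unspliced CDS ends in a single stop codon. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length = gene_length_bp(5268851, 5270080)
print(length)                     # 1230
print(protein_length_aa(length))  # 409
```

Both results agree with the Gene Length (1230 bp) and Protein Length (409 aa) fields.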
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.267181 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.578872 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGGCGGG TCAGCGAGGC GGTCGCCACC GCCTACGCCG AGCACTGGGG TCGCATTGTC GCCCTGCTGA TCCGGCTTGC TGGGGACTGG GACCTGGCGG AGGAGTGCGC CCAGGACGCC TTTGCCGAGG CGCTCACGCG CTGGCCGACG GAGGGGATAC CGAACCGCCC TGGGGGATGG CTCACCACCA CGGCACGTAA TCGTGCGGTG GACCGGCTCC GTCGGTCCAC CGTGGAAGCC AGAAAGCTGC GCGACGTGTC CCGGCTGGGG CAGCCGGCCC CGGTCGGCAA CTTTCCCGAC GAACGCCTCG AGCTGATGTT CACCTGTGCG CACCCGGCGC TCACCAGCGA GGCGCAGGTC GCGCTCATTC TGCGCTGTCT GGTCGGGCTC CGGACCGCGG AGATCGCTCG GGCGTTCCTG GTGTCGGAAC ACACCATGGG GCAACGGCTG TTCCGCGCGA AGAACAAGAT CCGCCATGCC GGAATCCCGT TTCGGGTGCC GCCGGCCCAG CTGCTGCCGG AGCGACTGTC GGCCGTACTC GCGGTGCTCT ACCTGCTGTT CAACGAGGGC TACGCGGCGA CCGCCGGTAC GAACCTCGTC AAGGCCAGCC TCTCCGGAGA GGCGATCCGG CTGGCCCGGC TCCTCACCAC CCTGATGCCG GCTGAGCCCG AAGCGCGCGG ACTACTTGCG CTCATGCTGC TGCACGACGC CCGCCGTGCG TCTCGCGTAG ATGAGCACGG TGACCTCGTC ACCCTCGCCG ACCAGGACCG TTCGGCCTGG AACCACACCC AGATCGCCGA AGCGGTCGCA CTGCTGGAAC AGGCGCTGGC CCAGCGCCGC CCCGGCGCCT ACCAGGTGCA GGCGGCGATC GCCGCGGTCC ACGCCGAGGC GTCCGAGGCG GCGACGACGG ACTGGCCGCA GATCGTCGGC CTGTACGCGC AACTCATCCG CCTGGCACCC AGCCCGGTCG TCGAGCTCAA CCGGGCGGTG GCCGTGGCGA TGACCGACGG GCCCGAAGCC GGACTGGCGT TGGTGGATCG CCTGGCCGCC GCCGGTACGC TCAACGACTA CTACCTGCTG CCGGCGACCC GGGCCGACCT GCTGCGCCGC CTGGGAAAGC ACTCCGAGGC GACGGTCGCC TACCGCCGGG CACTCGATCT GTGCGCTACC GACGCCGAGC GCCGGTACCT GTGCCGGCGC CTGCGCGAGG TGTCGGCACG CCTCTCGTAG
|
Protein sequence | MGRVSEAVAT AYAEHWGRIV ALLIRLAGDW DLAEECAQDA FAEALTRWPT EGIPNRPGGW LTTTARNRAV DRLRRSTVEA RKLRDVSRLG QPAPVGNFPD ERLELMFTCA HPALTSEAQV ALILRCLVGL RTAEIARAFL VSEHTMGQRL FRAKNKIRHA GIPFRVPPAQ LLPERLSAVL AVLYLLFNEG YAATAGTNLV KASLSGEAIR LARLLTTLMP AEPEARGLLA LMLLHDARRA SRVDEHGDLV TLADQDRSAW NHTQIAEAVA LLEQALAQRR PGAYQVQAAI AAVHAEASEA ATTDWPQIVG LYAQLIRLAP SPVVELNRAV AVAMTDGPEA GLALVDRLAA AGTLNDYYLL PATRADLLRR LGKHSEATVA YRRALDLCAT DAERRYLCRR LREVSARLS
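Note: both sequences above are printed in space-separated blocks of ten characters. A minimal sketch for turning such a field into a contiguous string and computing its G+C fraction is shown below. The helper names are hypothetical, and the example uses only the first two blocks of the gene sequence, so its result is not the whole-gene GC content reported in the record.

```python
def join_blocks(grouped: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(grouped.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# First two display blocks of the gene sequence in this record.
prefix = join_blocks("GTGGGGCGGG TCAGCGAGGC")
print(prefix)               # GTGGGGCGGGTCAGCGAGGC
print(len(prefix))          # 20
print(gc_fraction(prefix))  # 0.8
```

Applying the same two helpers to the full Gene sequence field would give a value to compare against the GC content field.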