Gene Information |
Locus tag | Sare_1424 |
Symbol | |
ID | 5704813 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 1647859 |
End bp | 1648845 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641270934 |
Product | RNA polymerase sigma factor SigB |
Protein accession | YP_001536315 |
Protein GI | 159037062 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
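The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, assuming the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 1647859
END_BP = 1648845


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 987
print(expected_protein_length(length_bp))  # 328
```

Both results agree with the Gene Length (987 bp) and Protein Length (328 aa) fields in the record.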
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0865315 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000203032 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATGGACC ACAGGATGCG CGCCACAAGC GGCACCGACA GCCTGACCGA TCTGGATGCC ACTGACGAGC GCGGTGTATC CACTGATCTG GTTCGGGCCT ACCTCTACGG CATCGGCAAG ACGAAGCTGC TGACCGCCGC TCAGGAGGTG GAGCTGTCCC GCCGAATCGA GGCCGGGCTC TTCGCCGAGG CGAAGTTGGC CGCCTGCACG CCGGTCTCCG CCACGCTCCG GGCCGACCTG GAACTCGTCG CCGTCGAGGG GCGCGCCGCC AAGGACCACC TGTTGGAGGC GAACCTCCGC CTGGTGGTCA GCATCGCCAA GCGGTACACC GGCCGTGGGA TGGCCTTCCT CGACCTGATC CAGGAAGGCA ACCTCGGCCT GATCCGCGCG GTCGAGAAGT TCGACTACAC CAAGGGCTAC AAGTTCTCCA CCTACGCCAC CTGGTGGATC CGCCAGGCCA TCACCCGCGC CATGGCCGAC CAGTCCCGCA CCATCCGCAT TCCGGTACAC ATGGTCGAGC AGGTCAACCG GATGGTACGG ACGCGGCGTG ACCTGTCGGT CTCGCTTGGT CGGGAGCCCA CGGTCACGGA GGTGGCCCGC GCGTTGGACG TCCCGGAAGT CCAGATCATC GAGCTGATCT CGTACGACCG GGAGCCGGTG AGCCTGGACC AGGCCGTCGG CGAGGACGGC GAGAGCCCAC TCGGCGACTT CGTCGCGGTG GTGAACGCGA CGGCCGCGCC CGACAACACC GCCGAGCGAG GCGAGCTGCG TCAGGAGGTA CGGGGTGTGC TCGCCACCCT GTCCCAGCGG GAACAGGCGG TGATCCGGCT CCGGTTCGGG CTGGACGACG GGCGACAGCG CACCCTGGAC GAGGTCGGTC GGGAATTCGG CCTCTCCCGG GAGCGGATCC GCCAGATCGA GAAGGGGACA CTGCGCAAGC TACGCGCCCC GGAGCGGGCG CAGCGGCTGG CGGCGTACGC CTGCTGA
|
Protein sequence | MMDHRMRATS GTDSLTDLDA TDERGVSTDL VRAYLYGIGK TKLLTAAQEV ELSRRIEAGL FAEAKLAACT PVSATLRADL ELVAVEGRAA KDHLLEANLR LVVSIAKRYT GRGMAFLDLI QEGNLGLIRA VEKFDYTKGY KFSTYATWWI RQAITRAMAD QSRTIRIPVH MVEQVNRMVR TRRDLSVSLG REPTVTEVAR ALDVPEVQII ELISYDREPV SLDQAVGEDG ESPLGDFVAV VNATAAPDNT AERGELRQEV RGVLATLSQR EQAVIRLRFG LDDGRQRTLD EVGREFGLSR ERIRQIEKGT LRKLRAPERA QRLAAYAC
|
| |
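The sequences above are printed in space-separated blocks of 10 characters, so the spaces have to be stripped before any computation. The following is a minimal sketch of how a GC percentage like the one in the GC content field could be computed. The helper names are hypothetical, and only the first 10-base block of the gene sequence is used as a demonstration, so the result is not expected to match the 68% reported for the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First block of the gene sequence shown above (demonstration only).
first_block = clean_sequence("ATGATGGACC")
print(gc_percent(first_block))  # 50.0
```

To check the full gene, the complete Gene sequence string would be passed through clean_sequence first and then to gc_percent.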