Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4286 |
Symbol | |
ID | 5706998 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4861451 |
End bp | 4862473 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641273705 |
Product | DNA-directed RNA polymerase subunit alpha |
Protein accession | YP_001539058 |
Protein GI | 159039805 |
COG category | [K] Transcription |
COG ID | [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit |
TIGRFAM ID | [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.665586 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00623467 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTCATCT CCCAGCGACC GTCCCTCTCC GAGGAGTCGA TCAACGAGAC CCGGTCCCGG TTCACCATCG AACCGCTGGA GCCCGGCTTC GGCTACACCC TGGGCAACTC GCTGCGCCGG ACGCTGCTGT CGTCCATTCC CGGCGCGGCG GTGACCTCGA TCAAGATCGA TGGTGTGCTG CACGAGTTCA CCACGATCCC GGGGGTCAAG GAGGATGTGG TCGAACTCGT CATGAACATC AAGGAGCTGT GCGTCAGCTC CGAGCATGAC GAGCCGGTCA GCATGTACCT GCGCAAGCAG GGCCCGGGTG ACGTGACCGC GGGTGACATC CAGCCCCCGG CCGGTGTCTC GGTACACAAC CCGGACCTGA AGCTCGCCAC CCTGAACGGT AAGGGCCGGC TCGACATGGA GCTGACCGTC GAGCGGGGCC GTGGTTACGT CACCGCGGCG CAGAACAAGC AGGCCGGCGC CGAGATCGGT CGGATCCCGG TCGACTCGAT CTACTCACCG GTACTGCGGG TCACCTACCG GGTCGAGGCG ACCCGAGTCG AGCAGCGGAC CGACTTCGAT CGGCTGATCA TCGACGTCGA GTCCAAGCCG TCGATGGGGC CACGTACGGC CCTGGCCTCG GCCGGCTCCA CGCTGGTCGA ACTCTTCGGC CTGGCCCGCG AGCTGGACGA GACCGCGGAG GGTATCGACA TCGGGCCGTC CCCGCAGGAC GCCCAGCTGG CGGCGGACCT GGCGCTGCCG ATCGAGGAGC TGGACCTCAC CGTCCGCTCC TACAACTGCC TCAAGCGCGA GGGCATCAAC TCCGTTGGTG AGCTCATCGG GCGTACCGAG GCTGACCTCC TCGACATCCG TAACTTCGGT CAGAAGTCGA TCGACGAGGT CAAGATGAAG CTCGCCGGGA TGGGACTGGG GCTGAAGGAC TCGGCCCCGA ACTTCGACCC GGCGAACGTC GTGGACGCCT TCGGTGAGGC TGACTACGAC ACCGAGGACT ACCGCGAGAC TGAGCAGCTG TAA
|
Protein sequence | MLISQRPSLS EESINETRSR FTIEPLEPGF GYTLGNSLRR TLLSSIPGAA VTSIKIDGVL HEFTTIPGVK EDVVELVMNI KELCVSSEHD EPVSMYLRKQ GPGDVTAGDI QPPAGVSVHN PDLKLATLNG KGRLDMELTV ERGRGYVTAA QNKQAGAEIG RIPVDSIYSP VLRVTYRVEA TRVEQRTDFD RLIIDVESKP SMGPRTALAS AGSTLVELFG LARELDETAE GIDIGPSPQD AQLAADLALP IEELDLTVRS YNCLKREGIN SVGELIGRTE ADLLDIRNFG QKSIDEVKMK LAGMGLGLKD SAPNFDPANV VDAFGEADYD TEDYRETEQL
|
| |