Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0035 |
Symbol | |
ID | 5105174 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 34829 |
End bp | 36007 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640505929 |
Product | DNA-directed RNA polymerase subunit A'' |
Protein accession | YP_001190136 |
Protein GI | 146302820 |
COG category | [K] Transcription |
COG ID | [COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit |
TIGRFAM ID | [TIGR02389] DNA-directed RNA polymerase, subunit A'' |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00000531723 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.113614 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATAAATA ACACAGATAA TGAATATCTG GAACAGAAGT TATCCGAATT AAGTAAAAAA GTTCCTGCCT CAATTATATC AAAGTTAAGG GAATCAATAA CTAATTCCCC TATAGAGATT ACAAGGGATG AAATAGACAA AATAATAGAA ATAGTAATGA AAGACTATCT AAGTTCCCTT GTCCATCCTG GAGAGGCAAT AGGTGTTGTG GCTGCCCAAT CAATAGGCGA GCCGGGTACG CAGATGACAT TGAGGACCTT CCACTTTGCG GGTGTAAGGG AGTTAAACGT AACCCTAGGT CTTCCGAGGC TCATAGAGAT TGTAGATGCG AGGAAAGTTC CTTCAACTCC TATGATGACC ATTTATCTCA ACGAGGAGTA TGCAAAGGAC CGTGATATGG CATTAGAGGT TGCTAGGAGA ATCGAATATA CTAGAGTGGA GCATGTGGTG GAGACGGTTA ACCTGGACGT TGGTGGTATG GGCATTATAC TCAAACTTGA TCCAGTACTG CTTAAAGATA AGGGATTATC CACAGAGGAT GTGGAAAAGG TCATCAAGAA GCTCAAGATG GGCGATTATA GGGTAGAAAA CTCCGATGAA TATACAATAG CGATATACTT TGAAAACATG GAGACAGTGA CCGGATTATT CAAGGCTAGA GAGAAAATCC TTTCAACGAA AATAAAAGGG GTTAAGGGAA TTAAGAGGGC GATAATCAGG AAAAAGGGTG ACGAATACGT TATTATTACC GATGGGTCTA ACCTTGAGGG TGTTCTTGGA GTTAAGGGAG TTGATGTCTC GAGGATAGAG ACAAATAATC TACACGAGGT AGAAAGCGTG CTGGGTGTGG AGGCAGCAAG GGAGTTAATA ACGAGAGAGA TAAAGAGGGT CCTCGAGGAA CAGGGTCTGG ACGTCGATAT TAGACATATC GAACTAGTTT CCGATATAAT GACAAGGACA GGGGAAGTTA GGCAGATAGG GAGGCATGGT GTCACAGGTG AAAAGACTAG CGTATTGGCA AGAGCCGCCT TCGAGGTTAC AGTAAAACAC CTTCTCGATG CCGCTGCTAG AGGTGACATG GAAGAGTTTA AAGGAGTGGT AGAAAACATT ATTATAGGCC AACCAATTAA GCTGGGTACC GGAATGGTTG AACTTTTAAT GAGACCCGCA AATAGGTGA
|
Protein sequence | MINNTDNEYL EQKLSELSKK VPASIISKLR ESITNSPIEI TRDEIDKIIE IVMKDYLSSL VHPGEAIGVV AAQSIGEPGT QMTLRTFHFA GVRELNVTLG LPRLIEIVDA RKVPSTPMMT IYLNEEYAKD RDMALEVARR IEYTRVEHVV ETVNLDVGGM GIILKLDPVL LKDKGLSTED VEKVIKKLKM GDYRVENSDE YTIAIYFENM ETVTGLFKAR EKILSTKIKG VKGIKRAIIR KKGDEYVIIT DGSNLEGVLG VKGVDVSRIE TNNLHEVESV LGVEAARELI TREIKRVLEE QGLDVDIRHI ELVSDIMTRT GEVRQIGRHG VTGEKTSVLA RAAFEVTVKH LLDAAARGDM EEFKGVVENI IIGQPIKLGT GMVELLMRPA NR
|
| |