Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0033 |
Symbol | |
ID | 5105172 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 28796 |
End bp | 32170 |
Gene Length | 3375 bp |
Protein Length | 1124 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640505927 |
Product | DNA-directed RNA polymerase subunit B |
Protein accession | YP_001190134 |
Protein GI | 146302818 |
COG category | [K] Transcription |
COG ID | [COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit |
TIGRFAM ID | [TIGR03670] DNA-directed RNA polymerase subunit B |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000339757 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0488457 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTTCAG TTGACGATAG ATGGGTAATA GTAGAGGCAT ACTTTAAGTC CAGAGGGTTA GTAAGACAGC ATTTAGACTC ATTTAATGAT TTTATAAAGA ATAAGCTTCA GGAAATTATA GATGAGCAGG GAGAAATAAT CACCGAGATA CCTGGTCTAA AGATTAGACT AGGTAAGATA AGGGTAGGAA AACCTAGGGT TAGGGAAGCA GACAAGGGCG ACAAGGAGAT AACGCCCATG GAAGCCAGGC TGAGAAACCT GACCTACGCT GCCCCAATTT TCCTAACCAT GACACCTGTA GAGAACAACA TAGAGGGAGA TCCCACCGAG GTTCATATAG GCGATATTCC TATTATGCTC AAATCGATAG CTGATCCAAC TTCGGAGTTC AGCCAAGACA AACTAGTGGA AATAGGAGAG GATCCAAAGG ACCCAGGTGG ATACTTTATC ATTAATGGAA GTGAAAGAGT AATCGTAACT CAGGAAGACT TAGCAACAAA TAGAGTTCTG ACTGATGTGG GAAAGGCGGG CTCCAATGTT ACCCATACAG CTAAAATAAT TTCGAGCACC TCTGGGTATA GGGTTCCTAT CACCATAGAG AGATTGAAAG ATTCCACAAT CCACGTTTCA TTCCCTTCAG CGCCCGGTAG AATTCCATTC GCAATATTAA TGAGAGCCCT TGGGGTAGAG ACCGACGAAG ATATTACACT GGCAGTGTCC CTTGACCCAC AAATACAGAA CGAACTTTTA CCTTCTTTGG AACAGGCGAG CTCTATTTCT TCAAAGGAAG ACGCATTGGA CTTCATAGGC AACAGGGTAG CTATAGGTCA GAAAAGGGAA ATGAGGATAC AGAAAGCTGA ACAGATTTTA GATAAATATT TCTTACCACA TATAGGTACT AATCCATCAG ATAGAGTAGC CAAGGCTTAC TATCTGGCCT TCGCAGTCTC AAAGCTGATC GAGCTTTATC TTGGCAGAAG GGAACCCGAC GACAAGGACC ATTATGCTAA CAAGAGGCTG AAATTGGCGG GAGACTTATT CGCTAGTCTC TTTAGAGTAG CGTTTAAGGC CTTTGTTAAG GACCTAGTGT TTCAGCTCGA GAAGTCAAAG GTTAGGGGAA GAAGGTTAGC CATTAATGCC CTTGTAAGGC CTGACATAAT TTCGGAGAGA ATTAGGCATG CACTTGCCAC AGGAAATTGG GTTGGAGGAA GAACAGGCGT AAGCCAGCTT TTAGATCGTA CCAATTGGTT ATCAATGTTA AGCCATCTAA GGAGGGTTGT ATCTTCCCTA GCCAGGGGTC AGCCTAACTT TGAGGCCAGG GATCTACACG GAACCCAATG GGGCAGGATG TGCCCATTCG AGACACCAGA GGGTCCTAAC AGCGGTCTTG TGAAGAACAT AGCTCTCCTG GCCCAGATAT CAGTTGGAAT AAATGAGAAG AGCCTTGAGA GAACTCTTTA CAATTTGGGT GTAGTCCCTA TAGACGACGC CATAAAGAAG GTTAAGTCTG AGGAAGGCCA ATCTGAGAAT TATCAGACCT GGAGCAAGGT AATTCTAAAC GGGAGACTAA TAGGCTATTA TCCTGATGGG CGTGAACTGG CAGAGAAAAT AAGAGAGAGC AGAAGAGAGG GTGAGTTAAG CGATGAGGTT AATGTAAGCT ATAACATTAC AGAAACCTTT AACGAGGTAT ACATTAATTC AGATAGTGGA AGAGTAAGAA GGCCCCTTAT CGTTGTTAAG CACGGTAAAC CACTTGTGAC TAAGGAAGAT ATAGAGAACC TGAAAAAGGG TAAGATAACC TTTGATACCT TAGTTAAAGA GGGGAAAATT GAATTCCTTG ATGCAGAGGA AGAAGAGAAT GCGTATATAG CTCTTGAGCC AAAGGATGTA ACCAAGGACC ATACCCACTT AGAGATATGG GCTCCAGCAA TTCTAGGCAT AACAGCTTCA ATTATTCCAT ACCCTGAGCA CAACCAGTCT CCTAGAAATA CCTATCAGTC GGCTATGGCA AAGCAGGCCT TAGGACTGTA TGCAGCCAAT TATCAAATAA GGACAGACAC TAGAGCTCAT CTCCTTCATT ATCCTCAGAA GCCAATAGTC CAAACTAGGG CTTTAGAGGC CATAGGATAT ACGGAGAGAC CTGCAGGAAA TAATGCGATA TTTGCCCTTA TGTCCTACAC AGGATACAAC ATGGAAGATG CCGTTATCAT GAATAAGTCG TCCGTGGATA GAGGGATGTT TAGGTCCACT TTCTTCAGGC TTTACTCCGC CGAGGAAATC AAGTATCCAG GAGGGCAAGA GGATGAAATC CTTCTTCCCG AGCCCGGCGT AAGGGGATAT AAGGGAAAGG ATTACTATAG GCTTCTGGAA TCTAACGGAA TAGTTTCTCC TGAGGTTGAC GTTAAGGGAG GAGATGTATT AATAGGGAAG GTAAGCCCAC CAAGGTTCCT TCAGGAATTC AAGGAATTGT CACCTGATCA GGCTAAGCGT GATACGTCGA TAATAACTAG GCACGGAGAA CGTGGTACTG TAGATCTAGT ATTGGTGACC GAGACCTCTG AGGGAAATAA GTTAGTAAAG GTGAGAGTTA GGGACCTGAG AATACCAGAA ATAGGAGACA AATTCGCTAC TAGACATGGG CAGAAGGGAG TAATAGGAAT GCTAATACCC CAAGTGGACA TGCCATATAC AACTAGTGGG CTAGTTCCAG ATATAATACT GAATCCTCAT GCGCTTCCCT CCAGAATGAC TGTAGGTCAG ATAATGGAAG CGATAGCAGG AAAGTTCGTA GCAGCCACTG GTAATCCTAT TGACGCAACA CCGTTCTATA ATACTCCCAT AGAGGAAATT CAGAGGAAGC TTCTGGAACA TGGTTATCTA AGCGACGGAT CAGAGGTTGT ATATGATGGT AGAACAGGAG AGAAATTGAA AGGAAGAATA TTGTTTGGTA TAGTTTACTA TCAAAAATTA CACCATATGG TAGCGGATAA GATGCATGGG CGTGGTAGAG GGCCAGTTCA AATACTCACT AGACAGCCCA CCGAAGGAAG AGCTAGGGAA GGAGGACTTA GATTTGGGGA AATGGAAAGG GACTGCCTTA TAGGATACGG TGCAGCGATG TTAATCAAGG ATAGATTACT AGATAACTCA GATAGGGCAA CAGTATACGT TTGTGAACAG TGCGGATATG TAGGCTGGTA TGATAGGACT AAGAACAAGT ATATATGCCC AATTCATGGT GATAAGACAA CTTTATACCC CGTGGTAATA TCATATGCCT TCAAGTTGTT GCTTCAGGAA CTTATGAGCA TGGTTATCGC ACCTAAGTTA GTTCTGGGAG ACAAAGTTCC TGTTGGAGGT AATTCCAATG AGTGA
|
Protein sequence | MLSVDDRWVI VEAYFKSRGL VRQHLDSFND FIKNKLQEII DEQGEIITEI PGLKIRLGKI RVGKPRVREA DKGDKEITPM EARLRNLTYA APIFLTMTPV ENNIEGDPTE VHIGDIPIML KSIADPTSEF SQDKLVEIGE DPKDPGGYFI INGSERVIVT QEDLATNRVL TDVGKAGSNV THTAKIISST SGYRVPITIE RLKDSTIHVS FPSAPGRIPF AILMRALGVE TDEDITLAVS LDPQIQNELL PSLEQASSIS SKEDALDFIG NRVAIGQKRE MRIQKAEQIL DKYFLPHIGT NPSDRVAKAY YLAFAVSKLI ELYLGRREPD DKDHYANKRL KLAGDLFASL FRVAFKAFVK DLVFQLEKSK VRGRRLAINA LVRPDIISER IRHALATGNW VGGRTGVSQL LDRTNWLSML SHLRRVVSSL ARGQPNFEAR DLHGTQWGRM CPFETPEGPN SGLVKNIALL AQISVGINEK SLERTLYNLG VVPIDDAIKK VKSEEGQSEN YQTWSKVILN GRLIGYYPDG RELAEKIRES RREGELSDEV NVSYNITETF NEVYINSDSG RVRRPLIVVK HGKPLVTKED IENLKKGKIT FDTLVKEGKI EFLDAEEEEN AYIALEPKDV TKDHTHLEIW APAILGITAS IIPYPEHNQS PRNTYQSAMA KQALGLYAAN YQIRTDTRAH LLHYPQKPIV QTRALEAIGY TERPAGNNAI FALMSYTGYN MEDAVIMNKS SVDRGMFRST FFRLYSAEEI KYPGGQEDEI LLPEPGVRGY KGKDYYRLLE SNGIVSPEVD VKGGDVLIGK VSPPRFLQEF KELSPDQAKR DTSIITRHGE RGTVDLVLVT ETSEGNKLVK VRVRDLRIPE IGDKFATRHG QKGVIGMLIP QVDMPYTTSG LVPDIILNPH ALPSRMTVGQ IMEAIAGKFV AATGNPIDAT PFYNTPIEEI QRKLLEHGYL SDGSEVVYDG RTGEKLKGRI LFGIVYYQKL HHMVADKMHG RGRGPVQILT RQPTEGRARE GGLRFGEMER DCLIGYGAAM LIKDRLLDNS DRATVYVCEQ CGYVGWYDRT KNKYICPIHG DKTTLYPVVI SYAFKLLLQE LMSMVIAPKL VLGDKVPVGG NSNE
|
| |