Gene Msed_0033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0033 
Symbol 
ID5105172 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp28796 
End bp32170 
Gene Length3375 bp 
Protein Length1124 aa 
Translation table11 
GC content44% 
IMG OID640505927 
ProductDNA-directed RNA polymerase subunit B 
Protein accessionYP_001190134 
Protein GI146302818 
COG category[K] Transcription 
COG ID[COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit 
TIGRFAM ID[TIGR03670] DNA-directed RNA polymerase subunit B 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000339757 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0488457 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTTCAG TTGACGATAG ATGGGTAATA GTAGAGGCAT ACTTTAAGTC CAGAGGGTTA 
GTAAGACAGC ATTTAGACTC ATTTAATGAT TTTATAAAGA ATAAGCTTCA GGAAATTATA
GATGAGCAGG GAGAAATAAT CACCGAGATA CCTGGTCTAA AGATTAGACT AGGTAAGATA
AGGGTAGGAA AACCTAGGGT TAGGGAAGCA GACAAGGGCG ACAAGGAGAT AACGCCCATG
GAAGCCAGGC TGAGAAACCT GACCTACGCT GCCCCAATTT TCCTAACCAT GACACCTGTA
GAGAACAACA TAGAGGGAGA TCCCACCGAG GTTCATATAG GCGATATTCC TATTATGCTC
AAATCGATAG CTGATCCAAC TTCGGAGTTC AGCCAAGACA AACTAGTGGA AATAGGAGAG
GATCCAAAGG ACCCAGGTGG ATACTTTATC ATTAATGGAA GTGAAAGAGT AATCGTAACT
CAGGAAGACT TAGCAACAAA TAGAGTTCTG ACTGATGTGG GAAAGGCGGG CTCCAATGTT
ACCCATACAG CTAAAATAAT TTCGAGCACC TCTGGGTATA GGGTTCCTAT CACCATAGAG
AGATTGAAAG ATTCCACAAT CCACGTTTCA TTCCCTTCAG CGCCCGGTAG AATTCCATTC
GCAATATTAA TGAGAGCCCT TGGGGTAGAG ACCGACGAAG ATATTACACT GGCAGTGTCC
CTTGACCCAC AAATACAGAA CGAACTTTTA CCTTCTTTGG AACAGGCGAG CTCTATTTCT
TCAAAGGAAG ACGCATTGGA CTTCATAGGC AACAGGGTAG CTATAGGTCA GAAAAGGGAA
ATGAGGATAC AGAAAGCTGA ACAGATTTTA GATAAATATT TCTTACCACA TATAGGTACT
AATCCATCAG ATAGAGTAGC CAAGGCTTAC TATCTGGCCT TCGCAGTCTC AAAGCTGATC
GAGCTTTATC TTGGCAGAAG GGAACCCGAC GACAAGGACC ATTATGCTAA CAAGAGGCTG
AAATTGGCGG GAGACTTATT CGCTAGTCTC TTTAGAGTAG CGTTTAAGGC CTTTGTTAAG
GACCTAGTGT TTCAGCTCGA GAAGTCAAAG GTTAGGGGAA GAAGGTTAGC CATTAATGCC
CTTGTAAGGC CTGACATAAT TTCGGAGAGA ATTAGGCATG CACTTGCCAC AGGAAATTGG
GTTGGAGGAA GAACAGGCGT AAGCCAGCTT TTAGATCGTA CCAATTGGTT ATCAATGTTA
AGCCATCTAA GGAGGGTTGT ATCTTCCCTA GCCAGGGGTC AGCCTAACTT TGAGGCCAGG
GATCTACACG GAACCCAATG GGGCAGGATG TGCCCATTCG AGACACCAGA GGGTCCTAAC
AGCGGTCTTG TGAAGAACAT AGCTCTCCTG GCCCAGATAT CAGTTGGAAT AAATGAGAAG
AGCCTTGAGA GAACTCTTTA CAATTTGGGT GTAGTCCCTA TAGACGACGC CATAAAGAAG
GTTAAGTCTG AGGAAGGCCA ATCTGAGAAT TATCAGACCT GGAGCAAGGT AATTCTAAAC
GGGAGACTAA TAGGCTATTA TCCTGATGGG CGTGAACTGG CAGAGAAAAT AAGAGAGAGC
AGAAGAGAGG GTGAGTTAAG CGATGAGGTT AATGTAAGCT ATAACATTAC AGAAACCTTT
AACGAGGTAT ACATTAATTC AGATAGTGGA AGAGTAAGAA GGCCCCTTAT CGTTGTTAAG
CACGGTAAAC CACTTGTGAC TAAGGAAGAT ATAGAGAACC TGAAAAAGGG TAAGATAACC
TTTGATACCT TAGTTAAAGA GGGGAAAATT GAATTCCTTG ATGCAGAGGA AGAAGAGAAT
GCGTATATAG CTCTTGAGCC AAAGGATGTA ACCAAGGACC ATACCCACTT AGAGATATGG
GCTCCAGCAA TTCTAGGCAT AACAGCTTCA ATTATTCCAT ACCCTGAGCA CAACCAGTCT
CCTAGAAATA CCTATCAGTC GGCTATGGCA AAGCAGGCCT TAGGACTGTA TGCAGCCAAT
TATCAAATAA GGACAGACAC TAGAGCTCAT CTCCTTCATT ATCCTCAGAA GCCAATAGTC
CAAACTAGGG CTTTAGAGGC CATAGGATAT ACGGAGAGAC CTGCAGGAAA TAATGCGATA
TTTGCCCTTA TGTCCTACAC AGGATACAAC ATGGAAGATG CCGTTATCAT GAATAAGTCG
TCCGTGGATA GAGGGATGTT TAGGTCCACT TTCTTCAGGC TTTACTCCGC CGAGGAAATC
AAGTATCCAG GAGGGCAAGA GGATGAAATC CTTCTTCCCG AGCCCGGCGT AAGGGGATAT
AAGGGAAAGG ATTACTATAG GCTTCTGGAA TCTAACGGAA TAGTTTCTCC TGAGGTTGAC
GTTAAGGGAG GAGATGTATT AATAGGGAAG GTAAGCCCAC CAAGGTTCCT TCAGGAATTC
AAGGAATTGT CACCTGATCA GGCTAAGCGT GATACGTCGA TAATAACTAG GCACGGAGAA
CGTGGTACTG TAGATCTAGT ATTGGTGACC GAGACCTCTG AGGGAAATAA GTTAGTAAAG
GTGAGAGTTA GGGACCTGAG AATACCAGAA ATAGGAGACA AATTCGCTAC TAGACATGGG
CAGAAGGGAG TAATAGGAAT GCTAATACCC CAAGTGGACA TGCCATATAC AACTAGTGGG
CTAGTTCCAG ATATAATACT GAATCCTCAT GCGCTTCCCT CCAGAATGAC TGTAGGTCAG
ATAATGGAAG CGATAGCAGG AAAGTTCGTA GCAGCCACTG GTAATCCTAT TGACGCAACA
CCGTTCTATA ATACTCCCAT AGAGGAAATT CAGAGGAAGC TTCTGGAACA TGGTTATCTA
AGCGACGGAT CAGAGGTTGT ATATGATGGT AGAACAGGAG AGAAATTGAA AGGAAGAATA
TTGTTTGGTA TAGTTTACTA TCAAAAATTA CACCATATGG TAGCGGATAA GATGCATGGG
CGTGGTAGAG GGCCAGTTCA AATACTCACT AGACAGCCCA CCGAAGGAAG AGCTAGGGAA
GGAGGACTTA GATTTGGGGA AATGGAAAGG GACTGCCTTA TAGGATACGG TGCAGCGATG
TTAATCAAGG ATAGATTACT AGATAACTCA GATAGGGCAA CAGTATACGT TTGTGAACAG
TGCGGATATG TAGGCTGGTA TGATAGGACT AAGAACAAGT ATATATGCCC AATTCATGGT
GATAAGACAA CTTTATACCC CGTGGTAATA TCATATGCCT TCAAGTTGTT GCTTCAGGAA
CTTATGAGCA TGGTTATCGC ACCTAAGTTA GTTCTGGGAG ACAAAGTTCC TGTTGGAGGT
AATTCCAATG AGTGA
 
Protein sequence
MLSVDDRWVI VEAYFKSRGL VRQHLDSFND FIKNKLQEII DEQGEIITEI PGLKIRLGKI 
RVGKPRVREA DKGDKEITPM EARLRNLTYA APIFLTMTPV ENNIEGDPTE VHIGDIPIML
KSIADPTSEF SQDKLVEIGE DPKDPGGYFI INGSERVIVT QEDLATNRVL TDVGKAGSNV
THTAKIISST SGYRVPITIE RLKDSTIHVS FPSAPGRIPF AILMRALGVE TDEDITLAVS
LDPQIQNELL PSLEQASSIS SKEDALDFIG NRVAIGQKRE MRIQKAEQIL DKYFLPHIGT
NPSDRVAKAY YLAFAVSKLI ELYLGRREPD DKDHYANKRL KLAGDLFASL FRVAFKAFVK
DLVFQLEKSK VRGRRLAINA LVRPDIISER IRHALATGNW VGGRTGVSQL LDRTNWLSML
SHLRRVVSSL ARGQPNFEAR DLHGTQWGRM CPFETPEGPN SGLVKNIALL AQISVGINEK
SLERTLYNLG VVPIDDAIKK VKSEEGQSEN YQTWSKVILN GRLIGYYPDG RELAEKIRES
RREGELSDEV NVSYNITETF NEVYINSDSG RVRRPLIVVK HGKPLVTKED IENLKKGKIT
FDTLVKEGKI EFLDAEEEEN AYIALEPKDV TKDHTHLEIW APAILGITAS IIPYPEHNQS
PRNTYQSAMA KQALGLYAAN YQIRTDTRAH LLHYPQKPIV QTRALEAIGY TERPAGNNAI
FALMSYTGYN MEDAVIMNKS SVDRGMFRST FFRLYSAEEI KYPGGQEDEI LLPEPGVRGY
KGKDYYRLLE SNGIVSPEVD VKGGDVLIGK VSPPRFLQEF KELSPDQAKR DTSIITRHGE
RGTVDLVLVT ETSEGNKLVK VRVRDLRIPE IGDKFATRHG QKGVIGMLIP QVDMPYTTSG
LVPDIILNPH ALPSRMTVGQ IMEAIAGKFV AATGNPIDAT PFYNTPIEEI QRKLLEHGYL
SDGSEVVYDG RTGEKLKGRI LFGIVYYQKL HHMVADKMHG RGRGPVQILT RQPTEGRARE
GGLRFGEMER DCLIGYGAAM LIKDRLLDNS DRATVYVCEQ CGYVGWYDRT KNKYICPIHG
DKTTLYPVVI SYAFKLLLQE LMSMVIAPKL VLGDKVPVGG NSNE