Gene Msed_0034 details


Gene Information

Locus tag: Msed_0034
Symbol:
ID: 5105173
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 32163
End bp: 34823
Gene Length: 2661 bp
Protein Length: 886 aa
Translation table: 11
GC content: 45%
IMG OID: 640505928
Product: DNA-directed RNA polymerase subunit A'
Protein accession: YP_001190135
Protein GI: 146302819
COG category: [K] Transcription
COG ID: [COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit
TIGRFAM ID: [TIGR02390] DNA-directed RNA polymerase subunit A'
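
The coordinates and lengths listed above are mutually consistent, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the gene record above.
START_BP = 32163
END_BP = 34823
GENE_LENGTH_BP = 2661
PROTEIN_LENGTH_AA = 886


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# 34823 - 32163 + 1 = 2661 bp; 2661 / 3 = 887 codons; 887 - 1 stop = 886 aa
assert gene_length(START_BP, END_BP) == GENE_LENGTH_BP
assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```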


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000287475
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0408299
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGAGA GGTATCAAGT TAACGAGAAG ATAATAAAGG GAGTTAGATT CGGGATACTC 
TCCCCCGATG AAATAAGGAA GATGTCAGTT ACTGCCATAA TAACCGCTGA TGTGTATGAC
GAAGATGGGA CGCCCATAGA AGGAAGCGTA ATGGATTCAA GACTCGGGAT AATAGAGCCT
GGTCAGAAGT GTCCAATCTG TGGAAATGTT ATAGGCAACT GTCCTGGACA CTTTGGACAC
ATTGAATTAG TAAGGCCCGT TATACATGTT GGATTCGTTA AGCATGTCTA TGATCTATTA
AGATCCACAT GTAGGAGATG TGGTAGGGTA AAGATCAGCG AGAATGATAT AGAGAAATAC
AGGAAGATTT ATGACGCGAT AAAGAAGAGG TGGCCCTCAG CGGCTAAAAG ACTCATTGAT
CACGTGAAAA AAACGTCTAT GAAGGCAGCT GTATGTCCGC ATTGCTCAGA GAAACAACTG
AAAATCAAAC TGGAGAAGCC CTACAACTTT TACGAGGAGA GAAAGGAGGG TGTAATGAGG
CTCACTCCCT CCGATATCAG GGAAAGATTA GAAAGAATAC CTGATTCTGA TGTGGAGCTT
CTAGGTTATG ATCCAACCAC TAGCAGACCC GAGTGGATAA TTCTTACCGT ACTACCTGTA
CCTCCTATTA CCATCAGGCC CTCAATAATG ATAGAGAGCG GGATAAGGGC AGAGGACGAT
CTAACTCACA AGCTCGTGGA CATAGTTAGA ATAAATGAGA GACTAAAGGA GAGCATAGAT
GCAGGTGCCC CTCAGTTAAT TGTGGAAGAT CTTTGGGATC TCCTGCAGTA TCATGTGGCT
ACCTATTTCG ATAATGAGAT ACCAGGCCTG CCCACATCTA AGCATAGATC GGGTAGACCG
CTTAGAACTC TGGCACAAAG ACTCAAGGGT AAAGAGGGAA GATTTAGAGG TAACCTTTCA
GGAAAGAGGG TTGATTTTTC AGCCAGAACC GTAATATCGC CTGATCCAAA TATCAGTATA
GATGAGGTTG GAGTTCCATA CGACGTCGCA CAAATTCTCA CTGTACCAGA GAAGGTAACT
AAATGGAACA TAGAGAGAAT GAGACAATAC GTTATAAACG GTCCAGATAA GTGGCCCGGT
GCCAATTATG TGGTTAGGCC TGATGGAAGG AGAATAGACC TTAGATATGT TAAGGACAGA
AAGGAGCTGG CTGCAACCCT AGCTCCTAAC TTTATAGTGG AAAGGCATTT GGTTGACGGA
GATATAGTAA TTTTCAACAG ACAACCATCG CTTCATAGAA TATCCATGAT GGGTCATAAG
GTGAGAGTTC TCCCAGGGAG AACCTTTAGG CTTAACCTTT TGGTGTGTCC ACCGTATAAC
GCTGATTTTG ATGGCGATGA GATGAACCTC CATGTTCCTC AGTCAGAGGA GGCAATAGCG
GAGACCAAGG AATTGATGAC AGTTCATCGC AATATATTAA CCCCTAGATA CGGAGGCCCC
ATTATAGGTT CAGCCCAGGA TTATATTAGT GGCGCATATC TGCTCACCGT CAAAACGACT
CTACTCACAG AAGATGAAGT TCAGACGATA CTGGGGGTAG CTAACATTAA CAAGAATTTA
GAGGAACCAG CGATCCTGGC CCCTAAAAGG CTCTACACGG GGAAGCAGAT AGTCAGTCAC
TTCCTTCCTG AAGATTTTAA CTTCCACGGT CCTGCAAACA TATCGAGCGG TGTAAGGTCA
TGCAAGGATG AGGACTGCCC TCATGACTCT TACGTGGTGA TCAAGAAGGG GAAACTCCTG
GAGGGAGTGT TTGACAAGAA AGCGCTAGGA AACCAGCAGG CTGAGAGTAT ACTCCACTGG
CTCGTCAGAG AATATGATGA GGATTATGTT CTCTGGTTAA TGGACAATCT GTTTAAGGTA
TTCCTTAGGT ACATCGAGCT CCACGGTCTA ACCATGACAC TCTCTGATGT AACTATTCCG
GAGGAAGCCA CAAAGAAAAT CGCGGAAAGG GCACAGGAGG CGAGGAAAAA GGTTAACGAG
CTCATTGAAA GTTATGAAAA AGGCCAACTC GAGGTCATCC CTGGTAGAAC GTTAGAAGAA
AGTTTGGAAA GTTACATCCT TGATGCGCTA GACAAGCTAA GAAATGAAGC TGGAGAGATA
GCCACGACTT ATCTGGATCC ATTTAATAAT GCCTATATCA TGGCAAGAAC TGGAGCAAGG
GGAAGCGTAC TCAATATCAC TCAAATGGCA GCCATGCTAG GTCAACAATC AGTCAGAGGA
GAGAGAATAA AACGTGGGTA TGCCACTAGG ACGCTCCCTC ACTTTAAGCC TGGGGATATA
ACGCCAGAGG CTAGGGGATT CATTTACTCG TCCTTCAGAA GCGGATTAAA TCCAATCGAG
ACCTTCTTCC ACGCTGCAGG TGGTAGAGAG GGTCTAGTGG ACACCGCTGT GAGGACGTCT
CAGAGCGGTT ACATGCAGAG GAGGCTCATA AACGCGCTCT CCGACCTCAG GGTTGAGTAC
GATGGTACGG TTAGAACGCT GTATGGAGAG ATGATCCAGA CTCTCTACGG TGGAGATGGA
GTCCATCCAA TGCAAAGTGC ACATGGCAAG ACCATAGATG TAGATAGAGT CCTAGAGAGA
GTAGTTGGTT GGAAGAGGTG A
 
Protein sequence
MSERYQVNEK IIKGVRFGIL SPDEIRKMSV TAIITADVYD EDGTPIEGSV MDSRLGIIEP 
GQKCPICGNV IGNCPGHFGH IELVRPVIHV GFVKHVYDLL RSTCRRCGRV KISENDIEKY
RKIYDAIKKR WPSAAKRLID HVKKTSMKAA VCPHCSEKQL KIKLEKPYNF YEERKEGVMR
LTPSDIRERL ERIPDSDVEL LGYDPTTSRP EWIILTVLPV PPITIRPSIM IESGIRAEDD
LTHKLVDIVR INERLKESID AGAPQLIVED LWDLLQYHVA TYFDNEIPGL PTSKHRSGRP
LRTLAQRLKG KEGRFRGNLS GKRVDFSART VISPDPNISI DEVGVPYDVA QILTVPEKVT
KWNIERMRQY VINGPDKWPG ANYVVRPDGR RIDLRYVKDR KELAATLAPN FIVERHLVDG
DIVIFNRQPS LHRISMMGHK VRVLPGRTFR LNLLVCPPYN ADFDGDEMNL HVPQSEEAIA
ETKELMTVHR NILTPRYGGP IIGSAQDYIS GAYLLTVKTT LLTEDEVQTI LGVANINKNL
EEPAILAPKR LYTGKQIVSH FLPEDFNFHG PANISSGVRS CKDEDCPHDS YVVIKKGKLL
EGVFDKKALG NQQAESILHW LVREYDEDYV LWLMDNLFKV FLRYIELHGL TMTLSDVTIP
EEATKKIAER AQEARKKVNE LIESYEKGQL EVIPGRTLEE SLESYILDAL DKLRNEAGEI
ATTYLDPFNN AYIMARTGAR GSVLNITQMA AMLGQQSVRG ERIKRGYATR TLPHFKPGDI
TPEARGFIYS SFRSGLNPIE TFFHAAGGRE GLVDTAVRTS QSGYMQRRLI NALSDLRVEY
DGTVRTLYGE MIQTLYGGDG VHPMQSAHGK TIDVDRVLER VVGWKR
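
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first and last codons only. It is a hand-rolled translator, not the tool used to produce this record. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons). The `translate` function and its `to_stop` parameter are hypothetical names chosen for this example.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a DNA string, ignoring whitespace between 10-base blocks."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*" and to_stop:
            break  # stop codon: end of the protein
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as laid out in the record:
#   translate("ATGAGTGAGA GGTATCAAGT TAACGAGAAG")  -> "MSERYQVNEK"
# Last 21 bases, ending in the TGA stop codon:
#   translate("GTAGTTGGTTGGAAGAGGTGA")             -> "VVGWKR"
```

Both results match the start (MSERYQVNEK) and end (VVGWKR) of the protein sequence listed above.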