Gene Msed_0194 details


Gene Information

Locus tag: Msed_0194
Symbol:
ID: 5103938
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 156749
End bp: 158887
Gene Length: 2139 bp
Protein Length: 712 aa
Translation table: 11
GC content: 48%
IMG OID: 640506099
Product: xanthine dehydrogenase, molybdenum binding subunit apoprotein
Protein accession: YP_001190295
Protein GI: 146302979
COG category: [C] Energy production and conversion
COG ID: [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs
TIGRFAM ID:
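
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the coordinates are 1-based and inclusive (an assumption that happens to match the reported gene length) and that the CDS ends in a single untranslated stop codon. The variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 156749
end_bp = 158887
gene_length_bp = 2139
protein_length_aa = 712

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = span // 3
expected_protein_length = codons - 1

print(span, span % 3, expected_protein_length)  # prints: 2139 0 712
```

Under these assumptions the span equals the listed gene length, is a whole number of codons, and gives the listed protein length.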


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0755616
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGATACG TAGGTAAACC TGTTAGAAGG GTTGAGGATC CGAAGCTTAT AACAGGAAGG 
GGTTCATTTG TTGACGACAT CCAGATACCG GGGACCTACT ACGTGGCCTT TGTAAGATCT
AAATATCCAC ACGCGCGGAT TAGTGTTAAA CCAAGTCAGA ACGTTTTCAC TGGTTCTCAA
ATAAATCCAG GGAAGGATTT TCCTATTCCG TCTAACGAAG TCATATATGC AGGTCAACCT
ATTGCAGCAG TGATCGCCAG AGACCGATAT GAGGCTTACG ATCTTTTAGA GAGCGTTGAA
GTGGAATACG AGCAACTACC CTATGAAACA GATCCCTTCA GGGCCATGGA AGACAAGGTC
AAGGTCTACT CTAAGGCTGA AAGTAATATT TACGCGAAAA AGGAGTTCGT TGGAGGAGAA
GCGAAGAAAG AGCTGGAACA ATCTCCAATC GTGTTATCCG GAGAACTCCA TAACCAAAGG
ATTATAGCTT CGCCCATGGA GACCAGAGGG ACTCTAGCCT GGTTTGACGG TAATAGGCTG
AACGTCTGGT CCTCTACTCA GTCGGCGCAC TATTTAAGGA GAAACCTGGT GAGCTTCCTG
GGAATACAGA ATATCAGGGT CATTCAACCA GACGTGGGAG GGGCCTTTGG GAGCAAGATA
ATAACTCACC CAGAGGAATA TGCGGTGAGC TTCCTTGCCC TTAAACTTGG GATCCCCCTA
AAGTGGATAC CCACGAGAAC AGAGGAGATG CTGAGCGCTG GACATGGAAG AGATAAGTGG
CTGAGATACA AGGTTGGTGT AAAGAGGGAC GGAACTATCA CGGCAGTTGT TGGAACTGTG
GTGGGAAATC TAGGGGCTCC CTATCGTGAT GCCAACGACG ATGATTCTGG GAACGTCATG
AGCGCCGCTA GGATGCTCCC TGGACCGTAT AGAATAAAGC ATGGGTTTGT CGCTGCTTAC
AGCGTTAACA CTAACCTAAC TCCGACTACC TCATATAGGG GAGCCGGTAG ACCAGAGGCT
ACCTATTTCA TTGAAAGTAT AATCGAGGAG ATAGCTGAAG AGCTGAAGCT GGACCCGCTA
GAGGTCAGGC TCAGAAACGT GATAAGACCT GAGGAGATGC CCTACACTAA CGTGTTTGGT
ATAACCTATG ATTCCGGAAA CTACCCAGAG TTACTAAACT CAGCTAAATC ATACTATGAA
CAACTTAAGG CTGAAGCTAA AGACAACCAG TGCGTAGGAC TGGCGATGTA CGTTGAAATA
ACAGCGTTCG GCCCCTGGGA GACAGCCAGG GTTTACGCTA AGTACGACGG AAGGATAGTT
GTGGTCACTG GCACTGGACC GCACGGCCAA GGTGATGCAA CTGCTTTTGC ACAGCTCGCA
GCCGATGCCT TGGAAATCTC CATGGATCTT GTGGAGGTCA GGTGGGGAGA TACTGATGTG
ATTGAAGATG GAATTGGGAC TTGGGGAAGT AGAACAGTTA CTATTGGAGG CTCGGCAGTA
ATAATGGCTT CTCAAGAACT TAGGAAAAGG CTAACAGAGG CTGGCGCGAA AGCCCTTGAA
GCTGACGTAG AGGAGGTAGA GTATAGGGAG GGCAAAGTGG TTCATAAAAA GACTGGAAAA
AGTCTAGAGT TAGCTGAGAT CATCAAGTCA GCCTATAAGC TCGGTATTTC CCTTGATGTG
ACCTCCGTCT ATCCGGTCAA AAAGCCCACA TCACCTTATG GAGTTCACAT GGCATTAATG
GAGATAGATA GGGAGACTGG ACTAATTAGC GTGAAGAAAT ACATTGCCGT CGATGACGTG
GGGAACGTTA TTAATCCCCT ATTGGCTGAG GGACAGATCC ACGGAGGTGC GCTTCAAGGA
ATAAGCCAGG CCTTATATGA GGAGGCCGTA ATTAATGATG GTACCTTACA GAATCCTACC
TTTGGCGACT ATGCGTTACC CACAGCGGTA GAGACCCCAC GCTTCACATG GAAGTATCTC
ACCAACGGTC TTTCACCCCA TCCCACTGGA AGCAAGGGAA TAGGCGAGGC CGGGACGGTC
GTAGGAACTC CGGTTATATC CAATGCGATC TCCAGTTGTC TGAAGAGGAA GTTCAGTACC
ATGCCTATCT TATTGGAGAA GGTATTAGGT GACCAGTAG
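
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be cleaned before any computation. As a rough sketch, and not part of the original record, the helpers below strip the grouping whitespace and compute a GC percentage that could be compared with the GC content field. The function names `clean_sequence` and `gc_percent` are hypothetical, and the demo uses only the first line of the sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence shown above, used only as a short demo;
# paste the full blocked sequence here to check the whole gene.
first_line = "ATGAGATACG TAGGTAAACC TGTTAGAAGG GTTGAGGATC CGAAGCTTAT AACAGGAAGG"
seq = clean_sequence(first_line)
print(len(seq), seq[:3], round(gc_percent(seq), 1))
```

Running the same helpers on the full sequence gives a value to compare against the reported 48%, bearing in mind that the record presumably rounds its figure.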
 
Protein sequence
MRYVGKPVRR VEDPKLITGR GSFVDDIQIP GTYYVAFVRS KYPHARISVK PSQNVFTGSQ 
INPGKDFPIP SNEVIYAGQP IAAVIARDRY EAYDLLESVE VEYEQLPYET DPFRAMEDKV
KVYSKAESNI YAKKEFVGGE AKKELEQSPI VLSGELHNQR IIASPMETRG TLAWFDGNRL
NVWSSTQSAH YLRRNLVSFL GIQNIRVIQP DVGGAFGSKI ITHPEEYAVS FLALKLGIPL
KWIPTRTEEM LSAGHGRDKW LRYKVGVKRD GTITAVVGTV VGNLGAPYRD ANDDDSGNVM
SAARMLPGPY RIKHGFVAAY SVNTNLTPTT SYRGAGRPEA TYFIESIIEE IAEELKLDPL
EVRLRNVIRP EEMPYTNVFG ITYDSGNYPE LLNSAKSYYE QLKAEAKDNQ CVGLAMYVEI
TAFGPWETAR VYAKYDGRIV VVTGTGPHGQ GDATAFAQLA ADALEISMDL VEVRWGDTDV
IEDGIGTWGS RTVTIGGSAV IMASQELRKR LTEAGAKALE ADVEEVEYRE GKVVHKKTGK
SLELAEIIKS AYKLGISLDV TSVYPVKKPT SPYGVHMALM EIDRETGLIS VKKYIAVDDV
GNVINPLLAE GQIHGGALQG ISQALYEEAV INDGTLQNPT FGDYALPTAV ETPRFTWKYL
TNGLSPHPTG SKGIGEAGTV VGTPVISNAI SSCLKRKFST MPILLEKVLG DQ