Gene Msed_1064 details


Gene Information

Locus tag: Msed_1064
Symbol:
ID: 5104275
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 993194
End bp: 995215
Gene Length: 2022 bp
Protein Length: 673 aa
Translation table: 11
GC content: 52%
IMG OID: 640506959
Product: aldehyde oxidase and xanthine dehydrogenase, molybdopterin binding
Protein accession: YP_001191152
Protein GI: 146303836
COG category: [C] Energy production and conversion
COG ID: [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs
TIGRFAM ID:
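
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon; both are assumptions, although they agree with the listed values. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(993194, 995215)
print(length)                     # 2022
print(protein_length_aa(length))  # 673
```

Both results match the Gene Length (2022 bp) and Protein Length (673 aa) fields.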


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTAAAGG AACATTATCC CATAGTAGTG GGAAAGTCAC TCTACATAGA TGATATAACT 
CCCTCAAATA CTGCTTACCT TCACGTAGTT AGATCCCCGA TAGCTAGGGG AGTAATCAAG
TCCGTGTCGG GTCCGGAGAA GGCCCTTCTA ACCTTTACGT GGGAACAGGT GAGGAACTGG
ATCCCCGTAA GGCTCTTTGG ACCCTCAGAA GGCCTTCAGG TTACCAGAAT GCCAGTCCTA
GCGAATGGAA GAGTGAACTT TGTGGGTCAA CCAGTAATAG CCTTTGTGGT ACAGGATAGG
TATGAGGGAG AGGACCTCGC GGACGACGTT TCTGTAGATT ACGAGGAGCT AAATGCCGTT
ACTGATCCTG AAACTGCCCT TGAAAGCGAG CCAATTCACC CAGAGCTCAA GAGCAATATC
TTCATGGATC AACTCCTTCA GGGAGGCAAC CTCTCCCTTA AGGACAAGGC TGACGTGGTG
GTGAGGAGAA AGATTAAGCA GAGCAGGGTG GCGACAAATC CCATGGAACC AAAGGGCATA
CTGTGCTGGT GGGATGGTGA CACGCTGAAC GTTAAGGTCT CGACTCAGGC CCCCTTCGGT
GTGAGGAATG ACCTTCACGA GTTGTTAGGG ATACCTCCGG AGAAGATCAA GGTGAGCTCA
CCTCCAAATG TGGGGGGAGG TTTTGGAAAC AAGAGCGGAG GATACCCTGA GTACGTTCTG
GCCGCTCTGG CCTCCCTAAA GCTGGGAAGA CCCGTGAAGT GGATTGAGAC GAGGTCTGAG
ATACTTAACA ACGCCCAATC ACAGGGAAGA GGAGAAGTCT CAGACATGAA GCTCTACGCT
ACCAGGTCAG GAGAAATGCT AGGAATGGAG GGAGAGGTCA TAGCGAATAT GGGTGCATAC
GCATATGGAA TAAATTATTT CACCTCTCAG TTCGTGGCTA GGCTCTCCAA CGGTCCCTAC
AAACTGAAGT TCGCCTCAGT TAGGGCGATT ACAGTCTACA CCAATACTCC GCCCATGGGC
TTTTACAGGG GTGCAGGGAG ACCCGAGGCG GCATTGATTC ATGAGACCTT GGTGGAAGAT
CTGGCGGAGG AGCTTGGAAT GGATCCTGTG GAGATAAGGA GAAAGAACCT AGTTGACGAC
TCTGGTTACG TGACTCCACT AGGCCTGAGG TACGACGCAG CTGGATACAG GGAAGTTTTT
GATAGGGCCG CGAACTACTA TAGGAAGCTC AGGGAAACAT CTAAGGGAGT CTCCCTAGTT
ACCTTCACAG AAATTGTCAG AACCTCCCCA GGAGAGAGCG CCAGAATTGA GGTCAAAGAC
AGGAAAGTGA TAGTTCACCT AGGGTTGGGC GCTCATGGAC AGGCCTATGA ATCCTCGTTC
AGGACAGTCG TGGCTGAAGA GCTGGGGATT GACCCAGAGA AGGTTGAGGT CAAGACTGGA
GACAGTGAAG GGGTTAAGGA GGGTATAGGG AGCTTCGGTT CCAGAGGGGG AACGATAGGT
AGTTCAGCTG CGCTAGCTGC AGCCCAGGAA CTCAAGAGGA AGATGGGAGG AAAGGTGGAT
CTGAGCAGGG AGATGAGTGT TGAGGTCTTT TACAGGGCAG AGGACATATT TGCCCCAGGG
GCACATGTGG CTAAGGTAGA GCTTGACCCT GAGACGGGGA TCTTCAAGGT CGTGGAGTAC
TATGCCGTAG ATGACGTAGG GAGAGTCCTA AACCGTGAGG AAATTGAGGG TCAGATCATA
GGAGGTGTCC TTCAGGGAGT TTCTCAGGTC ATGATGGAGG CAGTGAAGTT CGATGAGAGA
GGTAATCCCA TGTGCAGTTC CGTTGCAGAT TGCGGGATGT TAACGGCGGT GGAAGGGCCT
AGAAGGGTTA ACGCAGAGTA CGTTGAGTTC AGGTCATCCC TGTTGTCGGG GTCCAGGGGA
GTGGGCGAAG CTGGGACAAC AGGAGCCCTT CCCGCCACCT TCATAGCCCT AGAAAAGGCC
CTAGGCAAGA AATTGAGTTC ATTACCGTTT GAGCCTCAGT AG
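
The GC content field in Gene Information can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as text with the 10-base blocks separated by whitespace, as it is displayed here. The helper names are hypothetical, and the example runs only on the first 20 bases, not on the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence (illustration only).
first_blocks = clean_sequence("ATGTTAAAGG AACATTATCC")
print(len(first_blocks))         # 20
print(gc_percent(first_blocks))  # 30.0
```

Passing the full 2022 bp sequence to `gc_percent` would give the gene-wide value to compare against the listed GC content.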
 
Protein sequence
MLKEHYPIVV GKSLYIDDIT PSNTAYLHVV RSPIARGVIK SVSGPEKALL TFTWEQVRNW 
IPVRLFGPSE GLQVTRMPVL ANGRVNFVGQ PVIAFVVQDR YEGEDLADDV SVDYEELNAV
TDPETALESE PIHPELKSNI FMDQLLQGGN LSLKDKADVV VRRKIKQSRV ATNPMEPKGI
LCWWDGDTLN VKVSTQAPFG VRNDLHELLG IPPEKIKVSS PPNVGGGFGN KSGGYPEYVL
AALASLKLGR PVKWIETRSE ILNNAQSQGR GEVSDMKLYA TRSGEMLGME GEVIANMGAY
AYGINYFTSQ FVARLSNGPY KLKFASVRAI TVYTNTPPMG FYRGAGRPEA ALIHETLVED
LAEELGMDPV EIRRKNLVDD SGYVTPLGLR YDAAGYREVF DRAANYYRKL RETSKGVSLV
TFTEIVRTSP GESARIEVKD RKVIVHLGLG AHGQAYESSF RTVVAEELGI DPEKVEVKTG
DSEGVKEGIG SFGSRGGTIG SSAALAAAQE LKRKMGGKVD LSREMSVEVF YRAEDIFAPG
AHVAKVELDP ETGIFKVVEY YAVDDVGRVL NREEIEGQII GGVLQGVSQV MMEAVKFDER
GNPMCSSVAD CGMLTAVEGP RRVNAEYVEF RSSLLSGSRG VGEAGTTGAL PATFIALEKA
LGKKLSSLPF EPQ
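
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below builds the codon table from the standard amino acid string in TCAG order; translation table 11 assigns the same amino acids as the standard code and differs only in which codons may act as start codons. Since this gene starts with ATG, alternative start codons are not handled here. The `translate` helper is illustrative, not the database's own method.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, same as table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGTTAAAGG AACATTATCC CATAGTAGTG"))  # MLKEHYPIVV
```

The output matches the first ten residues of the protein sequence; the final codons of the gene (GAG CCT CAG TAG) likewise give the terminal residues EPQ followed by a stop.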