Gene Msed_1270 details

Gene Information

Locus tag: Msed_1270
Symbol:
ID: 5104683
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1246081
End bp: 1248120
Gene Length: 2040 bp
Protein Length: 679 aa
Translation table: 11
GC content: 55%
IMG OID: 640507161
Product: aldehyde oxidase and xanthine dehydrogenase, molybdopterin binding
Protein accession: YP_001191354
Protein GI: 146304038
COG category: [C] Energy production and conversion
COG ID: [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs
TIGRFAM ID:
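
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function name `expected_lengths` is hypothetical and not part of any database API.

```python
def expected_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and one untranslated stop codon.
    """
    gene_length = end_bp - start_bp + 1  # inclusive range
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the Start bp and End bp fields of this record.
print(expected_lengths(1246081, 1248120))  # (2040, 679)
```

Under these assumptions the result agrees with the listed Gene Length (2040 bp) and Protein Length (679 aa).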


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.529705
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCAAGA TTAGGGAACA CCTAGATGAA ATCACGGGAC ACGGTAAATA CATAGATGAT 
GTGGATCTCC CCAACACTGT CTATCTTGGC GTGGTTAGAT CGCAGGTGGC CAGGGGAAAG
GTGCTCGATA TCTCGAGAAG CGATAATGTC CTCCTCTTCC TGGATTGGGA CTCAGTCTCA
ACTTACATGC CAGTTAGACC AGACCCTAGG ACGAAGAATG TCGTAAAGAT GCCTATCGTG
AGCGATGGGA GAGTGAACTT CGTTGGCCAG CCCGTGGTCG CCTTCGTTGT TAAGGACAGA
TACGAGGTTG AAGACGTGAC TGACGAGATA GGCGTGGACT ACGCCCAAGA GACGCCCATC
CTCTCCGTGA AGGACTCCAT GAGGGAGGAG ATCAAGATAC ATGAGAAAGG GAACATTGCA
ATAGACCTTG ATCTAAGTGG CGGAGATCTA GAGCAACTGG TGAACTCTGA GGTCACGGTC
GAAAGGGAGC TCCTCCAAGA CAGGATAGTT CAGCACCCCA TGGAGCCTAA GGGTGTCATC
TCGTACTACA ACGGTGAAAC ACTCACCGTG ATTGGATCCT TCCAGTCGGC TTTCAGGGTG
AGGGCAGATC TCCAGGAAGC ACTTGGGGTC TCACCCGAGA AGATTGTGGT CCAATCTCCT
CCAAACGTTG GTGGAGGCTT TGGGAACAAG GTCCCTGCCT ATCCGGAGTA CGTCCTGACT
GCCCTGGCCT CTATGAAGCT GAGAAGACCT GTGAAGTGGA TCGAGACTAG GAGGGAACAC
CTTACCAACC CAACCCAAGG GCGGGGAGTC TGGTCTAAGG TTAAACTTCA CGCCAAGAGG
GACGGTACAA TCCTAGGCCT TGAGGGGACC ATTGCCGTGG ATCTAGGTGC TTACGCCTTC
ACCATCAACA CAACCACCCC AGCATTCATT GCCTCCCTCA CGAACGGTCC CTACAAGATG
AGGTTCGCTA AGTTGAGGGC ACTAGGCGTT TACACGAATA AACCTCCCAC CGGCCCATAC
CGCGGAGCCG GGAGGCCTGA GGCAGCCCTA ATCACTGAGA CACTTGTTGA AGATCTGGCA
GAGACGCTGG GTCTTAGCCC CCTTGAGGTT AGGAAGAAGA ACCTCCTGGA CGGAGAGTTC
ACAACCCCCC TGGGAGTAAA GATAGATAAG GCTGCGTACA GGGAGATGTT TACAAGGGCC
GAGCAGGTTT ACCACACTCT CAAGGAGAGG CACAAGGGAA AGGCGATCTC CTTCATAGCC
TTCACTGAAG TTGTGAGGGC GTCCCCCGGC GAGGGTGCTA AGGTGAGGGT AGGAAGGGGA
GAAGTCTTCA TTGCGGTGGG CAGTGGACCC CACGGACAAG CCCACAGAAC CACGTTCGCT
CTCCTAGCAG GAGAGGTGCT TGGGATTGAC CCGAACGAGA TCAAGGTTGA GGTAAACAAC
ACCTCCCTAA TCAAGGAGGG GATAGGTAGC TTCGGGAGTA GGAGCGCAGC TGCCGGTGGG
TCAGCCGTGA TTGAGGCCTC CAGGGCAGTC CTCCAGAAGA TAAGGGAAAG GGGTCTCACC
GTGAGGCAGG CCATAAACTC AGACGAGGTT TTCGAGGCTG AGACCTTCAC CAAGACTTCG
GACCTCTACA CCCCTGGAAT ACATATGGCA GTCCTCGACC TGGACAAGGA GACGGGCTTC
GCCAGGGTCC TCGAGTATTA CGCTATCGAC GACGTTGGGA GGGCAATAAT CCCATCAGAG
GTTGAGGGAC AGATCGTGGG AGGCGTTCTA CAGGGAGCTT CCCAGGTCCT GCTTGAGTAT
GCACCGTATG ACGAGAACGG TAATCCCGTG TATGGATCAA TCGCAGATAA CGGGTTCCCC
ACTGCGGTTG AGGCAGTTCG CAGGGTAATT TCAGAGTCCT TCTCTACCCC ATCCAACACC
TTGAGCCAGG CAAGGGGTGT CGGTGAGGCC GGAACCACTG GGGCCCTCCC TGCAGTTTTC
ATTGCCCTTG AAAAGGCACT CCACAGGAAG TTGAGCGGGA CCCCCTACCT ACCAGGGTAA
 
Protein sequence
MIKIREHLDE ITGHGKYIDD VDLPNTVYLG VVRSQVARGK VLDISRSDNV LLFLDWDSVS 
TYMPVRPDPR TKNVVKMPIV SDGRVNFVGQ PVVAFVVKDR YEVEDVTDEI GVDYAQETPI
LSVKDSMREE IKIHEKGNIA IDLDLSGGDL EQLVNSEVTV ERELLQDRIV QHPMEPKGVI
SYYNGETLTV IGSFQSAFRV RADLQEALGV SPEKIVVQSP PNVGGGFGNK VPAYPEYVLT
ALASMKLRRP VKWIETRREH LTNPTQGRGV WSKVKLHAKR DGTILGLEGT IAVDLGAYAF
TINTTTPAFI ASLTNGPYKM RFAKLRALGV YTNKPPTGPY RGAGRPEAAL ITETLVEDLA
ETLGLSPLEV RKKNLLDGEF TTPLGVKIDK AAYREMFTRA EQVYHTLKER HKGKAISFIA
FTEVVRASPG EGAKVRVGRG EVFIAVGSGP HGQAHRTTFA LLAGEVLGID PNEIKVEVNN
TSLIKEGIGS FGSRSAAAGG SAVIEASRAV LQKIRERGLT VRQAINSDEV FEAETFTKTS
DLYTPGIHMA VLDLDKETGF ARVLEYYAID DVGRAIIPSE VEGQIVGGVL QGASQVLLEY
APYDENGNPV YGSIADNGFP TAVEAVRRVI SESFSTPSNT LSQARGVGEA GTTGALPAVF
IALEKALHRK LSGTPYLPG
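
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the usual codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, not in amino-acid assignments), and it simply strips the spaces used to group the sequence into blocks of ten. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G; first base varies slowest),
# paired with the amino-acid string for the standard assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop."""
    seq = "".join(dna.split()).upper()  # remove block spacing and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGATCAAGA TTAGGGAACA CCTAGATGAA"))  # MIKIREHLDE
```

The first ten residues produced this way match the start of the protein sequence listed above.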