Gene Msed_0297 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0297 
Symbol 
ID5104933 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp253731 
End bp255962 
Gene Length2232 bp 
Protein Length743 aa 
Translation table11 
GC content50% 
IMG OID640506203 
Productcarbon-monoxide dehydrogenase (acceptor) 
Protein accessionYP_001190398 
Protein GI146303082 
COG category[C] Energy production and conversion 
COG ID[COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.752062 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTACG CAGGTAAGGC CGTAAAAAGA CTATATGACG ATAAGTTTGT AACTGGTAGA 
AGCACCTACG TTGATGATAT CAAGGTAAAT TCGTTATATG CCACCTTCGT GAGGAGCAAT
GTTGCTCACG GAGTCATAAA GAGGATACAT AGGGACGACG CCCTGAAGAT GAGGGGTGTT
GTTGCAGTAT TCACCGGGGA GGAGCTTAAT CAGATCATTA AGGGAGGAAT AGGTCCATGG
ACCACTTACA TCGATCCAAG GCCATGGAAG ATACCCCTGT GGAGGTTCGC CCAGGGTGAG
ACCAAGTATA ACGGTGAACC CATTGCCATG GTAATTGCCC AGGACAAGTA CACTGCCAGG
GACGCTGCCG AGGCAGTTAA CGTGGATATA GAACCCCTTG ATGCCGTAGT TGACATGGAG
AAGGCAAAGG AGGATAAGAT CCTTGTGCAT AAGGAACTCG GAACCAACTT CGGATACGTT
GGAGACTTCA ATGCAGGGGA CGCTGAAAAG GCACTCTCAA GTGCGGATAA GAGGGTTGAG
GTGGAGATAG CTAATAATCG TCTTATTCCA TCCCCCATGG AGCCTAGGGG AATTGTGTCC
CAGTATGATG GGGCAAATCT CACCATCTGG TACTCAACAC AAATACCCCA TTTCGCTAGG
TCAGAATTCA GCAGGATCTT CAACATACCC GAAAGCCGGA TCAGGGTAAT AATGCCCGAT
GTGGGAGGAG GTTTCGGAAG TAAGGCTCAC ATACTACCGG AGGAGCTCGC GGTAATCGCT
GCGTCCATAA GGCTAGGGAG GACAGTTAGG TGGACAGCGA CAAGGACGGA GGAGATGATT
GCCACCAACT CGAGGCATAA CGTGTTCAGA GGAGAAATAG GCTTCAAGAA TGACGGTACC
CTCGTGGCAA TTAAGGGCAC TCTTGACGTT GAACTTGGAG CTTACCTAAC CTATACAGAG
GGTCTACAGC CAACCATTAT CCCGCCAATG ATTCCAGGTC CCTACAGGGT TAGGGATTTG
GCCATAAGAA GCAGGGCAAT ATACACCAAT ACCATACCTA TCACCATGTA CAGAGGAGCC
AGTAGACCCG AGGCAACCTT CATCATTGAG AGAATAATGA GTACGGTGGC TGATGAACTC
AAGATGGACG ACGTTGAGGT GAGGATGAGG AACCTCGTGA GGGAGGACCA AATGCCCTAC
ACTAACCCAT TCGGCCTTAA GTATGATACA GGTGATTACC CAACCCTGCT CAAGGAAGGC
GTGAAAGTTC TCGAGTACCA CAAGCTGAAG GAATGGGCAG AGAATGAAAG AAAGAGGGGT
AAGAAGGTTG GCGTTGGCTT AGCCTACTAC CTTGAGATAT GTGGTTTCGG ACCATACGAG
TATTCAGAGA CTAGGGTGAA CGAAGACGGT AGCGTTATAG TGGCCATTGG AGGTACGCCA
CACGGGCAAG GTACGGAGAC GGCAATAGCT CAGCTGGTAG CTGATGAGCT CCAGATACCC
ATAGAGAGGA TTAAGGTTAC TTGGGGAGAC AGCGAGGCTA TTCCAGCCGG TACAGGCACT
TACGGGTCTA GAACTTTGGC CATAGCTGGA AGCGCTGCAA TTACCTCAAG CAGGCAAGCC
CTCGAGAAGA TGAAACGCGT GGCAGCCAGG TCCATGAAGG CTGACGTAGA GGAAATAGAG
TATAGAAATG GAGAGTTCGT GCACAAGAAG GAGGGCAAGA AGATGAGCTG GGATGCTGTG
GCGAGGGAGG CGTACTCGGG GAAAGAACCT GGAATTTCAG CAAGCGTTAT GCTCGAGGGA
GATGTGACAT TCCCATATGG AGTGCATGTG GCGGTAGTTG AGGTTGACGA TTACGGTATT
GCAAGGGTGA AGGAATATAG GGCCTATGAC GATATTGGTA GGGTCATAAA CCCTGCATTG
GCTGAGGGTC AGGTACATGG AGGAGGAACG CAGGCTGTGG GTCAGGCGCT CTATGAACTG
GCTATCATCA ACGAGAATGG TCAATTAGCG GTGACCTATG CGGACTATTT CGTCCCAACA
GCTGTGGAAG CACCTAAGTT CAAGAGCTAC TTCGCAGAGA AATATCATCC TTCAAACTAT
CTTACCAAGA GCAAGGGAGT TGGTGAGGCA TCCCTCATTG TGGGACCAGC TGCCATAGTA
AGGGCCATAG AGAACGCAAC AGGAAAGAGA TTTAACAAGA CGCCGGTCAC TCCTGAGGAT
ATCCTTAGTT AA
 
Protein sequence
MSYAGKAVKR LYDDKFVTGR STYVDDIKVN SLYATFVRSN VAHGVIKRIH RDDALKMRGV 
VAVFTGEELN QIIKGGIGPW TTYIDPRPWK IPLWRFAQGE TKYNGEPIAM VIAQDKYTAR
DAAEAVNVDI EPLDAVVDME KAKEDKILVH KELGTNFGYV GDFNAGDAEK ALSSADKRVE
VEIANNRLIP SPMEPRGIVS QYDGANLTIW YSTQIPHFAR SEFSRIFNIP ESRIRVIMPD
VGGGFGSKAH ILPEELAVIA ASIRLGRTVR WTATRTEEMI ATNSRHNVFR GEIGFKNDGT
LVAIKGTLDV ELGAYLTYTE GLQPTIIPPM IPGPYRVRDL AIRSRAIYTN TIPITMYRGA
SRPEATFIIE RIMSTVADEL KMDDVEVRMR NLVREDQMPY TNPFGLKYDT GDYPTLLKEG
VKVLEYHKLK EWAENERKRG KKVGVGLAYY LEICGFGPYE YSETRVNEDG SVIVAIGGTP
HGQGTETAIA QLVADELQIP IERIKVTWGD SEAIPAGTGT YGSRTLAIAG SAAITSSRQA
LEKMKRVAAR SMKADVEEIE YRNGEFVHKK EGKKMSWDAV AREAYSGKEP GISASVMLEG
DVTFPYGVHV AVVEVDDYGI ARVKEYRAYD DIGRVINPAL AEGQVHGGGT QAVGQALYEL
AIINENGQLA VTYADYFVPT AVEAPKFKSY FAEKYHPSNY LTKSKGVGEA SLIVGPAAIV
RAIENATGKR FNKTPVTPED ILS