Gene Msed_0290 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0290 
Symbol 
ID5104926 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp245965 
End bp247617 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content47% 
IMG OID640506196 
Productcytochrome-c oxidase 
Protein accessionYP_001190391 
Protein GI146303075 
COG category[C] Energy production and conversion 
COG ID[COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0048384 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.666159 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTTTC AAGTCAAAGA TATTTTAAAA ATGGTTTCCA ATTACCTTAG TATTTTTTCT 
GCCTCTAATA TAAAGAAAGT GCTGTTCCCA TCGACTACCT CTGGGGTCAT ATGGCAATAC
TTTGCTGGTT CTCTGGCATG GCTAGCGGTA GTAGGTATGG CAGCTATGAA CCTCAGGACC
TACCTAACCT ACCCTTCAAA CAGTCCTGAG GTTGGAGTCA CTTACTATGC CTTCTTAACC
CTTCATGGCT GGTCAGCCAT GCTGGGGCTA GTGCCCTTCG CTGCGATATC CGTGATAGCT
TACTCCATGT ACAAGGACGG AATGAGCATC AGGAGAACAA AATTGATGAG TGGGATGTTT
TGGCTAGCCA ACGCAGGCCT CATCTTCGCT CTCCTCGGAG GACCAGACAT GGGGTGGTAC
ATGTACCCGC CCCTGGCTGT AGAGGACAAC TCCAATTTTC ATGCTTTTCT TAACTATCAC
GGAGCATTAA TGGGTATAGC CTATCTCGCC TTGGCCTTAA GCTCATTGGC TCAGACCATT
GCCACTGTTA ACTTGGTAAG CGATGCCTAT GCGACCAAAC CCAAGGGACA GAAACTGGGG
ATATTCTCAG CCTACGGTGT AGCCTTTGCG GTCATTATTG CGTTAACTTT ACCAGCGTTA
ACAGCGGGTG AGTTGTGGTA TACCCTTAAC ATTCTAGCAG GGGTCCCAAT CAACACCTTA
CTATGGCTAG TGCTTTTCTG GTTCTACGGT CACCCCGTAG TGTATTATGT GCCTTTCCCT
CTGTTCGGTG CACTTTACTA CTTTGTTCCC AAGTTTTCAG GGAGGCCCCT ATTCAGCGAA
AAGTGGGCAA GGTGGAACAT TTACCTCTTG GCCATTGGTT CCATGTTGAT ATGGGTTCAT
CACCTCCAGA CCTTCCCAAT TCCGGTCCCG GTGAGGCTAT GGATAAACCT GTCGACCTTG
GTGTTGGCTT CTGGTTCAGG TTTAACGGTG CTTAACCTTG GTTTGACGGT TCTAACGAGC
AAGGGGTATA ACTACAAGGA TCCAGTGGGA ATGGCGACCT TAATGGCGCT CATAGGGTTT
ATTCTAGGCG GAGTTCAGGC AGTTCCTCTT CCCATGTTCC CCATCAATCC CATAGTCCAC
AATACTTACT ACGTCGTAGG TCATTTCCAC CTCGTTATCT GGACCCTGAT ACTCATGGGA
TTCACTGCAG TTTTCCTAGA CGTTCTCAGA ACTGTGAGGC CAGGATTTGA CTACAGCAAG
TCGGCAACAA GGCTAATAAA CGCGGGCATA CTTGTGTGGA CCATTCCCTT CGTGATAGTT
GGTTACCTGA TGAGCATGGA AGGCTACATG GGGATGCTAA GAAGGGTTAT TGCTTATCCC
ACGACCTTCT ACCCATACAA TCTTTCAATT TCACTTCTAG CTGAGATAGG GATTGCGGGT
ATAGTTATGG CAGTAGGGTC GGCATTGGTG GAGTTCCTAA CTTACTCTCC CTCCACCACA
GTAAGCGTTT CATCTGGATC TGGATCATCT ACTCCTTCGA TCTCCTTAGC CACAGATCAA
AATGACAAAA AGGGAGAATT TTTTGATAAT CTTAAACTTA AGCTTAATAA CAGTGTTTAT
GGAAAATCTT CCGAATTAAG AAGGGTGAGA TAA
 
Protein sequence
MSFQVKDILK MVSNYLSIFS ASNIKKVLFP STTSGVIWQY FAGSLAWLAV VGMAAMNLRT 
YLTYPSNSPE VGVTYYAFLT LHGWSAMLGL VPFAAISVIA YSMYKDGMSI RRTKLMSGMF
WLANAGLIFA LLGGPDMGWY MYPPLAVEDN SNFHAFLNYH GALMGIAYLA LALSSLAQTI
ATVNLVSDAY ATKPKGQKLG IFSAYGVAFA VIIALTLPAL TAGELWYTLN ILAGVPINTL
LWLVLFWFYG HPVVYYVPFP LFGALYYFVP KFSGRPLFSE KWARWNIYLL AIGSMLIWVH
HLQTFPIPVP VRLWINLSTL VLASGSGLTV LNLGLTVLTS KGYNYKDPVG MATLMALIGF
ILGGVQAVPL PMFPINPIVH NTYYVVGHFH LVIWTLILMG FTAVFLDVLR TVRPGFDYSK
SATRLINAGI LVWTIPFVIV GYLMSMEGYM GMLRRVIAYP TTFYPYNLSI SLLAEIGIAG
IVMAVGSALV EFLTYSPSTT VSVSSGSGSS TPSISLATDQ NDKKGEFFDN LKLKLNNSVY
GKSSELRRVR