Gene Msed_0485 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0485 
Symbol 
ID5103647 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp440930 
End bp442687 
Gene Length1758 bp 
Protein Length585 aa 
Translation table11 
GC content48% 
IMG OID640506391 
Productcytochrome c oxidase, subunit I 
Protein accessionYP_001190586 
Protein GI146303270 
COG category[C] Energy production and conversion 
COG ID[COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGTCCA AAGAAGAAGT TCGAAGGCAA GATAGAGAAA AACTTCTAGA ATGGGCAAGG 
CAATATCAAG AAGTCCAAGA GAGAGAAGCC AGAAGAAACA TCTTCATGAA AGTGTTGTAT
TCCGAAGACT ATAAAACTAT GGCTATAAAA TTAATATTTG CTGGATTAGT ATTTCTATTC
ATAGGCGGTG CGTTTGCCCT ATTCATTAGG GGGCAAGCTG GACTCAGTAG CGAAGGCGTA
CCGGTAGTCC TCGACCCATC ATACTACTTC CAGGCCATGA CCAACCACGT CATGGACATG
ATCTTCGGTG CAGTATTCAA TGTAGTGTTT GCGGTATCTT TCTACATGAT ACCTGCTATG
AACGGATCCA GACTCATAAA GTGGCCAAAG CTAGCTAATG CAGGTTTCTG GATAAGCATT
ATCGCGCTGT TTATGATGAA CTTCGGCGGG GTTGAGAACC AGTATCTCTT CACGTTCTTG
AACCCATTAA AGGCATCTCC CACCTGGTAC ATAGGCTATG GTATGATGGT TGTGGGCGAA
TGGATGGAAA TGGCTTCCGT GCTAGGTACT TCCTTTCAAG GAAGAGTCCC AGGTAGACTG
GTCCCCACTG CCATTGGGTT CATTGTCATG GACATGATAA TGATGGCCTT AGCTAACATA
TCTGTATTCA TTGCTGACAT GTGGAGCTTA TTCTCACCCA TAGGTGGCCT GAACATATAC
CTGTTCGGTA TTCCCAACGC TGAGGTGTGG AAGGGATTGT TCTGGTTCGC TGACCATCCC
CTAGTGTATT TCGCCCCATA TGCATTAACC GGTGCCATAG TTGCCATAAC TCCGCTCTAC
GCGAGAAGAC CAATGTATAG CGTGAGGTTT ACGAGATGGC TCATCCCAGT CCTCTTCGTG
TTGGGATCTA GCGTCTACGT TCACCACATA GTGGACGACC CGTGGCCCCT GATCCTGAGG
GATATATTCG CCCAGACCAG CACTGCACTT ATTGCCGTTC CCTTCGCTGC CCTCTGGCTA
CTCTTCTTCA TCACCCTAGG CGATCCCAGG AAGTTGAAGT GGGATCCAGG GTTCGCCTTC
ATATTTGCCG CAGCCGTCTG GAACATAATC GGAGGAATAC AGGCTGAGCC CACCAATCCA
ACACCTTCGC TGGACCCAAC GATACATAAC ACAGGATGGA TATTCGGTCA CTTCCACATA
ATGCTAGCAA TATACTCCGT AGGCGGTTTG CTCGGTGCCT TATACGTAGT GGGACCTGAT
CTCTTCGGCA AGAACTGGTA CAGCACCAAG CTAGGATGGT TGCACTTCTG GGGATGGCAG
GCAGGTATGG GATTGTTCGC CATAGCTTCA AGTGAGGGTG GATTCTTCGG GCTAATCAGA
AGGGAAGTAG CGTGGGCCGG CTTCTACGAG GTTTACTACC AGCTCCTACT AATCGGTGGT
TGGCTAGCAG GCTTCGCGAC CATAATATTT GCCTACAACT TGATATTGAC TCTACTATAT
GGCGAGAAAG TGCCTAAGAC AGACATACCC ATGTGGGCAG TTCAGACGGT TGCCATGGAA
AGGTATGCGA TGAGAAGGGA AGGATATAAG GAGGAAGAAA TGCCAGTGGC TCTTCCCGCT
GACGGTATGA TAAGGCTAGA GGAGAACTCG GGAATTTCCA ATGGTACCCC TGTTGGAGCA
AAGGCGCTAG ATAACAACTC GCTAGATGTT AGTAAGGGAA GTCCTAGTAA GGGACTGACG
GATCCAGGAA AAAGTTAA
 
Protein sequence
MASKEEVRRQ DREKLLEWAR QYQEVQEREA RRNIFMKVLY SEDYKTMAIK LIFAGLVFLF 
IGGAFALFIR GQAGLSSEGV PVVLDPSYYF QAMTNHVMDM IFGAVFNVVF AVSFYMIPAM
NGSRLIKWPK LANAGFWISI IALFMMNFGG VENQYLFTFL NPLKASPTWY IGYGMMVVGE
WMEMASVLGT SFQGRVPGRL VPTAIGFIVM DMIMMALANI SVFIADMWSL FSPIGGLNIY
LFGIPNAEVW KGLFWFADHP LVYFAPYALT GAIVAITPLY ARRPMYSVRF TRWLIPVLFV
LGSSVYVHHI VDDPWPLILR DIFAQTSTAL IAVPFAALWL LFFITLGDPR KLKWDPGFAF
IFAAAVWNII GGIQAEPTNP TPSLDPTIHN TGWIFGHFHI MLAIYSVGGL LGALYVVGPD
LFGKNWYSTK LGWLHFWGWQ AGMGLFAIAS SEGGFFGLIR REVAWAGFYE VYYQLLLIGG
WLAGFATIIF AYNLILTLLY GEKVPKTDIP MWAVQTVAME RYAMRREGYK EEEMPVALPA
DGMIRLEENS GISNGTPVGA KALDNNSLDV SKGSPSKGLT DPGKS