Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0290 |
Symbol | |
ID | 5104926 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 245965 |
End bp | 247617 |
Gene Length | 1653 bp |
Protein Length | 550 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640506196 |
Product | cytochrome-c oxidase |
Protein accession | YP_001190391 |
Protein GI | 146303075 |
COG category | [C] Energy production and conversion |
COG ID | [COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0048384 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.666159 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTTTC AAGTCAAAGA TATTTTAAAA ATGGTTTCCA ATTACCTTAG TATTTTTTCT GCCTCTAATA TAAAGAAAGT GCTGTTCCCA TCGACTACCT CTGGGGTCAT ATGGCAATAC TTTGCTGGTT CTCTGGCATG GCTAGCGGTA GTAGGTATGG CAGCTATGAA CCTCAGGACC TACCTAACCT ACCCTTCAAA CAGTCCTGAG GTTGGAGTCA CTTACTATGC CTTCTTAACC CTTCATGGCT GGTCAGCCAT GCTGGGGCTA GTGCCCTTCG CTGCGATATC CGTGATAGCT TACTCCATGT ACAAGGACGG AATGAGCATC AGGAGAACAA AATTGATGAG TGGGATGTTT TGGCTAGCCA ACGCAGGCCT CATCTTCGCT CTCCTCGGAG GACCAGACAT GGGGTGGTAC ATGTACCCGC CCCTGGCTGT AGAGGACAAC TCCAATTTTC ATGCTTTTCT TAACTATCAC GGAGCATTAA TGGGTATAGC CTATCTCGCC TTGGCCTTAA GCTCATTGGC TCAGACCATT GCCACTGTTA ACTTGGTAAG CGATGCCTAT GCGACCAAAC CCAAGGGACA GAAACTGGGG ATATTCTCAG CCTACGGTGT AGCCTTTGCG GTCATTATTG CGTTAACTTT ACCAGCGTTA ACAGCGGGTG AGTTGTGGTA TACCCTTAAC ATTCTAGCAG GGGTCCCAAT CAACACCTTA CTATGGCTAG TGCTTTTCTG GTTCTACGGT CACCCCGTAG TGTATTATGT GCCTTTCCCT CTGTTCGGTG CACTTTACTA CTTTGTTCCC AAGTTTTCAG GGAGGCCCCT ATTCAGCGAA AAGTGGGCAA GGTGGAACAT TTACCTCTTG GCCATTGGTT CCATGTTGAT ATGGGTTCAT CACCTCCAGA CCTTCCCAAT TCCGGTCCCG GTGAGGCTAT GGATAAACCT GTCGACCTTG GTGTTGGCTT CTGGTTCAGG TTTAACGGTG CTTAACCTTG GTTTGACGGT TCTAACGAGC AAGGGGTATA ACTACAAGGA TCCAGTGGGA ATGGCGACCT TAATGGCGCT CATAGGGTTT ATTCTAGGCG GAGTTCAGGC AGTTCCTCTT CCCATGTTCC CCATCAATCC CATAGTCCAC AATACTTACT ACGTCGTAGG TCATTTCCAC CTCGTTATCT GGACCCTGAT ACTCATGGGA TTCACTGCAG TTTTCCTAGA CGTTCTCAGA ACTGTGAGGC CAGGATTTGA CTACAGCAAG TCGGCAACAA GGCTAATAAA CGCGGGCATA CTTGTGTGGA CCATTCCCTT CGTGATAGTT GGTTACCTGA TGAGCATGGA AGGCTACATG GGGATGCTAA GAAGGGTTAT TGCTTATCCC ACGACCTTCT ACCCATACAA TCTTTCAATT TCACTTCTAG CTGAGATAGG GATTGCGGGT ATAGTTATGG CAGTAGGGTC GGCATTGGTG GAGTTCCTAA CTTACTCTCC CTCCACCACA GTAAGCGTTT CATCTGGATC TGGATCATCT ACTCCTTCGA TCTCCTTAGC CACAGATCAA AATGACAAAA AGGGAGAATT TTTTGATAAT CTTAAACTTA AGCTTAATAA CAGTGTTTAT GGAAAATCTT CCGAATTAAG AAGGGTGAGA TAA
|
Protein sequence | MSFQVKDILK MVSNYLSIFS ASNIKKVLFP STTSGVIWQY FAGSLAWLAV VGMAAMNLRT YLTYPSNSPE VGVTYYAFLT LHGWSAMLGL VPFAAISVIA YSMYKDGMSI RRTKLMSGMF WLANAGLIFA LLGGPDMGWY MYPPLAVEDN SNFHAFLNYH GALMGIAYLA LALSSLAQTI ATVNLVSDAY ATKPKGQKLG IFSAYGVAFA VIIALTLPAL TAGELWYTLN ILAGVPINTL LWLVLFWFYG HPVVYYVPFP LFGALYYFVP KFSGRPLFSE KWARWNIYLL AIGSMLIWVH HLQTFPIPVP VRLWINLSTL VLASGSGLTV LNLGLTVLTS KGYNYKDPVG MATLMALIGF ILGGVQAVPL PMFPINPIVH NTYYVVGHFH LVIWTLILMG FTAVFLDVLR TVRPGFDYSK SATRLINAGI LVWTIPFVIV GYLMSMEGYM GMLRRVIAYP TTFYPYNLSI SLLAEIGIAG IVMAVGSALV EFLTYSPSTT VSVSSGSGSS TPSISLATDQ NDKKGEFFDN LKLKLNNSVY GKSSELRRVR
|
| |