Gene Information |
Locus tag | Msed_2032 |
Symbol | |
ID | 5105254 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1958264 |
End bp | 1960021 |
Gene Length | 1758 bp |
Protein Length | 585 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640507920 |
Product | cytochrome c oxidase, subunit I |
Protein accession | YP_001192096 |
Protein GI | 146304780 |
COG category | [C] Energy production and conversion |
COG ID | [COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1 |
TIGRFAM ID | |
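The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # Assumption: the terminal stop codon is part of the CDS but not the protein.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(1958264, 1960021)
print(length)                     # 1758
print(protein_length_aa(length))  # 585
```

Under these assumptions the values agree with the Gene Length (1758 bp) and Protein Length (585 aa) fields of the record.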
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.329827 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGACTAA AGGAGTTTGT TGTTTCGTTG TTCCAATTGG ACAAGGACTG GACTACTAGG ATTGTCATGG CTATGCTAGT CATGGGAGTG ATTTGGGGCC TATTGGGAGT GATAGACTCC CTTATGGTGA GGATACAAGA GACCGCGTGG GGGCTGAGTG GAACCCTAGT GTTCACTCCG CAGGAGTACT TCGCCAGCAT CACGTTGCAT GCTGAGAGGG ACCTCTTCGG ATTCGCTCAG CAGGTGATAT ATGCCATTTT CATTTATTTC ACAATTAAGC TTCTTAACCT GCAACCTAGG GCTAAGTGGT TGCTCAACAT CTCCTTCATC CTGATTAACA TTTCAATGAT GTTTATGGAA GGGCCGATTG TCGTTGCTCC CACTTTTAAT GACAACTACT TCAGTGCTAC CGACTGGTAC TACATATCTC CCATGGGAAT TCCCAACTAC TCCAATTACG TGGTATCTCC GCTATTCTTT TACGGTTGGC TTCTACTGGA CGCCTTCACC TATCTAGCGG GAATATGGAT CGTGTACCAT TATTATATAG CGTCCAAGCA ACTTAAGGAG AAGCTACCTG TTCCTCTGGT ATTTTTCCTC ATGAATACCT TGCTTTTCAT GATAGGCTAC TCTGGGGTAA CCGCAGCTGA TGTATGGGAC ATACTAGCGT TCTATAACCT GGTGCCCCTC AACGCAATTG CGAACCAAAT AGCCTTCTGG ATCTTTGGGC ATGCTATCGT GTATATGGCG TGGATGCCTG CAGTTGGAGC ACTTTATCTT CTTATTCCAA CACTAGCTAA CAAACCGCTT TACAGCGATA GAATGGGTAG AATTTCTGCT CTGCTCTACT TAATATTTTC CAACAATGTA CCCATTCATC ACCTATACAT GGTTAACCTT CCAGTCTCTA TCAAGATACT CCAGGAAGTT CTTACGTATG CAGTTGTGGT CCCCTCAATG TTGACCTTCT TCAATCTATG GGCTACGGTA AAGGGCGCAC AGGTTAAGTT CAACGTGATA ACTGCTTTCA CTGTGACATC CTTCTCTGGG GCCATAGCTG CAGGAGTTAC CGGTATATCT AACGCAACCA TAGCCTTCGA CGCAATAATT CACAACACCG ATTGGGTAGT GTCCCACTTC CATGCAATGA TACTGCTGTC CATAGTTCCA GCAGCAATGG CAGTGCTTTA CTTCATGATA CCCATGATGA CTGGGAAGCA GTGGTTCTCA TCCAAGATGG CGTGGGTTCA TTGGATTGGT TACGTGTTTG GATCCATTCT CTTCATAGTG GGCTATGAAT TGCAGGGATT CGAGGGCTTG GTGAGAAGGG CTGAGATCTA TCCTAGAGTA CCGACTCTAA TTACGGCAGA GGTTATCTCC ACTGTGGGTG CAGTAATAGC AGAGCTTGCT ACCCTAGTTT GGTTCCTGAA CCTGGTCCTT ACCCTAGTTA AGGGTAGAAA TATGAACCTA GAGGGAGTAG GGCTTGGCCA GCTCATTGGC ACCGTGGGAG CTGCACTAGA GTGGAATGGA GAAAACATTA ATATTCCAAG TTTATTCAGT AAAAACATGA TAAAGAAAGG ATTAAGTGGC CTGTGGACTC TGGGAATATT GGGTGCACTT GTGATAGTGA TTAGCATGTT CCCACTGGCC TTCTCTGGTA ACACTTACAA CGCAATGCCA TGGATCTGGA TAGTCCTGCT GTCCATAGGG ATAGTGCTGA TCTCGTATCC CGTATTAAAG GGGGCTAAAA GTTTATGA
|
Protein sequence | MGLKEFVVSL FQLDKDWTTR IVMAMLVMGV IWGLLGVIDS LMVRIQETAW GLSGTLVFTP QEYFASITLH AERDLFGFAQ QVIYAIFIYF TIKLLNLQPR AKWLLNISFI LINISMMFME GPIVVAPTFN DNYFSATDWY YISPMGIPNY SNYVVSPLFF YGWLLLDAFT YLAGIWIVYH YYIASKQLKE KLPVPLVFFL MNTLLFMIGY SGVTAADVWD ILAFYNLVPL NAIANQIAFW IFGHAIVYMA WMPAVGALYL LIPTLANKPL YSDRMGRISA LLYLIFSNNV PIHHLYMVNL PVSIKILQEV LTYAVVVPSM LTFFNLWATV KGAQVKFNVI TAFTVTSFSG AIAAGVTGIS NATIAFDAII HNTDWVVSHF HAMILLSIVP AAMAVLYFMI PMMTGKQWFS SKMAWVHWIG YVFGSILFIV GYELQGFEGL VRRAEIYPRV PTLITAEVIS TVGAVIAELA TLVWFLNLVL TLVKGRNMNL EGVGLGQLIG TVGAALEWNG ENINIPSLFS KNMIKKGLSG LWTLGILGAL VIVISMFPLA FSGNTYNAMP WIWIVLLSIG IVLISYPVLK GAKSL
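The relationship between the gene sequence, the GC content field, and the protein sequence can be illustrated with a short sketch. It assumes the gene sequence is listed in coding orientation (it starts with ATG), strips the 10-base spacing used in the record, and uses the NCBI codon ordering whose amino acid assignments are shared by translation tables 1 and 11. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first three 10-base blocks of the gene are used here for brevity.

```python
from itertools import product

# NCBI codon order: first base varies slowest, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
prefix = clean("ATGGGACTAA AGGAGTTTGT TGTTTCGTTG")
print(translate(prefix))    # MGLKEFVVSL
print(gc_fraction(prefix))  # 0.4
```

The translated prefix matches the first block of the protein sequence. Applying `gc_fraction` to the full cleaned gene sequence would give the value to compare with the GC content field; this sketch does not handle alternative start codons, which table 11 permits.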