Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1206 |
Symbol | |
ID | 5104502 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1179103 |
End bp | 1180740 |
Gene Length | 1638 bp |
Protein Length | 545 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640507098 |
Product | blue (type1) copper domain-containing protein |
Protein accession | YP_001191291 |
Protein GI | 146303975 |
COG category | [C] Energy production and conversion |
COG ID | [COG3794] Plastocyanin |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCAGAA TTTTCAAGAG AATTCCAAAG GCCTCACTAC TCTTTTTCCT AGTAGCAGGG CTCTTTGGGC TTCCAATGGC TATCAGCGCA ACTACACCAA CCCCACAACA TTGGATAGTT TACGTAGGCG GACAGGCAAT GAGTGGGAAC ACCATGATTA TGACCATGGG CTATTTTCCT GAGATAATCA CCATTGACGT GGGAGATAGT ATCACCTTCG TTATCAACTC AACGGAGCCT CACACCATCA CATTCCTCAG CGGAAATCCG CCCTTGAATC CCTTCTCTCC ACAGGCCCTA GCGCCCATAG GGGGATCGGT CTATAACGGC ACGGGGATAG TATCCTCAGG CCTTCTATCT CAAGGTCAAA ACTACACCTT AACTTTCACT AAGGCTGGAG TTTACGTATA TCAATGTCTG ATTCATCCAG GTATGATGGG TGTGGTCATC GTTAATCCTG CGGGTACCCC ATACCCAATG ACTCAGGCAC AGTACGATCA GTTAGCATCA CAACAAAGCT CCCAATCCCT GGCAAGCGGT TTATCTCTAC TGCAACAGGT TAATCTACCC GCAACCCAAG GTCCTAACGG AACGGTCATC TGGCATGTGG ACGTCGGTCA GCAGACTCCA GTTAGCACCG AAGTCACCCT TAACTCGATG AACTCTAAGG TTAGCGGTTC TGCCATCTTG ACAATGACCG CACCAGGAGT GCTCACCGTT CAGGTTAGTC TTACAGGTTT ACTTCCAGGA GAGACCTACA ATGTTGGTAT CTTCCAAGGA GCAGCCGAGG CTGGAGGAAA ATCATTATAT AATCTTAACC CAGTGGTTGA AGCCTCAAAC GGGACTGGAA GCTCTGTCAC AACTCTCACC CTTCCGCCAC TTAGCCCATT TATACCAACG AGCTTTGGAA TACCCTCTGC CGGATGGTAT ATTAACGTCA GCAACTCTGG TAACGCCGTT GCTGCCGGAG ATATAATTTT CCCAGTCTCT AGCGTAATGG GATTCCTTCC GAATACCCTA ACGATACACG CAGGAGATAC TGTGGTGTGG ACAGACGTTG ATCCGGACGA AGTACATACG GTTACATTTG TTCCACAGGG GATGCCAATT CCTGAGTTTG GAACGCCCAC AAGCCTCATA CCTACAAAGA GCCATATATT CAATGGAACA GGTTACTATA ACTCCGGCCC CATGATAGCG GGAGTAAGCT ACAACCTGAC CTTCGTTACT CCAGGAGTTT ATACCTATGT TTGTTTGCTA CATGACGGCA TGGGTATGGT AGGGACAATA ATCGTGTTGC CTTCCACACC GTCATCGAAT CCCCAAGCGA CACTTCTTAG CAAGCAGATG TCTGAACTTA ACAATACCCT AAACTCACTA AACTCACAGG TTAGTCAAAT AGGCTCTCTA ACCTCTCAGG TGGGTCAACT CAACAGTCAA GTAAGCTCAC TAAACTCACA GGTTAGTCAA ATAGGCTCCC TCAGCTCCCA GATATCATCC TTGAATGGCT CCCAGGCCTC CTATGAGAAC AGCGTAAACA GCAAGATCTC CTCACTTTAC TCACTCCTCA CGGTCCTCAT AGTTCTTGTG GTAATCTCTC TAATTCTGAA CGTTGTCCTG ATAGCTAGAA GAAGGTAA
|
Protein sequence | MRRIFKRIPK ASLLFFLVAG LFGLPMAISA TTPTPQHWIV YVGGQAMSGN TMIMTMGYFP EIITIDVGDS ITFVINSTEP HTITFLSGNP PLNPFSPQAL APIGGSVYNG TGIVSSGLLS QGQNYTLTFT KAGVYVYQCL IHPGMMGVVI VNPAGTPYPM TQAQYDQLAS QQSSQSLASG LSLLQQVNLP ATQGPNGTVI WHVDVGQQTP VSTEVTLNSM NSKVSGSAIL TMTAPGVLTV QVSLTGLLPG ETYNVGIFQG AAEAGGKSLY NLNPVVEASN GTGSSVTTLT LPPLSPFIPT SFGIPSAGWY INVSNSGNAV AAGDIIFPVS SVMGFLPNTL TIHAGDTVVW TDVDPDEVHT VTFVPQGMPI PEFGTPTSLI PTKSHIFNGT GYYNSGPMIA GVSYNLTFVT PGVYTYVCLL HDGMGMVGTI IVLPSTPSSN PQATLLSKQM SELNNTLNSL NSQVSQIGSL TSQVGQLNSQ VSSLNSQVSQ IGSLSSQISS LNGSQASYEN SVNSKISSLY SLLTVLIVLV VISLILNVVL IARRR
|
| |