Gene Msed_1206 details


Gene Information

Locus tag: Msed_1206
Symbol:
ID: 5104502
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1179103
End bp: 1180740
Gene Length: 1638 bp
Protein Length: 545 aa
Translation table: 11
GC content: 48%
IMG OID: 640507098
Product: blue (type1) copper domain-containing protein
Protein accession: YP_001191291
Protein GI: 146303975
COG category: [C] Energy production and conversion
COG ID: [COG3794] Plastocyanin
TIGRFAM ID:
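
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values copied from the record above.
length = gene_length(1179103, 1180740)
print(length)                  # 1638
print(protein_length(length))  # 545
```

Both results agree with the Gene Length and Protein Length fields in the record.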


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCAGAA TTTTCAAGAG AATTCCAAAG GCCTCACTAC TCTTTTTCCT AGTAGCAGGG 
CTCTTTGGGC TTCCAATGGC TATCAGCGCA ACTACACCAA CCCCACAACA TTGGATAGTT
TACGTAGGCG GACAGGCAAT GAGTGGGAAC ACCATGATTA TGACCATGGG CTATTTTCCT
GAGATAATCA CCATTGACGT GGGAGATAGT ATCACCTTCG TTATCAACTC AACGGAGCCT
CACACCATCA CATTCCTCAG CGGAAATCCG CCCTTGAATC CCTTCTCTCC ACAGGCCCTA
GCGCCCATAG GGGGATCGGT CTATAACGGC ACGGGGATAG TATCCTCAGG CCTTCTATCT
CAAGGTCAAA ACTACACCTT AACTTTCACT AAGGCTGGAG TTTACGTATA TCAATGTCTG
ATTCATCCAG GTATGATGGG TGTGGTCATC GTTAATCCTG CGGGTACCCC ATACCCAATG
ACTCAGGCAC AGTACGATCA GTTAGCATCA CAACAAAGCT CCCAATCCCT GGCAAGCGGT
TTATCTCTAC TGCAACAGGT TAATCTACCC GCAACCCAAG GTCCTAACGG AACGGTCATC
TGGCATGTGG ACGTCGGTCA GCAGACTCCA GTTAGCACCG AAGTCACCCT TAACTCGATG
AACTCTAAGG TTAGCGGTTC TGCCATCTTG ACAATGACCG CACCAGGAGT GCTCACCGTT
CAGGTTAGTC TTACAGGTTT ACTTCCAGGA GAGACCTACA ATGTTGGTAT CTTCCAAGGA
GCAGCCGAGG CTGGAGGAAA ATCATTATAT AATCTTAACC CAGTGGTTGA AGCCTCAAAC
GGGACTGGAA GCTCTGTCAC AACTCTCACC CTTCCGCCAC TTAGCCCATT TATACCAACG
AGCTTTGGAA TACCCTCTGC CGGATGGTAT ATTAACGTCA GCAACTCTGG TAACGCCGTT
GCTGCCGGAG ATATAATTTT CCCAGTCTCT AGCGTAATGG GATTCCTTCC GAATACCCTA
ACGATACACG CAGGAGATAC TGTGGTGTGG ACAGACGTTG ATCCGGACGA AGTACATACG
GTTACATTTG TTCCACAGGG GATGCCAATT CCTGAGTTTG GAACGCCCAC AAGCCTCATA
CCTACAAAGA GCCATATATT CAATGGAACA GGTTACTATA ACTCCGGCCC CATGATAGCG
GGAGTAAGCT ACAACCTGAC CTTCGTTACT CCAGGAGTTT ATACCTATGT TTGTTTGCTA
CATGACGGCA TGGGTATGGT AGGGACAATA ATCGTGTTGC CTTCCACACC GTCATCGAAT
CCCCAAGCGA CACTTCTTAG CAAGCAGATG TCTGAACTTA ACAATACCCT AAACTCACTA
AACTCACAGG TTAGTCAAAT AGGCTCTCTA ACCTCTCAGG TGGGTCAACT CAACAGTCAA
GTAAGCTCAC TAAACTCACA GGTTAGTCAA ATAGGCTCCC TCAGCTCCCA GATATCATCC
TTGAATGGCT CCCAGGCCTC CTATGAGAAC AGCGTAAACA GCAAGATCTC CTCACTTTAC
TCACTCCTCA CGGTCCTCAT AGTTCTTGTG GTAATCTCTC TAATTCTGAA CGTTGTCCTG
ATAGCTAGAA GAAGGTAA
 
Protein sequence
MRRIFKRIPK ASLLFFLVAG LFGLPMAISA TTPTPQHWIV YVGGQAMSGN TMIMTMGYFP 
EIITIDVGDS ITFVINSTEP HTITFLSGNP PLNPFSPQAL APIGGSVYNG TGIVSSGLLS
QGQNYTLTFT KAGVYVYQCL IHPGMMGVVI VNPAGTPYPM TQAQYDQLAS QQSSQSLASG
LSLLQQVNLP ATQGPNGTVI WHVDVGQQTP VSTEVTLNSM NSKVSGSAIL TMTAPGVLTV
QVSLTGLLPG ETYNVGIFQG AAEAGGKSLY NLNPVVEASN GTGSSVTTLT LPPLSPFIPT
SFGIPSAGWY INVSNSGNAV AAGDIIFPVS SVMGFLPNTL TIHAGDTVVW TDVDPDEVHT
VTFVPQGMPI PEFGTPTSLI PTKSHIFNGT GYYNSGPMIA GVSYNLTFVT PGVYTYVCLL
HDGMGMVGTI IVLPSTPSSN PQATLLSKQM SELNNTLNSL NSQVSQIGSL TSQVGQLNSQ
VSSLNSQVSQ IGSLSSQISS LNGSQASYEN SVNSKISSLY SLLTVLIVLV VISLILNVVL
IARRR
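
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The sketch below strips the whitespace, computes a GC percentage, and translates a coding sequence up to the first stop codon. It assumes that translation table 11 assigns amino acids to codons in the same way as the standard code, so a standard codon table is used; the helper names `clean`, `gc_percent`, and `translate` are illustrative. Only the first and last lines of the gene sequence are used as input here.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses these same assignments for elongation.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_line = clean(
    "ATGCGCAGAA TTTTCAAGAG AATTCCAAAG GCCTCACTAC TCTTTTTCCT AGTAGCAGGG"
)
last_line = clean("ATAGCTAGAA GAAGGTAA")

print(translate(first_line))  # MRRIFKRIPKASLLFFLVAG
print(translate(last_line))   # IARRR
```

The two translated fragments match the first twenty and the last five residues of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_percent` gives a value that can be compared with the GC content field.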