Gene Msed_0288 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0288 
Symbol 
ID5104924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp244179 
End bp245264 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content46% 
IMG OID640506194 
ProductRieske (2Fe-2S) domain-containing protein 
Protein accessionYP_001190389 
Protein GI146303073 
COG category[C] Energy production and conversion 
COG ID[COG0723] Rieske Fe-S protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.37556 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.576557 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATTTTT ATTTTGTTTT CATCAGAAAG ATCAAGAGAG ATAAATTTGA TAAGATCTAT 
TATAACAAAA TATTTAAACC TTTCACTCAA CAAACATTTA ATGTGATAGT TATGGGTAGG
CATTTCGCGC TGAAGAGGGA CGATTTCATT TTTGCTACAA GATTGATAAG AAAAATGCGA
GATCCCAAGA CGAAGTTCGA CGAGAAAAAG TTTGCTGAGA AAGGGAGAGA TTACCTATAT
AATTACGCCG AGGAAAAAGT AGGTCCATTA AGCCCTGGAA GGAGGATGTT CCTCAAGGGA
ATACTTATTG GGATAGGCGC GCTTGCGGTG GCTAGCGCAG TCCCCGTTAT CTCCTATCTT
AATCAGCCCC CTGTCTACAT CAAAAACTTT CCATGGATAA TTATAGTCGA TTCTGATGGC
AACCCCATCG AGGCGTCTAA TCTACAGGTC AACGATCCCT CCATCCTGTT GTTCCAGTAT
CCCATGGAGG GAGACATAAC CTTCCTCATA AACATGGGTG ACGCAAACGA CAACCCTGTG
GCGATTCCCT CAACTAATGT TGTGATTCCC GAGGATGGTA GCACCTATAC CTTCCCTGGA
GGGGTAGGAC CTCACAACTC CATCGTCGCG TTTAGCGCAA TATGTCAGCA CCTAGGTTGT
CAGCCCCCTG AGATTCATTT CTATCCGCCC AAGTACCTTG CTCCGGGAGG TGTAACTCCC
AACTATCTTC CACCCGTTGC GTACCAGGCA GCACAAAATG CAGGTGCACC CTCCGTGATA
CATTGTGACT GCCACGGCTC TACCTATGAT CCTTCCAAGG GAGCCGCAGT CCTGACGGGG
CCAACTCAGA GACCTCTACC CTATGTGGAG CTCTACTGGG ACCAGAATAC AGACTACCTT
TACGCTGTAG GAATGAACCT AAAGGCTCCA GTAATCATGG GGCAGCCCTC AGACCTAGCG
AGCTTCGCAT ATTTATCCTC GTATAATGAG CAAACTGGTT GTCCAAAGAT GCTCTTGAGC
AAGGGCCAGA CTCCATCTCA GTGCTATTCA AAGCTTAATA ACGAGGGAGA CACATTCTCC
TCCTAA
 
Protein sequence
MYFYFVFIRK IKRDKFDKIY YNKIFKPFTQ QTFNVIVMGR HFALKRDDFI FATRLIRKMR 
DPKTKFDEKK FAEKGRDYLY NYAEEKVGPL SPGRRMFLKG ILIGIGALAV ASAVPVISYL
NQPPVYIKNF PWIIIVDSDG NPIEASNLQV NDPSILLFQY PMEGDITFLI NMGDANDNPV
AIPSTNVVIP EDGSTYTFPG GVGPHNSIVA FSAICQHLGC QPPEIHFYPP KYLAPGGVTP
NYLPPVAYQA AQNAGAPSVI HCDCHGSTYD PSKGAAVLTG PTQRPLPYVE LYWDQNTDYL
YAVGMNLKAP VIMGQPSDLA SFAYLSSYNE QTGCPKMLLS KGQTPSQCYS KLNNEGDTFS
S