Gene Msed_0020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0020 
Symbol 
ID5105159 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp18084 
End bp19829 
Gene Length1746 bp 
Protein Length581 aa 
Translation table11 
GC content44% 
IMG OID640505914 
Productradical SAM domain-containing protein 
Protein accessionYP_001190121 
Protein GI146302805 
COG category[R] General function prediction only 
COG ID[COG1964] Predicted Fe-S oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00170872 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGCTCAAG TTGAGACAGA AAATAAGGGT TTTAGACTGC TACCAGCGCC ATCAAAATTC 
GAAAATGGGG AAGTAAAGTT TGGCGACAGG GACATCAAGA TTGGGGGACC TTTACCTAAG
CTTGCCCAGG ACGAGAAGCT TATCAGGGTT ACGCATTCTT TGTGTCCTGC TTGCTACAGG
CTATTACCAG CAACCATATT TGAGAAGGAC GAAAAGATGT ACATTAGGAA GATATGTCCT
GAGCATGGAG AGTTTGAGGA CCTTTATTAT GGAGATGTGG GGATGTACTA TAAGTTCGAT
TACTGGGAGT ATGAGGGTAA GGGTCCTAGA GTTCCTTACG TTGACCTTAA GTCTCCCTGT
CCTTATAACT GTGGACTATG TCCAATGCAT CATCAGCACT CGGCACTTGT GAACCTGGTT
ATAACTAACA GATGTGATTT GTCTTGTTGG TACTGCTTCT TCTATGCCGA GAAGGCTGGC
TACGTCTTTG AGCCAACACT GGAGCAGATA AAGTTCATGG TAGATCAGTT AAAGAGGCAG
GATACTACCA TAGTTATCCA GGTTACAGGA GGCGAACCTA CACTTAGGGA AGATATTGTT
GAGGTAATTA AGCTCCTCAG GGAGTCAGGT GTCAGACATA TTCAGCTCAA CAGTTGGGGG
GGAACTTTCG CCAAGATGTA CATGGCTGAT CCGGATAAGG CAGTGAGATA CGCCATAGCC
CTGAGGGAGG CAGGAGTCAA CACTGTTTAC ATGAGCTTTG ACGGGACTAC CAGAAAGACC
AATCCCAAGA ATCATTGGGA GGTTCCTTAC ACCTTAGAAG TGTTCAGAAG GGCAGGGATG
ACCAGCGTCG TTCTAGTACC CACAGTTATC AAGACTGTAA ATGACCATGA TCTAGGAAAT
ATAGTGAAAT TCGCTGCAAG GAATATGGAC GTTGTGAGAG CCATAAACTT CCAACCTGTC
AGCTTGACAG GAATGATGAA GAGGAATATG AGGGCCAAGT TTAGAATAAC CATTCCAGAA
GTCCTGAAGA ACATAGAGGA CCAGACGGAC GGCGAGGTCA CTAGGGACAG TTGGTATCCA
ATAGGCACTT CAGTCGTGTT CTCGAGATTA GTCGAGGCTT TAACAGGTAA GGAGCAATTC
GAAATGGCCA ATCACCCAAG TTGCGGTGCT GGCACCTATA TCTACATAGA GTGGAGAAAC
GGAGAACCAC ACTTCATACC CATTTCCAAG TTCATTGACC TAGAGGGTCT TCTAGAGTAC
TTCAAGGAGA AAGCAGAGGA ACTCAAGGAA GGAGCTAACA AGTACTGGAT TGGGATGAAG
TTACTCTATA ACGTCAGGAA GTTCATCGAC AAGGAGAAAG GACCCAAGGA CTTCGATGTA
TATAAGATGC TGTATAATGT GGTGGTAAGT CACAACTATG AAGCTCTAGG CGAATGGCAT
TACAGGACAT TGTTCCTGGG AACCATGCAC TTCATGGACC TCTACAATTA CGATATTAAC
AGGGTCATGA GATGCGATAT TCACTATGTT GTACCAGATG GGAGAGTAAT TCCATTCTGT
ACCTACAACG TTCTTAATGA TCTATACAGA GATAAAGTAC TCAGGGAGTA TCAGATACCT
CTAGATGATT GGATAAAGAA GCATGGGGAG AACAGTCTTG GAGATGCAAT GAAGTATAGA
AGGGTTGCAA GTACTCTAGA GAAGGGAGAA ATATACAAGG AGACGTACAA GTACTTTAAC
GAGTGA
 
Protein sequence
MAQVETENKG FRLLPAPSKF ENGEVKFGDR DIKIGGPLPK LAQDEKLIRV THSLCPACYR 
LLPATIFEKD EKMYIRKICP EHGEFEDLYY GDVGMYYKFD YWEYEGKGPR VPYVDLKSPC
PYNCGLCPMH HQHSALVNLV ITNRCDLSCW YCFFYAEKAG YVFEPTLEQI KFMVDQLKRQ
DTTIVIQVTG GEPTLREDIV EVIKLLRESG VRHIQLNSWG GTFAKMYMAD PDKAVRYAIA
LREAGVNTVY MSFDGTTRKT NPKNHWEVPY TLEVFRRAGM TSVVLVPTVI KTVNDHDLGN
IVKFAARNMD VVRAINFQPV SLTGMMKRNM RAKFRITIPE VLKNIEDQTD GEVTRDSWYP
IGTSVVFSRL VEALTGKEQF EMANHPSCGA GTYIYIEWRN GEPHFIPISK FIDLEGLLEY
FKEKAEELKE GANKYWIGMK LLYNVRKFID KEKGPKDFDV YKMLYNVVVS HNYEALGEWH
YRTLFLGTMH FMDLYNYDIN RVMRCDIHYV VPDGRVIPFC TYNVLNDLYR DKVLREYQIP
LDDWIKKHGE NSLGDAMKYR RVASTLEKGE IYKETYKYFN E