Gene Msed_0333 details


Gene Information

Locus tag: Msed_0333
Symbol:
ID: 5105491
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 290429
End bp: 291634
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 51%
IMG OID: 640506239
Product: FAD dependent oxidoreductase
Protein accession: YP_001190434
Protein GI: 146303118
COG category: [C] Energy production and conversion
COG ID: [COG0644] Dehydrogenases (flavoproteins)
TIGRFAM ID:
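
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon; the function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates (assumed 1-based)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int) -> int:
    """Codon count minus one, assuming the CDS ends with exactly one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(290429, 291634)
print(length)                              # 1206
print(expected_protein_length_aa(length))  # 401
```

Both values agree with the Gene Length and Protein Length fields listed in the record.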


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.882916
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCTTTG ATGCAGACGT TATAGTTGTG GGCGGAGGCC TCGCTGGCCT ATCAGCTGGG 
ATAACCGCGA ACAGGGAAGG TCTGTCCACC CTAATCCTGG AAAGAGGTGA GTACTCTGGA
GCAAAGAACG TTAGCGGGGG GAGGATGTAT GTTCACGCAC TTAAGAGCCT AGTTAACCTT
GAGGAGGCTC CCCTTGAGAA GCCCATCGTT AGGGAGACGT ACGAAATTAC CTGCGGGGAG
AAGAGACTCA CCTTCTCTTT TTACGATCCG AACACAAGGA ATAGCTACTC GGTGTTGAGA
GCTAAGTTTG ATCCATGGCT CGCAAAGAAG GCAGAGGAGG AGGGGGTTTT AGTTATGTAT
GAGACCTTGG TTCATGATGC AGTGAGGGAG AATCAGGGTA TCACCGTGAG AACCAGCAGG
GGAGATCTTA GGGCTAAGTT AGTGATTGAG GCAGATGGAG TGACCGCAGG CGTGTCTAGG
TACCTAGGAC TGAGGAGTCT CTCCCCCGAC TCACTCATGC TGGGAGTAAA GGAAGTGATA
AAGCCTGACA GCGTTCCTGA GGAGGGTGAG GCGAGGGTAC TAGTGGGGTA CCTGAACGGC
CTACTTGGAG GTGGTTTCAT GTACGTGAAC AAGGACACTT TGTCGATAGG TGCCACAGTC
AAGGTCAATT CCCTGCAGAA AGAGAGAGTC TTGGCCAGGG ATATAGTGGA GGATCTAAGA
ACCAAACTGG GAGTTGAGGG TGAAATACTT GAGTATTCGG CTCACCTTAT CCCCTACTAT
GGTTATACTA AGCTTCCCCC TCTTACCGCA CCTAACCTGC TCGTTACAGG TGACGCGGCG
GGTTTCCTGA TTAATGACGG ATTTGTCATA AGGGGAATGG ATCTGGCCAT TGGGTCAGGG
ATTGTGGCCG GAAGGGCTGC CAAAAAGATA CTGGATCTAG GGGATCCCTC TAACACGAAA
ATTTATGAGG ATATGCTCCA GGAATCCTTC GTGATGAAGG ACCTTAAGAC AGCCAGCAGG
GCTTTTCAGC TCATGAATAA CGAGAGGTTG TTCAAGGTTT ACCCAGAACT GTTCTGCAGG
GTCCTTTCAA GAATGTTCAC AGTGGAGGGA GAGAGGAAGA CTCTCATGAC CGTCTTCCAA
GAGGAAGTGA AGAGAGATGG TCTCACCCTG ACGCAGGTAG TGAGGGACTT GACGAAGGTG
ATGTGA
 
Protein sequence
MSFDADVIVV GGGLAGLSAG ITANREGLST LILERGEYSG AKNVSGGRMY VHALKSLVNL 
EEAPLEKPIV RETYEITCGE KRLTFSFYDP NTRNSYSVLR AKFDPWLAKK AEEEGVLVMY
ETLVHDAVRE NQGITVRTSR GDLRAKLVIE ADGVTAGVSR YLGLRSLSPD SLMLGVKEVI
KPDSVPEEGE ARVLVGYLNG LLGGGFMYVN KDTLSIGATV KVNSLQKERV LARDIVEDLR
TKLGVEGEIL EYSAHLIPYY GYTKLPPLTA PNLLVTGDAA GFLINDGFVI RGMDLAIGSG
IVAGRAAKKI LDLGDPSNTK IYEDMLQESF VMKDLKTASR AFQLMNNERL FKVYPELFCR
VLSRMFTVEG ERKTLMTVFQ EEVKRDGLTL TQVVRDLTKV M
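
The gene and protein sequences above are related by translation, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch in plain Python, not the method the database itself uses. It assumes the standard codon assignments, which translation table 11 shares (table 11 differs from the standard code only in which start codons it permits). The helper names `clean`, `gc_fraction`, and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGAGCTTTG ATGCAGACGT TATAGTTGTG"))  # MSFDADVIVV
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared against the listed protein sequence and the GC content field.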