Gene Msed_0844 details

Gene Information

Locus tag: Msed_0844
Symbol:
ID: 5105204
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 774495
End bp: 775760
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 54%
IMG OID: 640506749
Product: radical SAM domain-containing protein
Protein accession: YP_001190942
Protein GI: 146303626
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0502] Biotin synthase and related enzymes
TIGRFAM ID:
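
The coordinates and lengths in this record can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Sketch: cross-check the coordinates and lengths listed in the record.
# Assumption: Start bp and End bp are 1-based and inclusive.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


if __name__ == "__main__":
    length = gene_length(774495, 775760)
    print(length, protein_length(length))  # prints: 1266 421
```

Under those assumptions the result agrees with the listed Gene Length (1266 bp) and Protein Length (421 aa).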


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.757233
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTATGC TCCAATATCT CCTAGATGTT ATGGACCGCT ATCCCGACCT ACCTAGGGAG 
GTGGTCCTGA AGGAAAGCAT CCTCCTTCAC GGGATAAGTT TTTCCGACGA GGCCCTAAAG
GCCAGTTACC AGGAGAAAGC CTATTTCCTA TTCACTTTCG ACCCAGATGA CCCTGAGAGA
GTTAAGGCCA GGCGAAAGGT TATACCGCAG GAGATCCGTG TCTCGGGAGG ACCTCTCGGT
TTAAGGCCAA CGGTGATCCA GTCCAGGCAC TCCTCAAGTT CGCCCTTTAA GGTCGAGCTT
CAGGATAGTC TCAAGCTCTT TGTGGGAAAG GAGGCCATAG CCAACGTTGA GTTCGTTCCA
AGGCCTTCCT ATTACGGGAA ATCCCTGAGC AATGGATCTA GGGTGGAGGA GGTCGCCCCA
GCCATCTACT GGGGAGAGAC AGCCGACGTG ACGGCCTATC GTATCTGTGA GTACTGGAAC
GTGAATCAGC AGTGCAAGTT CTGCGATATC AACGAGAACT TCAAGGCGTG GGGTTACATT
AGAAAGGGGA TTGGCGCCGT GGTTCCCAAG GAACTCGTGG CGGAGGCCAT TGAGCTTGCG
TCCAAGGATT CCAACGTGAA GAGGTACCTG ATCACCGGCG GAACTATCAG GGATGGGGTA
AAGGAGGCCG AGTTTTACCT TCAGTACATG AGGCAGGTTG AGGAGAGGGT CTCCTCGCTC
CCTGCGAGGT TGAATACACA GGCCTTGCGG GTCGAGGTCT TGCCCAAGTT TTACGAGGCT
GGAGTTGACT ATTACCATCC AAACCTGGAG GTCTGGGACG AGAGGTTGTT CTCCCTAATT
TCCCCAGGTA AAAGTCAGAA TGTGGGAAGG GATGAGTGGA TAAGGAGGAC AGTTGAGGCA
GTGAGGGTGT TCGGCGTTGG AAACGTGAGT CCAAACTTCG TCGCTGGGAT AGAGATGGCC
TACTCCCCAG ACTTCCCCTA TGGTTTCAGG AGGATTGAGG ACGCGGTCAA GTCCACTGCG
GAGGGAATAG AGTACCTGAT GTCGAGGGAC GTGACCCCGA AGTTTGACAC GTGGGGACTT
GAACCCAGAT CGTGGTTTGG GATTCACGGA GTGTCACTGC CTCCGCTTGA GTATTACCTC
GAACTTTACA GGGTCTACAG GGATTTAAGG AGGGAGTATG GGATGCCATG GCCCAGGGGA
CTGGGTGATC CTGGACCTGG AGTATCCAAG GTACCGGCCT CAGGGTTCAT GGACCTAGAG
GGGTGA
 
Protein sequence
MLMLQYLLDV MDRYPDLPRE VVLKESILLH GISFSDEALK ASYQEKAYFL FTFDPDDPER 
VKARRKVIPQ EIRVSGGPLG LRPTVIQSRH SSSSPFKVEL QDSLKLFVGK EAIANVEFVP
RPSYYGKSLS NGSRVEEVAP AIYWGETADV TAYRICEYWN VNQQCKFCDI NENFKAWGYI
RKGIGAVVPK ELVAEAIELA SKDSNVKRYL ITGGTIRDGV KEAEFYLQYM RQVEERVSSL
PARLNTQALR VEVLPKFYEA GVDYYHPNLE VWDERLFSLI SPGKSQNVGR DEWIRRTVEA
VRVFGVGNVS PNFVAGIEMA YSPDFPYGFR RIEDAVKSTA EGIEYLMSRD VTPKFDTWGL
EPRSWFGIHG VSLPPLEYYL ELYRVYRDLR REYGMPWPRG LGDPGPGVSK VPASGFMDLE
G
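
The protein sequence and GC content can in principle be recomputed from the gene sequence above. The following is a minimal sketch, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons), and it does not handle alternative start codons, which is harmless here because this gene begins with ATG. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Sketch: translate a CDS and compute its GC content.
# Assumption: input is the coding strand, in frame, using A/C/G/T letters.

BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks unknown codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First two display blocks of the gene sequence.
    print(translate("ATGCTTATGC TCCAATATCT"))  # prints: MLMLQY
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared with the protein sequence and GC content listed in this record.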