Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0844 |
Symbol | |
ID | 5105204 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 774495 |
End bp | 775760 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640506749 |
Product | radical SAM domain-containing protein |
Protein accession | YP_001190942 |
Protein GI | 146303626 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0502] Biotin synthase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.757233 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTATGC TCCAATATCT CCTAGATGTT ATGGACCGCT ATCCCGACCT ACCTAGGGAG GTGGTCCTGA AGGAAAGCAT CCTCCTTCAC GGGATAAGTT TTTCCGACGA GGCCCTAAAG GCCAGTTACC AGGAGAAAGC CTATTTCCTA TTCACTTTCG ACCCAGATGA CCCTGAGAGA GTTAAGGCCA GGCGAAAGGT TATACCGCAG GAGATCCGTG TCTCGGGAGG ACCTCTCGGT TTAAGGCCAA CGGTGATCCA GTCCAGGCAC TCCTCAAGTT CGCCCTTTAA GGTCGAGCTT CAGGATAGTC TCAAGCTCTT TGTGGGAAAG GAGGCCATAG CCAACGTTGA GTTCGTTCCA AGGCCTTCCT ATTACGGGAA ATCCCTGAGC AATGGATCTA GGGTGGAGGA GGTCGCCCCA GCCATCTACT GGGGAGAGAC AGCCGACGTG ACGGCCTATC GTATCTGTGA GTACTGGAAC GTGAATCAGC AGTGCAAGTT CTGCGATATC AACGAGAACT TCAAGGCGTG GGGTTACATT AGAAAGGGGA TTGGCGCCGT GGTTCCCAAG GAACTCGTGG CGGAGGCCAT TGAGCTTGCG TCCAAGGATT CCAACGTGAA GAGGTACCTG ATCACCGGCG GAACTATCAG GGATGGGGTA AAGGAGGCCG AGTTTTACCT TCAGTACATG AGGCAGGTTG AGGAGAGGGT CTCCTCGCTC CCTGCGAGGT TGAATACACA GGCCTTGCGG GTCGAGGTCT TGCCCAAGTT TTACGAGGCT GGAGTTGACT ATTACCATCC AAACCTGGAG GTCTGGGACG AGAGGTTGTT CTCCCTAATT TCCCCAGGTA AAAGTCAGAA TGTGGGAAGG GATGAGTGGA TAAGGAGGAC AGTTGAGGCA GTGAGGGTGT TCGGCGTTGG AAACGTGAGT CCAAACTTCG TCGCTGGGAT AGAGATGGCC TACTCCCCAG ACTTCCCCTA TGGTTTCAGG AGGATTGAGG ACGCGGTCAA GTCCACTGCG GAGGGAATAG AGTACCTGAT GTCGAGGGAC GTGACCCCGA AGTTTGACAC GTGGGGACTT GAACCCAGAT CGTGGTTTGG GATTCACGGA GTGTCACTGC CTCCGCTTGA GTATTACCTC GAACTTTACA GGGTCTACAG GGATTTAAGG AGGGAGTATG GGATGCCATG GCCCAGGGGA CTGGGTGATC CTGGACCTGG AGTATCCAAG GTACCGGCCT CAGGGTTCAT GGACCTAGAG GGGTGA
|
Protein sequence | MLMLQYLLDV MDRYPDLPRE VVLKESILLH GISFSDEALK ASYQEKAYFL FTFDPDDPER VKARRKVIPQ EIRVSGGPLG LRPTVIQSRH SSSSPFKVEL QDSLKLFVGK EAIANVEFVP RPSYYGKSLS NGSRVEEVAP AIYWGETADV TAYRICEYWN VNQQCKFCDI NENFKAWGYI RKGIGAVVPK ELVAEAIELA SKDSNVKRYL ITGGTIRDGV KEAEFYLQYM RQVEERVSSL PARLNTQALR VEVLPKFYEA GVDYYHPNLE VWDERLFSLI SPGKSQNVGR DEWIRRTVEA VRVFGVGNVS PNFVAGIEMA YSPDFPYGFR RIEDAVKSTA EGIEYLMSRD VTPKFDTWGL EPRSWFGIHG VSLPPLEYYL ELYRVYRDLR REYGMPWPRG LGDPGPGVSK VPASGFMDLE G
|
| |