Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1559 |
Symbol | |
ID | 5104004 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1515982 |
End bp | 1516881 |
Gene Length | 900 bp |
Protein Length | 299 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640507445 |
Product | radical SAM domain-containing protein |
Protein accession | YP_001191638 |
Protein GI | 146304322 |
COG category | [S] Function unknown |
COG ID | [COG1856] Uncharacterized homolog of biotin synthetase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0415302 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACCCA TATTCTTCTA TGCCCCATCA CTTAAGAGGT ACGAGACTGA TTACCTTAGC TCGGAGGAAG GATGGAAACC CATCTCCGTG ACAGGCACAT CCTGCGCCTT TAGTTGCAAA CATTGCGAAA CTAGGGTCCT TGAGGGAATG GAGGACGGAT CCACGAGAGA GAAGTTCGAG AAAATACTTG AAAGCGTCAG TAAGGCAGGC CACAAGGGCG TAATCCTGTC TGGAGGATCT TCACCAAGGG GGGACGTCCC CGTGTGGAAG TATAGCGACG TCCTCAAGAA GTACCCAAAC GTCACTGTGA TAGCTCACAC TGGAGTTGTG AAATCCAGAG AAATAGCTAA GAGGTTCAGG GATAGCGGGG TAAAGATTGC CTTACTGGAT ATGGTCGGGG ACCAAGAGAC CATAAACGAA GTTTTAGGTC AACCCTTCAC GATTGACGAC TATCTTAACT CCTTTAAGTA CCTTAAGGAG GTAGGAATAA AGATTGCGCC TCACGTAATA GTTGGACTTA GCAAGAGGGG AGTAGAAGGA GATCTTCACG CACTGGAACT CCTGCAGGAA GTTAATCCGG ATGCCGTGAT TGTTGTGGGC CTGATGCCCC TGGTAGGAAC CCCCTACAGG GGAGTCAAGG AACCCTCCCC GGAGGAGTTA GGAAAGGTTC TCATGAGGGC TCGCGAGCTC TTCTCCTCCC CCGTTATGTT AGGCTGTGCC AGACCTAGGG GGAAAGCCTA CCTTGATGTG GAGAAAATGG CAGTAGATCA GGGAATAGAC GGTATGGCTT TCCCCTCGCA AGAGACCATA GAATACGCCA TGGGGAGGAG GGAGGTGGTT CTTAGCCACG CATGTTGTGG CAATGTGATA CATGACTTCC TATTACGGGT GTCCCTATGA
|
Protein sequence | MKPIFFYAPS LKRYETDYLS SEEGWKPISV TGTSCAFSCK HCETRVLEGM EDGSTREKFE KILESVSKAG HKGVILSGGS SPRGDVPVWK YSDVLKKYPN VTVIAHTGVV KSREIAKRFR DSGVKIALLD MVGDQETINE VLGQPFTIDD YLNSFKYLKE VGIKIAPHVI VGLSKRGVEG DLHALELLQE VNPDAVIVVG LMPLVGTPYR GVKEPSPEEL GKVLMRAREL FSSPVMLGCA RPRGKAYLDV EKMAVDQGID GMAFPSQETI EYAMGRREVV LSHACCGNVI HDFLLRVSL
|
| |