Gene Information |
Locus tag | Msed_0511 |
Symbol | |
ID | 5103671 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 467281 |
End bp | 468381 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640506415 |
Product | radical SAM domain-containing protein |
Protein accession | YP_001190610 |
Protein GI | 146303294 |
COG category | [R] General function prediction only |
COG ID | [COG0535] Predicted Fe-S oxidoreductases |
TIGRFAM ID | |
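The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the last codon
    is a stop codon that does not contribute an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # subtract the stop codon
    return gene_len, protein_len


# Values taken from the record above (Start bp, End bp).
print(cds_lengths(467281, 468381))  # (1101, 366)
```

Under these assumptions the result agrees with the listed Gene Length (1101 bp) and Protein Length (366 aa).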
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00485755 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00020138 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGATAAGCA TATCTAGGCT GGTGTCTGAT AGGAAGGAGG AGGCTGACAG GATAAGATAC GCTGGCCTGA AGGACAGGTA TCCCTCAGTA CTAGTGTTCA ACGTAACTAG AAACTGTAAC CTGAGATGTC TCCACTGCTA TTCCGGATCA GGAACTCAAC TATTTCAGGA CCTTCCTCTC TCCACGTGGA TTAATGCGGT GAAGCAGGCA TCAGATATGG GAGTAAAGCA TATCCTTCTC TCAGGAGGGG AACCCTTGGC GAGGAGAGAC CTTCACCTGA TAGCTAGGGA GGCATGGGAG AGGGGAATAA GGGTGGAGCT GTCCACCAAC GGGACCATGT TAACTAGGGA GAGGTTGGAG GAACTCAAGA ATTACGTGGA CTACGTGGGA GTCAGTTTGG ACGGACCAGA GCCCATACAC GATAAATTCA GGGGGGTGGA AGGTGCCTTC GCGAAGGCCT TGAAGGGAAT TAGGACGGCA AAGGAGATAG GTCTAAAGAC GGGACTTAGA TTCACGATCA CGAGGGAGAA TTACGAGTAC GTGGACTTCG TGTTTGACTT GATGAGGAAG GAGGGGATTA ACAGGGTATG CTTTTATCAC CTAGCCTATG CTGGAAGGGC AGACAAGAAA CTAGACGTGG ATAATTTCAC TAGATTGAAG GTAGTGAGTA AGATAGTGGA ATATGCCAAG TCAGGGGAGT GGGAGGTCCT GACAGCGGAT AACCCGGTAG ACGGAGTCCT GGTGTATCAC TTGACGGGAA AGGAGAAGGT CTTAGAGCTA CTCAGGAGAA ATGGAGGAAA CAAGTCAGGT GAGAGGATAG CTGACGTTAA CCCAGAGGGT ACGATTTACC CAGATCAGTT CACTCCAGTG AAGATAGGTG ACATCACAGA CCTGAAAAGA ATATGGGACG AACCACATCC CATGGTGAAG AAACTTAGGG AAAGAAAGTC CTTGGTTAAG TGTTCTTCCT GCAAGTTCTT CGACGTGTGT AACGGCGGGC TAAGGGGAAG AGCTCTGGCG GTCACAGGCG ATATGTGGGA GAAGGATCCG TCGTGCTATC TAGACGAAAT AGAGAAAATA AAGGAGAAAA TAACTTTTTA G
|
Protein sequence | MISISRLVSD RKEEADRIRY AGLKDRYPSV LVFNVTRNCN LRCLHCYSGS GTQLFQDLPL STWINAVKQA SDMGVKHILL SGGEPLARRD LHLIAREAWE RGIRVELSTN GTMLTRERLE ELKNYVDYVG VSLDGPEPIH DKFRGVEGAF AKALKGIRTA KEIGLKTGLR FTITRENYEY VDFVFDLMRK EGINRVCFYH LAYAGRADKK LDVDNFTRLK VVSKIVEYAK SGEWEVLTAD NPVDGVLVYH LTGKEKVLEL LRRNGGNKSG ERIADVNPEG TIYPDQFTPV KIGDITDLKR IWDEPHPMVK KLRERKSLVK CSSCKFFDVC NGGLRGRALA VTGDMWEKDP SCYLDEIEKI KEKITF
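The sequences above are displayed in space-separated blocks of ten characters, so they need to be normalized before any downstream use. The following is a minimal sketch of that step, with a GC-percentage helper that can be compared against the GC content field, and a reverse-complement helper. The helper names (`normalize`, `gc_percent`, `reverse_complement`) are hypothetical. It is also an assumption that the gene sequence is shown in coding orientation (it starts with ATG and ends with TAG), so for this minus-strand gene the reverse complement would correspond to the replicon's forward strand.

```python
def normalize(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = normalize(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def reverse_complement(seq: str) -> str:
    """Reverse complement of an unambiguous DNA sequence (A, C, G, T only)."""
    complement = str.maketrans("ACGT", "TGCA")
    return normalize(seq).translate(complement)[::-1]


# Paste the full "Gene sequence" value here; only the first two blocks
# of the record are used in this illustration.
fragment = "ATGATAAGCA TATCTAGGCT"
print(len(normalize(fragment)), gc_percent(fragment))
```

Running `gc_percent` on the full gene sequence and rounding should allow a comparison with the GC content value listed in the record.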