Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0515 |
Symbol | |
ID | 5103675 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 471423 |
End bp | 472625 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640506419 |
Product | radical SAM domain-containing protein |
Protein accession | YP_001190614 |
Protein GI | 146303298 |
COG category | [R] General function prediction only |
COG ID | [COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0000107878 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000774187 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCCCTAT CCAAGTTCAA TATTTTTATT GATAATATAA TTTTCAACAC ACTTACCGGT TATGCCATGG AACTAGAGCC GTGGGAGATA GAGAAGTTAA AGGGTGGAGA GGTACCTGAT CACCTTAAGG AAGTAGTGGA GGAGGGATTT TCAACTCCTG GCGATTTGGA GAGCGTGTTG GAGCCACTCC TCAACAAGCC TGTCCTCGAA CCTACCCTTC TCCTCACATA CAACTGCAAT TTCAATTGTA CCTACTGTTT TCAGAAGGGA TTCAGGAAAG ATCTCACGGT CACGGAAGAG GTGATGAAGG GTTTCATAAA CTACGTGAGG AAGAGGGAAA GAGGTAGAAA GGTAAGAGTC ACGTTCTTTG GGGGCGAGCC TCTCCTAGAG CTCAAGAAGA TCGAGGAGAT ATCTAGGTCG CTCTCTGATC TGAAGTACTC CTTTAGCGTT GTCACCAATG GTTCCCTCTT GACCAAAAGT GTAACCCAAA GGCTGATATC CCACGGACTT TCGCATGTCC AGATAACCCT GGATGGACCC CCGGAAGTTC ACGATAAGAG AAGGTTTTAT GTAGATGGTA GAGGTTCCTT CAACACGATA ATACAAAACC TGAGAGAGGT TCAGGATCTA GTGAAGGTAG TTTTGAGAAT AAACATAGAC GTGAATAACC TTAACGAGGT ATACACTCTT CTGGCCAAAT TGGTGGAGGA GGGGATAACT AGGATCAGAT TGGATCCTCA CTTCGTACAT ACCAACCTAT TTAGGAACGA ATGGTGGGAA AACGTGATTC CGAAGGACCT GGAATCAGAC GTCCTAGTCA AGTTCTGGGA AAAGGCCAGG GGTTACGGAT TTGAGATTCC CCATGACATC TTTAGACTTG GGATCTGTGC AGCACATATA GACGAAGACA TCGTGGTAGA TCCTGAGGGA AAGGTCTATC CATGTTGGGC TTTCACAGGG AATCCCCTAT ACGTGAAGGG AAGGCTCACG CAGGAAGGTG AGGTGGAGCT ACTGAATCGG TCCCTATCCG GAAGGAAATC CCTCATAATC CACGAGAAAT GTAAGTCATG CCCCTATCTT CCCATGTGTA TGGGAGGGTG TAGGTTCCTC TCAGTCCTTG ACGGAAAAGG ATACCACGGT CTAGATTGCA GGAAGGAAAC TTATGAAAAG CTAGTCAAGC TATTAAAGTT TCTAATGCGG TAA
|
Protein sequence | MALSKFNIFI DNIIFNTLTG YAMELEPWEI EKLKGGEVPD HLKEVVEEGF STPGDLESVL EPLLNKPVLE PTLLLTYNCN FNCTYCFQKG FRKDLTVTEE VMKGFINYVR KRERGRKVRV TFFGGEPLLE LKKIEEISRS LSDLKYSFSV VTNGSLLTKS VTQRLISHGL SHVQITLDGP PEVHDKRRFY VDGRGSFNTI IQNLREVQDL VKVVLRINID VNNLNEVYTL LAKLVEEGIT RIRLDPHFVH TNLFRNEWWE NVIPKDLESD VLVKFWEKAR GYGFEIPHDI FRLGICAAHI DEDIVVDPEG KVYPCWAFTG NPLYVKGRLT QEGEVELLNR SLSGRKSLII HEKCKSCPYL PMCMGGCRFL SVLDGKGYHG LDCRKETYEK LVKLLKFLMR
|
| |