Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mthe_0433 |
Symbol | |
ID | 4462026 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | - |
Start bp | 443297 |
End bp | 444394 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 639699436 |
Product | radical SAM domain-containing protein |
Protein accession | YP_842865 |
Protein GI | 116753747 |
COG category | [R] General function prediction only |
COG ID | [COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.293902 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTAAAC AAGCAGGGGT GACTGACCAG GATGCACAAA TTGATATTGA TGCCAGCGGA ACGCATCCGA GAACACTCAT ACTCTGGGTC ACCACCGACT GTAACTTGCG ATGCGTGTAC TGTTATGCGA ACGGCGGCGA TAATAAGGCA TACATGGGTT GGGATGTGGC AAAGCGTGCC ATAGATCTTG TTGCAGAAGG TGCTGATTGT TTTAAGGTAC AGCTTGCAGG TGGTGAGCCG CTGCTTAATT TTGGTCTGAT CGAGAGGATC GTTTTTTACA TCCACGATCT GGGAGCGGAC GCGAGCATTC AGCTTCAGAC CAATGCAACA CTTATCTCCC CGGCTATCGC CAGCCGTCTC AGAGCTCTTG GTATCGGTGT GGGCGTGAGT CTTGATGGTG TGCCTGCTAT AAATGATCAT CTCCGCCCTT TTGCAGATGG GCATGGTTCC ACTCACTCCG TTATAAATGG AATAAGAAAT CTCCGTGATG CAGGAATCTC TGTGGGAATG ACATCTGTTC TTTCCAGCAC AAGCGTTAAA GGTCTCTCTT CGCTGGTAGA TCTGGCAAGC TACCTGGGCA ATGTGGCAGG CATCTCCCTG GATCCGCTGA GACCTCTCGG ACGCGGTAGT TGTGATATGA TGCCAAGTCC TCACTTAACT GCAGAGCATC TTTATAAAGC GATCAAACGT TCAGAGTATC TTGCTGGATC CGGCGGGAAT CCTGTGAGGT TCAGAGAAGT TGAGCGGATG AGGTGTCTCT TAAAGACTGG AGGGGTTCGA CGGTATCGCT GCTATTTCGA TGCATACCAA TCGCTGATGG TCTGCCCAAA TGGCGATGCG TATCCATGTG CGTCTCTACA CCATCCAGAT TTCCGCCTCG GCAACATATT GGAACCACGT TTTTGGGACA CGATTTCGGA GAGGCTGCGA GATGCCAGGA GATCCATAAA AACACCACAT AAGTGCATGA CATGCTCAGA ACTCTGGCTC TGCGGTGGCC CTTGTCCAGC TGGTGTCTTC AGCAGAGCTG GCGGCGCGGA AATTGAGTGC GCAGTGAAAA GGGTTTTCGT GAAGTACGTG ACCGGCTCTC TGCATTAG
|
Protein sequence | MPKQAGVTDQ DAQIDIDASG THPRTLILWV TTDCNLRCVY CYANGGDNKA YMGWDVAKRA IDLVAEGADC FKVQLAGGEP LLNFGLIERI VFYIHDLGAD ASIQLQTNAT LISPAIASRL RALGIGVGVS LDGVPAINDH LRPFADGHGS THSVINGIRN LRDAGISVGM TSVLSSTSVK GLSSLVDLAS YLGNVAGISL DPLRPLGRGS CDMMPSPHLT AEHLYKAIKR SEYLAGSGGN PVRFREVERM RCLLKTGGVR RYRCYFDAYQ SLMVCPNGDA YPCASLHHPD FRLGNILEPR FWDTISERLR DARRSIKTPH KCMTCSELWL CGGPCPAGVF SRAGGAEIEC AVKRVFVKYV TGSLH
|
| |