Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mthe_1024 |
Symbol | |
ID | 4462773 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | - |
Start bp | 1108026 |
End bp | 1109195 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 639700043 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_843449 |
Protein GI | 116754331 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.299453 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGATCAC AAATCAAGCG GCCAAATGGA TCCCTGCGCC CAGGTGGTAT TATGAGCTCA GGTTATAAAA TGATAATAAT TCTCACGATT TTAATGAGCA CTGCATATGC AGTGGATATA TGTGACAGAT CTGATCTCCG TTTCTCTGAT CTGCGCGGCC GGGATCTCAG CGGCGCAAGT CTCAACCAGT CAGACCTGAC GGGCGCGGAT CTCAGGGGTG CAAACCTCAA CGGAGCCTAT CTGAGATCCG CCTGGCTTGT TAATGCAAAC CTCGAAGGTG CTTCGCTGGC AGGCGCGGAT CTGAGCATGG CGGACCTCAG CGGCGCAAAT CTCAGCGGCA CGGATCTCTC CAGGGCCAAG CTCAGGAACG CGCGGCTTAG TGGTGCAAGT CTGGTAAACG CAAATCTGAC CATGGCGGAC TGCACAGAGG CCCTGATGGA CGATGTCTCT CTTGAGGATG CTGAGATGAC TGGAACCAGG TTCTTTCGCA CAGATCTCAC AGGCGCGGTC TTCTCCGGCG CATCGCTTAG CCATGCGAAC TTCGTCGGCG CTCATCTGAG CTGGGCGGAT ATGAGCAGGA GCCGGTTCAG GGAGAGCCAG TTCTCCAGAG CTGAGCTCTA CGGAGCGAAC CTGACAGGTA CAGATCTCAG CGGCTCCGAC TTCACGCGGT CATACATGAT GAGGGCCAGA ATGACAGGCG CGGATCTGAG TGACGCAAGC CTGGATTATG CAGACCTCAC AGAGGCAGAG CTGAGAGATA CGGACCTAAG CGGCTGCAAG ATGCGCTACG CGGATCTCAG CGGGGCCAAT CTGGCAGGCG CGGATATCTC AGAGGTGGTG CTGGATTCTG TGAAGACGAC AGGTGTAAAC CTCAGCGGAG CAATCCTGTA CAAGACATCG CTCTTCAATC TCGACCTCAG GGACATCGAT ATGCATGGGG TGCAGATCAA AAAGGCGAAG ATGGACACAG TCTTCCTCAC AAACTCGAAC CTCGCAGGGG CGGTGCTGAA TGATGTGACG ATGCACATGG TCGAGATGAC GAACGTGGAT CTGAGCGGGG CGAGCCTGCG CAACATCGAG TACGATGAGT TCACACTGAG ATCGCTCGAG AAGGCCAACC TGGATGGTGC TTCCATGGAC GACCGCCTGA AGTCGGACCT GAGCGGGTGA
|
Protein sequence | MRSQIKRPNG SLRPGGIMSS GYKMIIILTI LMSTAYAVDI CDRSDLRFSD LRGRDLSGAS LNQSDLTGAD LRGANLNGAY LRSAWLVNAN LEGASLAGAD LSMADLSGAN LSGTDLSRAK LRNARLSGAS LVNANLTMAD CTEALMDDVS LEDAEMTGTR FFRTDLTGAV FSGASLSHAN FVGAHLSWAD MSRSRFRESQ FSRAELYGAN LTGTDLSGSD FTRSYMMRAR MTGADLSDAS LDYADLTEAE LRDTDLSGCK MRYADLSGAN LAGADISEVV LDSVKTTGVN LSGAILYKTS LFNLDLRDID MHGVQIKKAK MDTVFLTNSN LAGAVLNDVT MHMVEMTNVD LSGASLRNIE YDEFTLRSLE KANLDGASMD DRLKSDLSG
|
| |