Gene Information |
Locus tag | Mthe_0509 |
Symbol | |
ID | 4463422 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | + |
Start bp | 520069 |
End bp | 521160 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 639699512 |
Product | hypothetical protein |
Protein accession | YP_842940 |
Protein GI | 116753822 |
COG category | [R] General function prediction only |
COG ID | [COG5643] Protein containing a metal-binding domain shared with formylmethanofuran dehydrogenase subunit E |
TIGRFAM ID | |
|
|
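The coordinate, gene length and protein length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 520069
end_bp = 521160
gene_length_bp = 1092
protein_length_aa = 363

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
span = end_bp - start_bp + 1

# Assumption: the CDS includes its stop codon, which encodes no residue,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

All three checks hold for this record: 521160 - 520069 + 1 = 1092, and 1092 / 3 - 1 = 363.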
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00001659 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAGAT ATGTTTTAAG TAGTATCTGC ATCACCCTTG TCCTGATATC ATGCGCGTTT GCAGACGATG CCATGATACA GGAGATTGGT GTCAAAGCTG CAGAGAAGGC GATGAGCGAG TTATCCTTCC AGAAAGGCGA TGAGAACATA CTTGTTTTGA CGAATGCCGG TTATGCGATC GTCTCAGGCA TGACCACCCA GAAGGCGCTG AAGGGCATTA CTGAGACAGC AGGCTGCTCT CATGGCGACG GAAACCTCTT TCAGGTTCTA AGGCCGCATT GGAAGCCGCT GTGGTTCTAC TTCTTCGATA AGAACAGCAA AGAGGCGCTG TATCTGGAAG TGAAGCCGGA GGCGCTCTCG ATGAGTTTGG AGGAGTTGAA AGCTGCGTCG GATGATGCAG TCTTCTCAAA GATCTCAAAG GCAAATGTGG ATCTCGATTA CCTATTGAAT AACACCGATG AAGGTAACAG AACTTTTAAC GAGAAGCTCT TCAACGGGAA CGAGTTCTCG CTTGTCGGCA TCTCGAATGT GTGGGCGAGG AACGCCAGCT TTGACTTCAT TCAGGCTACT TCATTCCATG ATCATCTCTG CCCTGGAGTC ACCAGTGGAT ACATGATCGC GAAATACGTG GAGAGGGAGC TGCCGATAAA CAGCAGTGCC GAGAGCTACA AGGTGATAGC AGTGCCTCCA TGGTGCAAGG ATGACGCACT CCAGATTCTC TGGGATGCGA CGGTCGGGAA GAGCGGCATC TTCGTCATGG CTCTGACGGA CACGGAGAAG AATGCGCTCA AGGCGAAGTA CAACCAGAGC GACGTTGCGG GGATATTCGT GAGATGGAAT GATACCGCAA AGCAGGGGGA TGCGCTGGTT CTGAGCTTCA ACTGGACCAG GATGTACGAG CTCACGGAGA CGAAGGACTG GAAGGGCCCA TCCTGGGCGC CGAAGCTCGT GATGGATGTT CGCATGATGG ACTACTGGGA TGAGCCAGAG ATCGCGGTGA GCGTTATAAA GAGATTCCAG GTTGACCAGA ACATGCTGGC CCAGCTCCAG AACGCTGGCA TGCATCCCCT GAAGGTTGCG GGAGTGATGT GA
|
Protein sequence | MMRYVLSSIC ITLVLISCAF ADDAMIQEIG VKAAEKAMSE LSFQKGDENI LVLTNAGYAI VSGMTTQKAL KGITETAGCS HGDGNLFQVL RPHWKPLWFY FFDKNSKEAL YLEVKPEALS MSLEELKAAS DDAVFSKISK ANVDLDYLLN NTDEGNRTFN EKLFNGNEFS LVGISNVWAR NASFDFIQAT SFHDHLCPGV TSGYMIAKYV ERELPINSSA ESYKVIAVPP WCKDDALQIL WDATVGKSGI FVMALTDTEK NALKAKYNQS DVAGIFVRWN DTAKQGDALV LSFNWTRMYE LTETKDWKGP SWAPKLVMDV RMMDYWDEPE IAVSVIKRFQ VDQNMLAQLQ NAGMHPLKVA GVM
|
| |
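The gene and protein sequences above are printed in blocks of ten characters separated by spaces. The sketch below shows one way to strip that grouping, compute a GC fraction, and translate a nucleotide string for comparison with the listed protein. It assumes the codon-to-amino-acid assignments of translation table 11, which are the same as in the standard genetic code (the two tables differ in their permitted start codons). The helper names `clean`, `gc_fraction` and `translate` are hypothetical, and only the first three blocks and the final twelve bases of the gene are used as demonstration input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0; '*' marks a stop codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence, and its last twelve bases.
head = clean("ATGATGAGAT ATGTTTTAAG TAGTATCTGC")
tail = clean("GGAGTGATGT GA")

print(translate(head))  # MMRYVLSSIC
print(translate(tail))  # GVM*
```

The two translated fragments agree with the first block (MMRYVLSSIC) and the last three residues (GVM) of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.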