Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mthe_1534 |
Symbol | |
ID | 4461721 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | + |
Start bp | 1661695 |
End bp | 1663620 |
Gene Length | 1926 bp |
Protein Length | 641 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 639700557 |
Product | hypothetical protein |
Protein accession | YP_843946 |
Protein GI | 116754828 |
COG category | [K] Transcription |
COG ID | [COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.101346 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAAG CAATGAGCAA TGTGGATGTG GCTGCGATCG TGGCTGAGCT TCAAACCCGG ATAGCCGGGG GATTTTTCGG CAAGGCCTAC CAGAGCTCGG GAGATGCCAT ATGGCTCACA ATCCAGGCAC GCGAGGGGAG GCTGGATATC ATCCTGGAGG CGGGAAGAAG GGCGCACGTC ACCAGGAAAG AGAGAGTGGT GGGCAGGACT CCTCCGCAGT TCCCTGCGAT GCTCCGCTCA CGGCTCTCTG GCGGGAGGAT CGTGAGCGTT GAGCAACACG ACTTCGACAG GGTCATGGAG ATCTGCGTTG AGAGATCTGA TGGAAGGTAT CGGCTGGTGG TTGAGCTCTT CCCGAAGGGG AACATGCTGC TTCTCGATGA TGAAATGAGG ATCATACTCC CGCTCAGGCC GATGAGCTTC AGGGACAGGA AGCTCATCGC TGGCGAGCAG TACGTATACC ATGCTGGAGC AGAGGATCCG AGAAGCGTAT CGATTGAGCG GCTGGAGGCG ATCCTCCACA GCTCAGACGC AGATCTCGTG AGAACACTTG TTCGCAACCT CAATATGGGC GGGACGTATG GGGAGGAGGT CTGCCTCAGG GCCGGGGTGG ATAAGAACAC CCCTGCAACC GCTCTATCAG AGGAGGAGAT CGCGAGGGTG CATTCTGCTC TGAGAGATGT CTTCGATATC AGAGAGATCA GGCCTCAGAT AGTTTACAGG GACGGGGAGC CGTTTGATGT GATCCCCTTC CCGCTGGAGG TCTACAAGGG GCTCGAGGCG AGATCCTTCG AGAGGTTCAG CGATGCACTT GATGAGTTCT TCGTGGCAGA GCCGGAGATG CCCAAACTCA GCGCTCTTGA GAGAAGGCTG GAGCTCCAGA GGGCTGCGAT CGATGAGCTC AGGGCCAAGG AGACTCAGCT GGCCTCGATG GGCGATTTCA TTTACCAGAG GTATTCTGAG ATCGACTCGA TACTGAAGGC GATAGCAGGT GCGAGGGAGA GAGGGTTATC ATACACAGAC ATCTGGGAGA GGATACAGAG CTCGGGAAAG TCTGCTGTAA AATCCCTGGA TTACAGCGGC GAGATGATAG TAGAGATTGA TGGGGTCACT CTGGAGCTGA ATGCAGGTCT CACAGTGCCT CAGAACGCAG GGAGGTACTA TGAGCGTGCT AAAGAGGCGG CGAAGAAGGC CGCAGGAGCA GAGGAGGCTC TGAGGAGGAC AGAAGATCTC CTTCAGCGTG GAGAGGAGCG AAGAAGGTCT CCAGTTTTGA AACGAAGACA TAAACCGAGG TGGTTCGAGA GGTTCAGGTG GTTCTACTCC TCAGATGATT TCCTTGTCAT AGGCGGGAGG GATGCGGATG GGAACGAGGA GATATACCTC AAGTACCTGG AGAAGAGGGA TCTCGCGCTC CACACAGACT ACCCCGGAGC GCCGCTGACG GTGATAAAGA CGGAGGGGCG GGAGGTGCCG GAGAGGACTG TGGAGGAGGC AGCGCAGTTC GCCGTCAGCT ACAGCAACCT CTGGAGGGAG GGCGTGGCCT CGGGCGACTG TTATGTGGTC AGAGGTGATC AGGTCACCAA GACCCCAGAG CACGGCGAGT TTCTGAGAAA GGGAGCGTTC GTCGTCCGAG GGGAGCGCAG GTATCTCAGA GATGTCCCTC TGGGGGTTGC ACTTGCGATC GCCGATGGAT CCTTGATAGG TGGGCCGGTC TCAGCAGTGA GATCTAAAAG TTCGGAGGCG ATCGAGCTCG AGCCGGGCGA GTACATGCCA GACGATCTCG CGAAGATGAT ATACAGGCAG CTCCTCGAGA TCTGCGAGGA CAGGAGGTAC CTGAAGGCGA TCGCGTCTCC TGATAAGATC GTGGCATTCC TGCCTCCCGG CGGATCCAGG ATCAGGCGGT TTGATGTGAG GAAGATCGGC GTCTAG
|
Protein sequence | MKKAMSNVDV AAIVAELQTR IAGGFFGKAY QSSGDAIWLT IQAREGRLDI ILEAGRRAHV TRKERVVGRT PPQFPAMLRS RLSGGRIVSV EQHDFDRVME ICVERSDGRY RLVVELFPKG NMLLLDDEMR IILPLRPMSF RDRKLIAGEQ YVYHAGAEDP RSVSIERLEA ILHSSDADLV RTLVRNLNMG GTYGEEVCLR AGVDKNTPAT ALSEEEIARV HSALRDVFDI REIRPQIVYR DGEPFDVIPF PLEVYKGLEA RSFERFSDAL DEFFVAEPEM PKLSALERRL ELQRAAIDEL RAKETQLASM GDFIYQRYSE IDSILKAIAG ARERGLSYTD IWERIQSSGK SAVKSLDYSG EMIVEIDGVT LELNAGLTVP QNAGRYYERA KEAAKKAAGA EEALRRTEDL LQRGEERRRS PVLKRRHKPR WFERFRWFYS SDDFLVIGGR DADGNEEIYL KYLEKRDLAL HTDYPGAPLT VIKTEGREVP ERTVEEAAQF AVSYSNLWRE GVASGDCYVV RGDQVTKTPE HGEFLRKGAF VVRGERRYLR DVPLGVALAI ADGSLIGGPV SAVRSKSSEA IELEPGEYMP DDLAKMIYRQ LLEICEDRRY LKAIASPDKI VAFLPPGGSR IRRFDVRKIG V
|
| |