Gene Information |
Locus tag | Mthe_1697 |
Symbol | |
ID | 4463449 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | - |
Start bp | 1852108 |
End bp | 1854117 |
Gene Length | 2010 bp |
Protein Length | 669 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 639700715 |
Product | hypothetical protein |
Protein accession | YP_844103 |
Protein GI | 116754985 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1331] Highly conserved protein containing a thioredoxin domain |
TIGRFAM ID | |
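The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Neither convention is stated in the record, although both agree with the values it reports. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Mthe_1697.
start_bp, end_bp = 1852108, 1854117

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                           # 2010
print(expected_protein_length(length_bp))  # 669
```

Both results match the Gene Length (2010 bp) and Protein Length (669 aa) fields, which suggests the record's fields are internally consistent under these assumptions.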
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATCGCA AGCCCAACCG TCTTGCTGGA GAGTCAAGTC CTTACCTCCT CCAGCATGCA TACAACCCCG TGGACTGGTA TCCCTGGTCT CCTGAGGCCT TCGAGCGTGC CAGAGCTGAG GACAGGCCGA TATTTCTCTC CATAGGGTAC TCCACGTGCC ACTGGTGCCA CGTGATGGCA AGGGAGTCTT TTGAGGATGA GAGGATCGCT GAGATGCTGA ACAGGGCATT TGTATGCGTG AAGGTGGACA GGGAGGAGAG GCCGGATATC GACGCCATTT ACATGGAGGC CTGCCAGATC ATCACCGGCA GGGGCGGATG GCCTCTGACG ATCATAATGT CCCCGGATGG CATTCCCTTT TTTGCAGCCA CATATATCCC GAAGGATGGG CGGCTCGGTA TGATGGGGCT GAGGGAGCTC ATACCGCTGG TGGAGGAGCT CTGGAGAAAT CGGAGATCCG AGCTCACATC TCTCGGATTC AAGGTGCTGA ACGCCATGCG AAAGGCTGAT ACGCATCTCC AGGCGTCGAA TGCGGATGAG AGTACTCTGA GCAGAGCGTA CCTCGAGCTT TCTGGGATCT TCGACTGGAC CAGCGGAGGG TTTGGGAGAG CCCCGAAGTT CCCCCTGGCA CAGAATCTGC TGTTCCTGCT CAGGTACTGG CACAGAACAG GGGAGATGAA AGCTCTGGAG ATGGTGGAGC TGACTCTCAG GGAGATGCGA TGCGGCGGCA TATACGACCA GCTCGCATAT GGTTTCCACA GGTACTCAAC TGATTCGAGC TGGGGCGTGC CCCACTTCGA GAAGATGCTC TACGACCAGG CGCTGATGTC TGTGGTTTAT CTTGAGGCGT ATCAGGCCAC AGGAAAGAGG GATTACGCGA TTGTGGCAGA TGAGATACTC GGTTTCGTTG CTGAGGATCT GAGATCACCC GATGGCGCTT TCTGCTCAGC GCTGGACGCA GAGAGCGATA ACATCGAGGG AGGATATTAT CTCTGGACAA TGGATCAGCT TCGAGATGCT CTTGGTGATG ATCTGAAAAA AGCGCTGGAG GTGTTCGTCC TCGAGCCAAT CGGCGGGAGC GATGGAAAGA ACGTCCTCAG GATCTCGCTG AAAGGCGAAT TGAGCGAGTT CAAGCACACC AGCGAGCCCA TAAGAAGAAA ACTTCTGGAT GCAAGATCTC TGAGGAGAAA ACCCTTCAGA GATGAGAAGG TGCTTGCGGA CTGGAATGGA TTGATGATAG CGGCATTCTC CAGAGGTGCC CAGGTTCTGG GAGATGAGAG ATGGCTCCGC ATAGCATCCG AGGCAGCGGA TTTCGTGCTG TCGAGCATGC ACAGAGACGG CATGCTGATG CATTCCTATA AGGGAAGCAG GGTGTCGATT CTGGACGATT ACGCCTTCCT CATCTTTGGG CTGATAGAGC TTTACCAGGC CGGGTTCGAC GGAAGGTATC TGGAGAGAGC TGAGATCTTG TGTGATGAGA TGGTCTCCCA TTTCTCAGAT CCGGATGGAG GGTTCTATTA CACGATGAAG GAGCAGAGTG ACATCATCCT GCAAAGAAAG GAGATCCGCG ATGGTGCGAT CCCATCAGGC TATTCGATGG CCACCATGGA CATGCTCCTG CTCGGGAAAA TCCTCGGCAG GCCGGATCTG GAGGAGATCG CTTCGATGAG CCTCAGGCAC ATCAGCATGG CCTCCCTGCC TGCGCAGGTG GGACTCCTGA TCGCTCTCGA TCTCGCGCTC GGCCCGTCGC ATGAGATCGC AATCGTGGGC GATGCGGATA ATACAAGAAC TATGCTGCGC 
GCGCTCTGGT CCGTCTACGC ACCGAGAAAG GTCGTGGTAT CAGGTGATAG ACCTCCGGAG TGGGCATCTT CTCTGAGACC TGTGGATAAA AAGGCCACCG CGTATGTCTG CAGCAGATAC ACATGCAGCT TTCCGGCAAC AGATATAAGA AGTATGATCG AGCTTCTCGA TGTTAGAGAA CTCAGAAGTT CTGAGAATGC ATCAGGGTAA
|
Protein sequence | MDRKPNRLAG ESSPYLLQHA YNPVDWYPWS PEAFERARAE DRPIFLSIGY STCHWCHVMA RESFEDERIA EMLNRAFVCV KVDREERPDI DAIYMEACQI ITGRGGWPLT IIMSPDGIPF FAATYIPKDG RLGMMGLREL IPLVEELWRN RRSELTSLGF KVLNAMRKAD THLQASNADE STLSRAYLEL SGIFDWTSGG FGRAPKFPLA QNLLFLLRYW HRTGEMKALE MVELTLREMR CGGIYDQLAY GFHRYSTDSS WGVPHFEKML YDQALMSVVY LEAYQATGKR DYAIVADEIL GFVAEDLRSP DGAFCSALDA ESDNIEGGYY LWTMDQLRDA LGDDLKKALE VFVLEPIGGS DGKNVLRISL KGELSEFKHT SEPIRRKLLD ARSLRRKPFR DEKVLADWNG LMIAAFSRGA QVLGDERWLR IASEAADFVL SSMHRDGMLM HSYKGSRVSI LDDYAFLIFG LIELYQAGFD GRYLERAEIL CDEMVSHFSD PDGGFYYTMK EQSDIILQRK EIRDGAIPSG YSMATMDMLL LGKILGRPDL EEIASMSLRH ISMASLPAQV GLLIALDLAL GPSHEIAIVG DADNTRTMLR ALWSVYAPRK VVVSGDRPPE WASSLRPVDK KATAYVCSRY TCSFPATDIR SMIELLDVRE LRSSENASG
|
| |
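The sequences above are displayed in space-separated blocks of 10 characters, so they need to be cleaned before any downstream use. The following is a minimal sketch of how such a display string could be normalized and its GC fraction computed. The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(display_text: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase."""
    return "".join(display_text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of bases that are G or C; raises on an empty sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# First two 10-base blocks of the gene sequence, as displayed in the record.
sample = "ATGGATCGCA AGCCCAACCG"

sequence = clean_sequence(sample)
print(len(sequence))          # 20
print(gc_fraction(sequence))  # 0.6
```

Applied to the complete gene sequence, the same helpers could be used to compare the result against the GC content field, keeping in mind that the record reports it as a rounded percentage.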