Gene Information |
Locus tag | Mthe_1503 |
Symbol | |
ID | 4462894 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | + |
Start bp | 1629712 |
End bp | 1631661 |
Gene Length | 1950 bp |
Protein Length | 649 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 639700526 |
Product | glycoside hydrolase 15-related |
Protein accession | YP_843915 |
Protein GI | 116754797 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3387] Glucoamylase and related glycosyl hydrolases |
TIGRFAM ID | |
|
|
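The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The record states neither assumption, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table for Mthe_1503.
start_bp = 1629712
end_bp = 1631661

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)     # 1950
print(protein_length_aa)  # 649
```

Under these assumptions the computed values agree with the reported 1950 bp and 649 aa.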
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.227839 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGCCGC TGCTGTTCGG AAATGGGAGA TTACTGATAT GCGAGGATGA ACTGGGGATC ATCCGGGACA TTTACTATCC CTATGTGGGT CTCGAAAATC ACTGTAACAT GATACGTGCA GGGATTTACG ACGCGAATCT TCACGTATTC AGCTGGCTTG ATGGATGGGG GATTCAGCAG AGATACAAAT CATCTTTCGA GGAGAACTTC TTCGAGACTC TGGAGGATCT CAAAACGGAT GACCGCGAAA ATATGGAGAT TGGCTTTTGT GCATCGAACA TCGGTGAAAC CATCTTCGAA AACCAGTCGT TGGGATTGAG GGTGAGAGTG CTGGATGCGG TACATCCATC ATCAAATTAT TTTTATAGAT TGTTTGATGT GATGAACATC TCGCCATCTT CCAGGGACAT CGGTCTCTTC TCAAACCAGA ATTACAACAT ACTGGAGAAT AAGATCGGTG AGACAGCATT CGTAGATGGC GATATGTTGA TTCACTACAA ACGTGATCGG TACTTTCTTC ACAGCAGCTA TCCGCAGTTC GATCAATATG CTGTAGGGGT TGCTGAGTGG AAGGGAATGC AAGGTACTTG GAAGGACATG GAAGAGGATG GAATGCTGAG CTGCAATGCA GTAGCGCATG GATCCATCGA CTCGACTATC AGCTGGAGAA TATCAGGCAT CAGGCCCGGG GAGTCAAGAC GCATACACAT GTGGATCGTT GTTGGTAGGG GGCATCGCCA GGTGGTTGAT ATTCACAGAA AATTGAAAGA GAATGGACCG GCCAACGTCT ACCGCATCAG CTTCAATTTC TGGAAAGCGT TTATCGAGCA TGTCGATGCT CTTCCAGAGT GCAGGAACCT CCGCGAGCTG CCGGAGAAGG TACAGGATGC ATTTTACAGA AGCCTCATGG CTACGGTTGC TCATATGGAT GTGAATGGAT CGATTATCGC CTCCTGCGAT TCGGAGATAA AGCAATTCGG AGCGGATCTT TACACCTATT GCTGGCCAAG AGATGCCTCG TGGGCATGTA TAGCGCTGGA TAGGGCCAGA TATCACCATC TCAGCAAAGA AATTATAGAT TTTTTATCTA AAATAATAAC TGTGGACGGT TATTTTCTCC ATAAATACAC ACCTGCTGGT GATTTCGGTA GTACATGGCA TCCAGTGCCC ATGATTCAGC TCGATGAGAC GGGCCTGCCT CTTTATGCTC TGTACCACAA CTGGCTCATG TCCAAGGATG TCTGGATAAT AGGGCGTTAC TTTTCATCGC TGGTGAGGCC AGCCGCAGAA TTCCTCGTCG GCTCAATCGA CAGAACGACA AATCTGCCCG CTGAGAGCTT TGACCTGTGG GAGGAAAGGA GAGGGTCACA TGCATACACT GCGGCTGTTG TCCATGCTGG CCTCCGCGGA GCCTCAGAGA TTGCGAGGAT TTTGGGTAAC GAGAAGTTTC ACTCGAGGTG GTCGCAAGCT GCAGAGCTCA TCAGGCATGC TGCGCTGGAT CTTTATGATG ACAGCATCAT GCATTTCAGG CGCTCCCCTT CGGACTCCAC ACTTGATGCA TCGGTTTTCA GCATCTGGTA CTTTGAATTG CTGCCGGCAA ACGATCCAAG AGTGGTCAAT ACGATGCGGG CTATTGAGAG AGAGCTTACC CGCCCGTCAG GAGGCGTTGC CCGTTATATG CACGATACCT ATCATGGCTA CATGAACAGC TGGATCATAT GTACGCTTTG GCTGGCACAA TGGCATATCG CAGTGGGAAG CCTGGATCGT GCCCTGGAAC TCATAAAATG GTGTTCGGAT 
CACACATTCT CTACAGGCTT GATGCCTGAG CAGGTCAGTG ATGATAACAC TTTCAGGTCG GTTCTTCCGC TGATGTGGTC TCACTGCACA TTCGTTTTGG CAGTCCTGGA ATATCTCAGG GCTGTTTCGG ATAATCAGCG TAAAGAATAG
|
Protein sequence | MRPLLFGNGR LLICEDELGI IRDIYYPYVG LENHCNMIRA GIYDANLHVF SWLDGWGIQQ RYKSSFEENF FETLEDLKTD DRENMEIGFC ASNIGETIFE NQSLGLRVRV LDAVHPSSNY FYRLFDVMNI SPSSRDIGLF SNQNYNILEN KIGETAFVDG DMLIHYKRDR YFLHSSYPQF DQYAVGVAEW KGMQGTWKDM EEDGMLSCNA VAHGSIDSTI SWRISGIRPG ESRRIHMWIV VGRGHRQVVD IHRKLKENGP ANVYRISFNF WKAFIEHVDA LPECRNLREL PEKVQDAFYR SLMATVAHMD VNGSIIASCD SEIKQFGADL YTYCWPRDAS WACIALDRAR YHHLSKEIID FLSKIITVDG YFLHKYTPAG DFGSTWHPVP MIQLDETGLP LYALYHNWLM SKDVWIIGRY FSSLVRPAAE FLVGSIDRTT NLPAESFDLW EERRGSHAYT AAVVHAGLRG ASEIARILGN EKFHSRWSQA AELIRHAALD LYDDSIMHFR RSPSDSTLDA SVFSIWYFEL LPANDPRVVN TMRAIERELT RPSGGVARYM HDTYHGYMNS WIICTLWLAQ WHIAVGSLDR ALELIKWCSD HTFSTGLMPE QVSDDNTFRS VLPLMWSHCT FVLAVLEYLR AVSDNQRKE
|
| |
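The relationship between the gene sequence and the protein sequence in the Sequence section can be spot-checked by translating the first and last 30 nucleotides. The following is a minimal sketch, not the database's own method. It uses the amino-acid assignments that NCBI translation tables 1 and 11 share, and it ignores the alternative start codons that table 11 allows, because this gene begins with ATG. The names `translate`, `gene_start`, and `gene_end` are hypothetical.

```python
from itertools import product

# Amino-acid assignments shared by NCBI translation tables 1 and 11,
# listed in TCAG codon order ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base spacing used in the record
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First and last 30 nucleotides of the gene sequence, copied from the record.
gene_start = "ATGAGGCCGC TGCTGTTCGG AAATGGGAGA"
gene_end = "GCTGTTTCGG ATAATCAGCG TAAAGAATAG"

print(translate(gene_start))  # MRPLLFGNGR
print(translate(gene_end))    # AVSDNQRKE
```

Both results match the corresponding ends of the protein sequence in the record: it starts with MRPLLFGNGR and ends with AVSDNQRKE, with the final TAG codon acting as the stop.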