Gene Mthe_1503 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_1503 
Symbol 
ID4462894 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp1629712 
End bp1631661 
Gene Length1950 bp 
Protein Length649 aa 
Translation table11 
GC content49% 
IMG OID639700526 
Productglycoside hydrolase 15-related 
Protein accessionYP_843915 
Protein GI116754797 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3387] Glucoamylase and related glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.227839 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGCCGC TGCTGTTCGG AAATGGGAGA TTACTGATAT GCGAGGATGA ACTGGGGATC 
ATCCGGGACA TTTACTATCC CTATGTGGGT CTCGAAAATC ACTGTAACAT GATACGTGCA
GGGATTTACG ACGCGAATCT TCACGTATTC AGCTGGCTTG ATGGATGGGG GATTCAGCAG
AGATACAAAT CATCTTTCGA GGAGAACTTC TTCGAGACTC TGGAGGATCT CAAAACGGAT
GACCGCGAAA ATATGGAGAT TGGCTTTTGT GCATCGAACA TCGGTGAAAC CATCTTCGAA
AACCAGTCGT TGGGATTGAG GGTGAGAGTG CTGGATGCGG TACATCCATC ATCAAATTAT
TTTTATAGAT TGTTTGATGT GATGAACATC TCGCCATCTT CCAGGGACAT CGGTCTCTTC
TCAAACCAGA ATTACAACAT ACTGGAGAAT AAGATCGGTG AGACAGCATT CGTAGATGGC
GATATGTTGA TTCACTACAA ACGTGATCGG TACTTTCTTC ACAGCAGCTA TCCGCAGTTC
GATCAATATG CTGTAGGGGT TGCTGAGTGG AAGGGAATGC AAGGTACTTG GAAGGACATG
GAAGAGGATG GAATGCTGAG CTGCAATGCA GTAGCGCATG GATCCATCGA CTCGACTATC
AGCTGGAGAA TATCAGGCAT CAGGCCCGGG GAGTCAAGAC GCATACACAT GTGGATCGTT
GTTGGTAGGG GGCATCGCCA GGTGGTTGAT ATTCACAGAA AATTGAAAGA GAATGGACCG
GCCAACGTCT ACCGCATCAG CTTCAATTTC TGGAAAGCGT TTATCGAGCA TGTCGATGCT
CTTCCAGAGT GCAGGAACCT CCGCGAGCTG CCGGAGAAGG TACAGGATGC ATTTTACAGA
AGCCTCATGG CTACGGTTGC TCATATGGAT GTGAATGGAT CGATTATCGC CTCCTGCGAT
TCGGAGATAA AGCAATTCGG AGCGGATCTT TACACCTATT GCTGGCCAAG AGATGCCTCG
TGGGCATGTA TAGCGCTGGA TAGGGCCAGA TATCACCATC TCAGCAAAGA AATTATAGAT
TTTTTATCTA AAATAATAAC TGTGGACGGT TATTTTCTCC ATAAATACAC ACCTGCTGGT
GATTTCGGTA GTACATGGCA TCCAGTGCCC ATGATTCAGC TCGATGAGAC GGGCCTGCCT
CTTTATGCTC TGTACCACAA CTGGCTCATG TCCAAGGATG TCTGGATAAT AGGGCGTTAC
TTTTCATCGC TGGTGAGGCC AGCCGCAGAA TTCCTCGTCG GCTCAATCGA CAGAACGACA
AATCTGCCCG CTGAGAGCTT TGACCTGTGG GAGGAAAGGA GAGGGTCACA TGCATACACT
GCGGCTGTTG TCCATGCTGG CCTCCGCGGA GCCTCAGAGA TTGCGAGGAT TTTGGGTAAC
GAGAAGTTTC ACTCGAGGTG GTCGCAAGCT GCAGAGCTCA TCAGGCATGC TGCGCTGGAT
CTTTATGATG ACAGCATCAT GCATTTCAGG CGCTCCCCTT CGGACTCCAC ACTTGATGCA
TCGGTTTTCA GCATCTGGTA CTTTGAATTG CTGCCGGCAA ACGATCCAAG AGTGGTCAAT
ACGATGCGGG CTATTGAGAG AGAGCTTACC CGCCCGTCAG GAGGCGTTGC CCGTTATATG
CACGATACCT ATCATGGCTA CATGAACAGC TGGATCATAT GTACGCTTTG GCTGGCACAA
TGGCATATCG CAGTGGGAAG CCTGGATCGT GCCCTGGAAC TCATAAAATG GTGTTCGGAT
CACACATTCT CTACAGGCTT GATGCCTGAG CAGGTCAGTG ATGATAACAC TTTCAGGTCG
GTTCTTCCGC TGATGTGGTC TCACTGCACA TTCGTTTTGG CAGTCCTGGA ATATCTCAGG
GCTGTTTCGG ATAATCAGCG TAAAGAATAG
 
Protein sequence
MRPLLFGNGR LLICEDELGI IRDIYYPYVG LENHCNMIRA GIYDANLHVF SWLDGWGIQQ 
RYKSSFEENF FETLEDLKTD DRENMEIGFC ASNIGETIFE NQSLGLRVRV LDAVHPSSNY
FYRLFDVMNI SPSSRDIGLF SNQNYNILEN KIGETAFVDG DMLIHYKRDR YFLHSSYPQF
DQYAVGVAEW KGMQGTWKDM EEDGMLSCNA VAHGSIDSTI SWRISGIRPG ESRRIHMWIV
VGRGHRQVVD IHRKLKENGP ANVYRISFNF WKAFIEHVDA LPECRNLREL PEKVQDAFYR
SLMATVAHMD VNGSIIASCD SEIKQFGADL YTYCWPRDAS WACIALDRAR YHHLSKEIID
FLSKIITVDG YFLHKYTPAG DFGSTWHPVP MIQLDETGLP LYALYHNWLM SKDVWIIGRY
FSSLVRPAAE FLVGSIDRTT NLPAESFDLW EERRGSHAYT AAVVHAGLRG ASEIARILGN
EKFHSRWSQA AELIRHAALD LYDDSIMHFR RSPSDSTLDA SVFSIWYFEL LPANDPRVVN
TMRAIERELT RPSGGVARYM HDTYHGYMNS WIICTLWLAQ WHIAVGSLDR ALELIKWCSD
HTFSTGLMPE QVSDDNTFRS VLPLMWSHCT FVLAVLEYLR AVSDNQRKE