Gene Mthe_1040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_1040 
Symbol 
ID4463108 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp1124812 
End bp1125813 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content50% 
IMG OID639700058 
Productdienelactone hydrolase 
Protein accessionYP_843464 
Protein GI116754346 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGAAG AAAAAGTTAT GGTTGAAAAG CCTGTAACCT TTTACAGCGG ATCCTCTCGG 
CTTGCTGGGG TCCTCAGATA CCCATCTGTG ATTAAAGACC CTGCACCTGC GGTCCTTCTG
ATCCACGGAT CACTTGAGCA GGACAGGGAC GGAAATCTAT TAAACAGACC GGATGGAAGA
CCAATATTCA AAAAGAACTT CTTCCTGGAG ATATCGAAGA GGTTATCAGC GGAGGGATTC
GCAACATTCT CATGGGACAG AAGAGGCTTT GGAGAGAGCG AGTCTTCTAT CCGTGATGGC
GGGTACCTTC AGGATGGAAT AGATGCGATG GCCGCCTATC AGGCTCTCTC CTCCCTCGAT
CTCGTAGATC CTGAAAGAGT CGCGGTCCTG GGTCAGAGCG CTGGAGTTTA TACAGCATGT
CTCCTGGCTG AAAAGGAGAG TAGACCGAAA GCGTACATTC TCCAAGGTGG TCTTTACAGG
GATTATGAGG AGATGATGAT CTTCAACTAC CTAAGGGTAG TGGATTACGC CTCAAAGAGC
CCTGAGAACC TCAGATGGGT GGAAGAGAAC GACCCACTTG GCCTGGTAAT TGGACTGAAC
CTCTACACGC TGATGGAGAG GGCGAGGATG GGCGAGGTCG AACACCAGTT CAGCTATAAG
GGAAGAACGT GGAGGATTTG GCACGACCCG ATCTGCTATT TACCGGAACA CGCTCCGAGG
AACCTGTTCA AGTACATACA AAAGCCCACT CTTGTAATCC ATGGGGCCTG CGATCTGAAT
GTTCCTGTTG AGGATGCCTT CATGATCGAG CGGGATTTGA AAAAGCACGG CAACGAGAAT
GTGGAGCTGG CCATTATCCC AGATGCGGAC CACAGCTTCC AGCAGATCGC AGAGTCATAC
GATCTCACAC TCAGAGAGAG AATGAGTCTT GAGAGCTTTC GACGTCCATA TCGAGAGGAT
TACTTTATGG CAGTAATCTC TTTTCTCAAG AGGTGGCTTT GA
 
Protein sequence
MSEEKVMVEK PVTFYSGSSR LAGVLRYPSV IKDPAPAVLL IHGSLEQDRD GNLLNRPDGR 
PIFKKNFFLE ISKRLSAEGF ATFSWDRRGF GESESSIRDG GYLQDGIDAM AAYQALSSLD
LVDPERVAVL GQSAGVYTAC LLAEKESRPK AYILQGGLYR DYEEMMIFNY LRVVDYASKS
PENLRWVEEN DPLGLVIGLN LYTLMERARM GEVEHQFSYK GRTWRIWHDP ICYLPEHAPR
NLFKYIQKPT LVIHGACDLN VPVEDAFMIE RDLKKHGNEN VELAIIPDAD HSFQQIAESY
DLTLRERMSL ESFRRPYRED YFMAVISFLK RWL