Gene Mthe_0940 details

Gene Information

Locus tag: Mthe_0940
Symbol:
ID: 4463333
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosaeta thermophila PT
Kingdom: Archaea
Replicon accession: NC_008553
Strand:
Start bp: 1026993
End bp: 1028246
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 57%
IMG OID: 639699960
Product: UbiD family decarboxylase
Protein accession: YP_843368
Protein GI: 116754250
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases
TIGRFAM ID: [TIGR00148] UbiD family decarboxylases
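
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1026993
end_bp = 1028246

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is assumed to be a stop codon
# that is not counted in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # gene length implied by the coordinates
print(protein_length, "aa")   # protein length implied by the gene length
```

Under these assumptions the coordinates reproduce the listed gene length of 1254 bp and protein length of 417 aa.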


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTTTCA GATCTTTCCT AGAGGATCTC CGCGCGGATG GCGTTCTGGA GGAGACCCAT 
GAGCATGTAT CCACGGAGTA TGAGCTGGCC ATGCGCGCAA CCGGAAGGGG ACCGATGCTC
TTCCACAATG CGGATGGCCA TGTCTGCTGC ATAAACATAC TCGGGAGCAG GGAGCTTCTC
GCCAGGGCCC TGAGGATGGA TGCCAGGAAC CTCGCGCGCG ATCTCTCAGC TGTGGGCTTT
GATGGCCATG TCAGAGAGGT CGACTCATCT CAGTTCCAGG AAAACATCCT GGAGCCGGAT
CTGATGCGAC TTCCGGTGCT GAGGCATTTC AGAGGGGATG GCGGGCGGTA CATAACATCA
GGCATTGTCG TCTCCAGGCT GGATGACAGG ATCAATGCAT GCGTTCACAG GCTCATGGTT
CTCGACAGGA ATAGGCTGGC CGCCAGGCTC GTCCCGGGAA GGCACACGCA TCAGATGTAC
TCCAGAGCCA TCGAAACCGG GAGGAGGTTG CCTGTTGCGA TCGCCATCGG CGTGGATCCG
GTGGTTCTCA TAGCTGCTTC AACAAGAGTG CCTGAGAACA AGGAGTTTGA GTATGCATCC
GCTCTCAGAG GGGATGTTGT TGAGGTTGTG ACCCTTGAGA ATGGCGTCCC GGTTCCGCAT
GCTGAGATCG TTCTGGAGGG ATACCTGACG GAGAAGAGGG CTCCGGAGGG GCCGTTTGTG
GACATCACCG GCACGATGGA TATCGTGAGG GAGGAGCCTG TCATAGAGAT CACCAGGATC
ATGATGAGGG ATGACGCGAT CTATCATGCA CTTCTTCCCG CCGGAGGGGA GCACAGGATG
CTGATGGGCG TGCCCTATGA GCCGCTGATA TACAGAGAGG CATCAAAGGT CGTGAGGGTC
AGGAATGTGC TTCTGACGGA GGGTGGGTGC ACGTACTTCC ACGCGGTTGT TCAGATAGAA
AAGCAGGAGG AGGAGGATGG TTTGAAGGCC ATACAGGCCG CGATGGCCGC ACACGGGAGC
CTGAAACATG TGCTTGTTGT CGACACGGAC ATCGATATCC ACGATCCGAG AGAGCTGGAG
TACGCGATCG CGACCAGGGT TCGCGGTGAT CAGGACATTT ACATGTATCC GAACGTGAGG
GGGAGCACGC TGGATCCGAG ATCTGTGGAT GGGATGACAA CAAAAGTAGG GGTCGATGCG
ACCGCAAAGC TCGACAGGCT CTGGAAGTTC AGGCGTGTTG TCAGACCGTG GTGA
 
Protein sequence
MSFRSFLEDL RADGVLEETH EHVSTEYELA MRATGRGPML FHNADGHVCC INILGSRELL 
ARALRMDARN LARDLSAVGF DGHVREVDSS QFQENILEPD LMRLPVLRHF RGDGGRYITS
GIVVSRLDDR INACVHRLMV LDRNRLAARL VPGRHTHQMY SRAIETGRRL PVAIAIGVDP
VVLIAASTRV PENKEFEYAS ALRGDVVEVV TLENGVPVPH AEIVLEGYLT EKRAPEGPFV
DITGTMDIVR EEPVIEITRI MMRDDAIYHA LLPAGGEHRM LMGVPYEPLI YREASKVVRV
RNVLLTEGGC TYFHAVVQIE KQEEEDGLKA IQAAMAAHGS LKHVLVVDTD IDIHDPRELE
YAIATRVRGD QDIYMYPNVR GSTLDPRSVD GMTTKVGVDA TAKLDRLWKF RRVVRPW
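
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first codons by hand. This is a minimal sketch, not the database's own pipeline: the `translate` helper and the codon table construction are illustrative, and it relies on translation table 11 using the same codon-to-amino-acid assignments as the standard genetic code (the two tables differ in which codons may act as start codons).

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring spaces and newlines."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence, in the same 10-base blocks as above.
first_block = "ATGAGTTTCA GATCTTTCCT AGAGGATCTC"
print(translate(first_block))
```

The output can be compared against the first ten residues of the protein sequence, `MSFRSFLEDL`. Passing the full gene sequence to the same helper would yield the complete protein followed by `*` for the terminal `TGA` stop codon.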