Gene Mthe_0101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_0101 
Symbol 
ID4462493 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp94211 
End bp95236 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content56% 
IMG OID639699110 
Productglycerophosphoryl diester phosphodiesterase 
Protein accessionYP_842543 
Protein GI116753425 
COG category[C] Energy production and conversion 
COG ID[COG0584] Glycerophosphoryl diester phosphodiesterase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATATA CATGGCTCCA CCATACTCGA TCGATGGTCT CACTGATTGG ACATCGTGGT 
GCTCCAGCTC TGGCTCCCGA GAACACGCTG CAGGGAATAA GGAAGGCGCA TTCATGCGGA
GCCGATATGG TCGAGATGGA CGTGCGTCTA TCCTCGGATG GCGTTCTGGT CCTAATGCAC
GATGAGACTG TTGACAGGAC AACGAACGGC TCAGGCAGAG TGGAGGATCT GAGCATCGGG
GAGCTCAGAG GTCTGGATGC GGGTGGAGAG CCAGTGCCGA CGCTGAAAGA GGCTCTTAGG
CTCGCTGAGG TTCTCGGAAT CCAGCCGATC GTCGAGATGA AGGAGGAGGG CTTGGAGGAG
CTTCTACTGG AAGAGCTTGT TGGGTTGAAC GCAGTTGTGA CATCATTTTA CCACAGAAGC
GTGCTGGAAC TCAGTGAGCT TCTCAGAGAG AAAAAGGGCG CGGAGGGGAT AAAAACCGGC
ATCATCATAT CATCCCTGCC CGTGAACCCT GTGGATCTGG CCCTGGATGC GCATGCGGAT
GCGATATTTC CAAAGAGGGT GAGCCCGAAC ATCTTCAAGA TCGCACATAA AAGCGGTTTG
AAGGTTTACC CCTGGACGGT CAACACCCCT GAGAGGGCGG CATGGCTCCT CAGGCTCGGG
GCTGATGGCA TTGTCACCGA TGATCCATGC GCGATAAGGG ATGTGCTGAA AGCACCTCCA
AGAAACACAG GGCAGGAGAA CTGCGAGTAT TACCCGTGTC ATCACTTCGA GGGGCAGGAC
TGCACACACT GCTTCTGCCC GCTCTACCCA TGCAAGGACC CAGAGCTGGG CAGGTTTGTG
AGAACGAAGA GGGGGAAGAG GTTCTGGTCA TGCATAGACT GCGTTCTGGT CCACATACCC
GAGGTCGCCA GGTATCTCGA GGCGAACCCG GATGCTGGAA CAGAGGAGCT GAAGAACTTT
CTCGGCACCA CTGGAAGGGG GTGCTTCCGC AGAGCAGACC GCGCTGGGAA GGGGACCGGC
TCATGA
 
Protein sequence
MEYTWLHHTR SMVSLIGHRG APALAPENTL QGIRKAHSCG ADMVEMDVRL SSDGVLVLMH 
DETVDRTTNG SGRVEDLSIG ELRGLDAGGE PVPTLKEALR LAEVLGIQPI VEMKEEGLEE
LLLEELVGLN AVVTSFYHRS VLELSELLRE KKGAEGIKTG IIISSLPVNP VDLALDAHAD
AIFPKRVSPN IFKIAHKSGL KVYPWTVNTP ERAAWLLRLG ADGIVTDDPC AIRDVLKAPP
RNTGQENCEY YPCHHFEGQD CTHCFCPLYP CKDPELGRFV RTKRGKRFWS CIDCVLVHIP
EVARYLEANP DAGTEELKNF LGTTGRGCFR RADRAGKGTG S