Gene Mthe_1239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_1239 
Symbol 
ID4463170 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp1334799 
End bp1335950 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content55% 
IMG OID639700256 
Productpeptidase M24 
Protein accessionYP_843658 
Protein GI116754540 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.862796 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTAGAG ATGCTGTTGT CGGCGCTGTC AGAGCGCTCG GAGTCGATGG ACTGCTTCTC 
GTGGGCGACA GCGTCTGCGA TGCTGATATC TACTATGCAT CGAGGTTTCT ATCCAGCGAT
AGATTCGCTG TGCTGATCAC AGACAGGATC CATCTTCTGG TCTCCAGCAT GGAGAGAGCG
AGGGCCTCGT CAGAATCAAA GGCTGATGTG GTGGAGACAA CGAGCGATTA CTCCATGAAA
TCCAGGATCG AAGAGTTTGG GAGTGCTGAT AAGGCATACA TTAAGGTCCT CGAGGAGTTC
GTCTCCAAAC ATGGAATCTC GCATCTCGGC ATACCCTCAA ATACCCCCGC GGGAATTTAC
AGAAGCCTAA CTGAGCAGTT CGAGACCTCT CTCCTGGATA AACCATTTGA GCACATTCGC
GCTGTTAAAA CGCCTGAGGA GATCTCAGCG ATTGCAGAAG TCCAGGAGGC ATGCGAGTCT
GCAATGGAGG TTGCAGTAAG TCTCATAAAA AAATCAAAGC CCACTGGTGG CATCCTTGTT
TTTGACGGCA AGCCGCTCAC CTCTGAAAGG GTGAGGAGCG CTGTTGAGCT CAGGCTTGCG
GAGCTGGGAT GCGAGACTCT GGACACCATA GTCTGCGGTG GTCTCATGAG CTCCAGTCCA
CATTCAAGAG GCAGCGGACT GCTTCCCGCG GACATGCCCA TCGTCATAGA CATATTCCCG
CGATCGAAGA GCAGCAGGTA CTTTGCGGAC ATGACCCGAA CGGTCGTCCG CGGGGAGCCA
TCGGTAGAGA TCGTGGAGAT GTATCAGGCT GTGAAGATAG CTCAGGAGGC GGGTCTGAAG
TGCATAAAGG AGGGTGTGAG CGGAGCCGAT GTGCACGGGG CCGTATGCAG AACGTTCGAT
GATTTTGGAT ACACAGAGCG GGAAGAGTGT GGTTTCATCC ACTCAACAGG CCACGGCGTC
GGGCTGTCGA TACACGAGAG ACCCTCCCTG AGCGAGCACG GTGGGACGCT CAGATCAGGG
AATGTGGTCA CGGTTGAGCC CGGGCTGTAC TATCCGGATA TCGGTGGAGT CAGGCTTGAG
GATCTCGTGG TTGTCAGAGA GAACGGGTGC GAGAACCTGA CAGCATTCGA GAAGGAGCTT
GTGATCCGGT AG
 
Protein sequence
MSRDAVVGAV RALGVDGLLL VGDSVCDADI YYASRFLSSD RFAVLITDRI HLLVSSMERA 
RASSESKADV VETTSDYSMK SRIEEFGSAD KAYIKVLEEF VSKHGISHLG IPSNTPAGIY
RSLTEQFETS LLDKPFEHIR AVKTPEEISA IAEVQEACES AMEVAVSLIK KSKPTGGILV
FDGKPLTSER VRSAVELRLA ELGCETLDTI VCGGLMSSSP HSRGSGLLPA DMPIVIDIFP
RSKSSRYFAD MTRTVVRGEP SVEIVEMYQA VKIAQEAGLK CIKEGVSGAD VHGAVCRTFD
DFGYTEREEC GFIHSTGHGV GLSIHERPSL SEHGGTLRSG NVVTVEPGLY YPDIGGVRLE
DLVVVRENGC ENLTAFEKEL VIR