Gene Mthe_1024 details


Gene Information

Locus tag: Mthe_1024
Symbol:
ID: 4462773
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosaeta thermophila PT
Kingdom: Archaea
Replicon accession: NC_008553
Strand:
Start bp: 1108026
End bp: 1109195
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 56%
IMG OID: 639700043
Product: pentapeptide repeat-containing protein
Protein accession: YP_843449
Protein GI: 116754331
COG category: [S] Function unknown
COG ID: [COG1357] Uncharacterized low-complexity proteins
TIGRFAM ID:
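
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon; both assumptions fit the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1108026, 1109195)
print(length_bp)                  # 1170
print(protein_length(length_bp))  # 389
```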


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.299453
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGATCAC AAATCAAGCG GCCAAATGGA TCCCTGCGCC CAGGTGGTAT TATGAGCTCA 
GGTTATAAAA TGATAATAAT TCTCACGATT TTAATGAGCA CTGCATATGC AGTGGATATA
TGTGACAGAT CTGATCTCCG TTTCTCTGAT CTGCGCGGCC GGGATCTCAG CGGCGCAAGT
CTCAACCAGT CAGACCTGAC GGGCGCGGAT CTCAGGGGTG CAAACCTCAA CGGAGCCTAT
CTGAGATCCG CCTGGCTTGT TAATGCAAAC CTCGAAGGTG CTTCGCTGGC AGGCGCGGAT
CTGAGCATGG CGGACCTCAG CGGCGCAAAT CTCAGCGGCA CGGATCTCTC CAGGGCCAAG
CTCAGGAACG CGCGGCTTAG TGGTGCAAGT CTGGTAAACG CAAATCTGAC CATGGCGGAC
TGCACAGAGG CCCTGATGGA CGATGTCTCT CTTGAGGATG CTGAGATGAC TGGAACCAGG
TTCTTTCGCA CAGATCTCAC AGGCGCGGTC TTCTCCGGCG CATCGCTTAG CCATGCGAAC
TTCGTCGGCG CTCATCTGAG CTGGGCGGAT ATGAGCAGGA GCCGGTTCAG GGAGAGCCAG
TTCTCCAGAG CTGAGCTCTA CGGAGCGAAC CTGACAGGTA CAGATCTCAG CGGCTCCGAC
TTCACGCGGT CATACATGAT GAGGGCCAGA ATGACAGGCG CGGATCTGAG TGACGCAAGC
CTGGATTATG CAGACCTCAC AGAGGCAGAG CTGAGAGATA CGGACCTAAG CGGCTGCAAG
ATGCGCTACG CGGATCTCAG CGGGGCCAAT CTGGCAGGCG CGGATATCTC AGAGGTGGTG
CTGGATTCTG TGAAGACGAC AGGTGTAAAC CTCAGCGGAG CAATCCTGTA CAAGACATCG
CTCTTCAATC TCGACCTCAG GGACATCGAT ATGCATGGGG TGCAGATCAA AAAGGCGAAG
ATGGACACAG TCTTCCTCAC AAACTCGAAC CTCGCAGGGG CGGTGCTGAA TGATGTGACG
ATGCACATGG TCGAGATGAC GAACGTGGAT CTGAGCGGGG CGAGCCTGCG CAACATCGAG
TACGATGAGT TCACACTGAG ATCGCTCGAG AAGGCCAACC TGGATGGTGC TTCCATGGAC
GACCGCCTGA AGTCGGACCT GAGCGGGTGA
 
Protein sequence
MRSQIKRPNG SLRPGGIMSS GYKMIIILTI LMSTAYAVDI CDRSDLRFSD LRGRDLSGAS 
LNQSDLTGAD LRGANLNGAY LRSAWLVNAN LEGASLAGAD LSMADLSGAN LSGTDLSRAK
LRNARLSGAS LVNANLTMAD CTEALMDDVS LEDAEMTGTR FFRTDLTGAV FSGASLSHAN
FVGAHLSWAD MSRSRFRESQ FSRAELYGAN LTGTDLSGSD FTRSYMMRAR MTGADLSDAS
LDYADLTEAE LRDTDLSGCK MRYADLSGAN LAGADISEVV LDSVKTTGVN LSGAILYKTS
LFNLDLRDID MHGVQIKKAK MDTVFLTNSN LAGAVLNDVT MHMVEMTNVD LSGASLRNIE
YDEFTLRSLE KANLDGASMD DRLKSDLSG
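
The relationship between the gene and protein sequences above can be reproduced with a short translation routine. This is a minimal sketch, not the database's own pipeline: it uses the standard codon assignments (translation table 11 shares them for elongation and differs in which start codons it permits), removes the spaces that separate the 10-base blocks, and stops at the first stop codon. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last 30 bases of the gene sequence listed above.
print(translate("ATGAGATCAC AAATCAAGCG GCCAAATGGA"))  # MRSQIKRPNG
print(translate("GACCGCCTGA AGTCGGACCT GAGCGGGTGA"))  # DRLKSDLSG
```

Passing the full gene sequence to `translate` should give the protein sequence listed above, and `gc_percent` can be compared against the reported GC content.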