Gene Mthe_0917 details


Gene Information

Locus tag: Mthe_0917
Symbol:
ID: 4462381
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosaeta thermophila PT
Kingdom: Archaea
Replicon accession: NC_008553
Strand:
Start bp: 999841
End bp: 1000935
Gene Length: 1095 bp
Protein Length: 364 aa
Translation table: 11
GC content: 54%
IMG OID: 639699936
Product: hypothetical protein
Protein accession: YP_843345
Protein GI: 116754227
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG4591] ABC-type transport system, involved in lipoprotein release, permease component
TIGRFAM ID:
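
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates and a gene length that includes the stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length(999841, 1000935)
print(length)                  # 1095
print(protein_length(length))  # 364
```

Both values agree with the Gene Length and Protein Length fields listed in the record.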


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCTGGCAA TCGGACTGGC GGTAGCTGTG AGTGTGACCT CTGTATCGCT CCAGGCCGGC 
TTTCAGGAGT ATCTCCTAGA CATAATCGTG AAAGACCTCG CGCATGTCAG CGTTAGCCCC
AAGGAGGGTG AGGAGTACAT CTACCTGTAC AGAACGCTCA TGGAGAGGAT ATGGCAGCTT
GAAGGCGTTA CCGCGGTCTC TCCACGCCTG AGCACACCCG CATCTCTTTC ACACAAGAAC
AACGTCGAGA ACGTCATTCT GATAGGTGTT ATCCCATCAG AGATGGAAAG GATCTATCCA
AGCATATCAG AGCGGATGGT TTATGGAGAA CTGGAGTCGA TCCAGCAGGA GAACAGGATT
GTGATGTCTA AGAAGCTCGC CGAGAAGCTG GATGTTGAAC TTGATGATAC AGTGGATGCG
AGGTTTCCCG ATGCGAATCC GCTCAACCTC ATGGTCACCG GGATATTCGA TCCCCCTCAG
GGCTTTCCTG AGGAGATGAC ATTCGTATCA CTCCGCACAG CCAGGAACTT CCTGGGTGAG
GGGGATGTCA TAAACGGCAT TGATATAAAG CTCCATGACA TCTACATGGC GGATCGAATC
AGCAGGGAGA TATCTGCGAC AGGCTACAAG GCAGAGAGCT GGCAGCAGCT GTATCCGGAG
ATCCTCAGGA CCATTGCAAT AGAGAACTTT GAGAACCGCA TAATCATGCT CCTGATAATG
ATCATCGCGG CCTTCGGGAT AGCGAGCGTG ATGTACATGC TTGTTCTAGA GAAGACCTCT
GAGATAGGCA TGCTCATGGC TGAAGGGGCG ACAGGCGCAA TGATACGCAA CATCTTTCTC
ATACAGAGCA CCGTCCTGGG CCTCATCGGC GGCATCTGCG GCGCTGCGGG CGGCGTTGCC
CTTTCTCTGT ATCTCAAAGG CATGGAGTTC GAGGTCGAGG CTCCTGGCTG GGAGGAGTTC
GTGCTGCCTG TGGTCATAGA TCCCTGGAAC ACACTGATCA TAGTGGTCGC CGCGGTCCTC
CTGAGCCTAG CAGCAGGCGT GTATCCCGCG CACAAGGCAT CGAAGCTCGA TCCCGTAATC
GCCCTGCATG GATGA
 
Protein sequence
MLAIGLAVAV SVTSVSLQAG FQEYLLDIIV KDLAHVSVSP KEGEEYIYLY RTLMERIWQL 
EGVTAVSPRL STPASLSHKN NVENVILIGV IPSEMERIYP SISERMVYGE LESIQQENRI
VMSKKLAEKL DVELDDTVDA RFPDANPLNL MVTGIFDPPQ GFPEEMTFVS LRTARNFLGE
GDVINGIDIK LHDIYMADRI SREISATGYK AESWQQLYPE ILRTIAIENF ENRIIMLLIM
IIAAFGIASV MYMLVLEKTS EIGMLMAEGA TGAMIRNIFL IQSTVLGLIG GICGAAGGVA
LSLYLKGMEF EVEAPGWEEF VLPVVIDPWN TLIIVVAAVL LSLAAGVYPA HKASKLDPVI
ALHG
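
The gene sequence begins with GTG while the protein begins with M, which is consistent with translation table 11 allowing alternative start codons. The sketch below shows one way to reproduce that relationship and to compute GC content. It is an illustration, not the database's own pipeline: the helper names are hypothetical, the start-codon set is only a subset of those allowed by table 11, and the amino-acid assignments are taken to match the standard code.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon ordering (standard code;
# table 11 is assumed here to use the same assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: a subset of the initiation codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces/newlines used for 10-base blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # an initiating GTG/TTG is read as methionine
        protein.append(residue)
    return "".join(protein)


print(translate("GTGCTGGCAA TCGGACTGGC"))  # MLAIGL
print(translate("GCCCTGCATG GATGA"))       # ALHG
```

The two calls use the first 20 and the last 15 bases of the gene sequence above; their output matches the start and end of the listed protein sequence. Passing the full gene sequence to `gc_content` gives a fraction that can be compared with the GC content field.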