Gene Mthe_0509 details

Gene Information

Locus tag: Mthe_0509
Symbol:
ID: 4463422
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosaeta thermophila PT
Kingdom: Archaea
Replicon accession: NC_008553
Strand:
Start bp: 520069
End bp: 521160
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 52%
IMG OID: 639699512
Product: hypothetical protein
Protein accession: YP_842940
Protein GI: 116753822
COG category: [R] General function prediction only
COG ID: [COG5643] Protein containing a metal-binding domain shared with formylmethanofuran dehydrogenase subunit E
TIGRFAM ID:
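
The length fields above follow from the coordinates. As a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (the function names are illustrative, not part of any database API):

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 520069
END_BP = 521160


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The final codon is the stop codon and encodes no residue.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1092
print(protein_length(length_bp))  # 363
```

Both values agree with the Gene Length and Protein Length fields of the record.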


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00001659
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGAGAT ATGTTTTAAG TAGTATCTGC ATCACCCTTG TCCTGATATC ATGCGCGTTT 
GCAGACGATG CCATGATACA GGAGATTGGT GTCAAAGCTG CAGAGAAGGC GATGAGCGAG
TTATCCTTCC AGAAAGGCGA TGAGAACATA CTTGTTTTGA CGAATGCCGG TTATGCGATC
GTCTCAGGCA TGACCACCCA GAAGGCGCTG AAGGGCATTA CTGAGACAGC AGGCTGCTCT
CATGGCGACG GAAACCTCTT TCAGGTTCTA AGGCCGCATT GGAAGCCGCT GTGGTTCTAC
TTCTTCGATA AGAACAGCAA AGAGGCGCTG TATCTGGAAG TGAAGCCGGA GGCGCTCTCG
ATGAGTTTGG AGGAGTTGAA AGCTGCGTCG GATGATGCAG TCTTCTCAAA GATCTCAAAG
GCAAATGTGG ATCTCGATTA CCTATTGAAT AACACCGATG AAGGTAACAG AACTTTTAAC
GAGAAGCTCT TCAACGGGAA CGAGTTCTCG CTTGTCGGCA TCTCGAATGT GTGGGCGAGG
AACGCCAGCT TTGACTTCAT TCAGGCTACT TCATTCCATG ATCATCTCTG CCCTGGAGTC
ACCAGTGGAT ACATGATCGC GAAATACGTG GAGAGGGAGC TGCCGATAAA CAGCAGTGCC
GAGAGCTACA AGGTGATAGC AGTGCCTCCA TGGTGCAAGG ATGACGCACT CCAGATTCTC
TGGGATGCGA CGGTCGGGAA GAGCGGCATC TTCGTCATGG CTCTGACGGA CACGGAGAAG
AATGCGCTCA AGGCGAAGTA CAACCAGAGC GACGTTGCGG GGATATTCGT GAGATGGAAT
GATACCGCAA AGCAGGGGGA TGCGCTGGTT CTGAGCTTCA ACTGGACCAG GATGTACGAG
CTCACGGAGA CGAAGGACTG GAAGGGCCCA TCCTGGGCGC CGAAGCTCGT GATGGATGTT
CGCATGATGG ACTACTGGGA TGAGCCAGAG ATCGCGGTGA GCGTTATAAA GAGATTCCAG
GTTGACCAGA ACATGCTGGC CCAGCTCCAG AACGCTGGCA TGCATCCCCT GAAGGTTGCG
GGAGTGATGT GA
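
The sequence above is laid out in space-separated blocks of 10 bases, so it has to be cleaned before any calculation. The sketch below shows one way to strip the layout whitespace and compute a GC percentage such as the one reported in the GC content field. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input; paste the full sequence to check the record's value.

```python
def clean_sequence(raw: str) -> str:
    """Remove block spacing and line breaks, and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used only as sample input.
first_line = "ATGATGAGAT ATGTTTTAAG TAGTATCTGC ATCACCCTTG TCCTGATATC ATGCGCGTTT"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(round(gc_percent(seq), 1))   # GC% of this 60-base window only
```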
 
Protein sequence
MMRYVLSSIC ITLVLISCAF ADDAMIQEIG VKAAEKAMSE LSFQKGDENI LVLTNAGYAI 
VSGMTTQKAL KGITETAGCS HGDGNLFQVL RPHWKPLWFY FFDKNSKEAL YLEVKPEALS
MSLEELKAAS DDAVFSKISK ANVDLDYLLN NTDEGNRTFN EKLFNGNEFS LVGISNVWAR
NASFDFIQAT SFHDHLCPGV TSGYMIAKYV ERELPINSSA ESYKVIAVPP WCKDDALQIL
WDATVGKSGI FVMALTDTEK NALKAKYNQS DVAGIFVRWN DTAKQGDALV LSFNWTRMYE
LTETKDWKGP SWAPKLVMDV RMMDYWDEPE IAVSVIKRFQ VDQNMLAQLQ NAGMHPLKVA
GVM
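
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The following is a minimal sketch, not the database's own pipeline. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which start codons are permitted, which this sketch does not model), and the `translate` helper is hypothetical.

```python
from itertools import product

# Codon table in the compact NCBI layout: bases ordered T, C, A, G,
# with the first codon position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # drop block spacing and newlines
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence above.
print(translate("ATGATGAGAT ATGTTTTAAG TAGTATCTGC ATCACCCTTG TCCTGATATC ATGCGCGTTT"))
# MMRYVLSSICITLVLISCAF
print(translate("GGAGTGATGT GA"))
# GVM
```

The two outputs match the first 20 residues and the final three residues of the protein sequence listed above. The last line is translated on its own only because the 1080 bases before it are a multiple of 3, so it starts in frame.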