Gene Mthe_1206 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_1206 
Symbol 
ID4462348 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp1301280 
End bp1303313 
Gene Length2034 bp 
Protein Length677 aa 
Translation table11 
GC content52% 
IMG OID639700224 
Productcarbohydrate-binding and sugar hydrolysis 
Protein accessionYP_843627 
Protein GI116754509 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3420] Nitrous oxidase accessory protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGCTCG TTCTTTGTGC AATCATGCTG ATTCTCCTCA TTCCGTCTGA GGGGCGCGTG 
ATAAGGGTCG ATGATGACGG CGGCGCTGAT TTCTCCAGCA TTCAGGATGC TCTTGCCCTG
GCAGGGGATG GCGATATAAT AGAGGTGATG AGCGGGGTTT ATGCGGAGAG CGTGAACATA
ACAAAGTCAG TATGTGTGCG AGGGGTCGAC TCAGGAGAGG GCCTTCCTGT CCTGGACGGA
GGCGGCCTGG ATGGTTTTGT TGCCTCATCA GATGGCGTTA ACGTGAGCGA ATTCATCATA
ACGAATGCGA GCGCGTGGGG CGCAGCGGGC ATCAGGGTTC TCTCAGAGAA CGCGCTCATA
TCAAGATGCA ACGTGAGCGA CTCGTTCGCT GGTGTTCTCC TGCTAGGCGG GAACGGATCT
GTTGTTGATT GCAATGTAAC CGAATGCCAC TTCGCCGGAA TTGTGGCTGC ATCCTCTCGT
AATACCGTGA AAATGAACAG GGCACATGAG AACAACAATG CCGGTATCCT CATTCTAAGA
TCAGATGAGA ATCTCGTGGA AGGCAACGAT GCGAGCTACA ACCTCGGAAG CGGGATAAAG
GTTTACAAAT CAAACGATTG TGTTGTAAGA AACAACACTG CAGATACTAA TGATTACGGG
ATAGTGCTCT CCGAGTCTCA TAGATGCACT GTTGAGGGCA ACATCCTCGA GAACAACGAT
TACGGCATAT ACCCATACCT CTCCTCCACG AACACCATAT CCAGAAACAC CGCGAGGACG
AACGATTATG CTGTATACCT TTACAGCTCC TGGAATAACA CCATCACAGA AAACGACCTC
AGGATGAATG ATGCATACGG AATTCTGATA TACTATTCAG ATAATAATAC GATCTCATCG
AATGATCTGA GATCAAACGA GATCGGCGGG CTGAGCGTGA GGGGTTCCAG AGGCAACAGA
TTATGCGGAA ACATGATGGG CGACAGCCCA TATTCGCTCC ACATTGAGGA CGCGGAGTTC
AATATCGCAG AAGGAAACCG GATAAGCAAT GGCGAGACTG GAATCATTCT TGAGAGGTCA
ATGAACAACA CTCTCTCCGG CAACACAATC TCTGAGACGC TTACAGGGAT ATCGGTCTCA
TCTGGCCGAG GGAATATCAT TGAAGGGAAT AACCTCACGG GATGTAGACT CGGCATAGAG
GCGAACGCAT CCATCGGGAG CATAATATCA GCGAACATCA TCGAGAGATG CGATCTGTGC
GTTCATCTCC TCCACTCTCC GGCCTCCAGG CTCGAGGGCA ACATCATCAG GCAGGGCAGC
TCAGGGGTGC TCGTCGAGGC GTCCGATGGT TTTGCAGCGG TCGATAACAC GTTCGATGGT
GTGCCCAGAG CAATTACCAT CACTGCATCC CACGGCTCCC GGGTATCGAA CAGCAGCATA
ACCGGAGCTG ACACCGCGAT TCTCGTGAAC TCCTCTGAAG AGGTGGTGGT GGAGGGCAAC
AGCATGAGCA ACTCCACTGA GGGCATCAGG CTCCTCGCAT CATATGATAC AGAGATCGCC
GGGAACGTCA TCACAGGCGC TAAGAACGGA ATCATCCTGG ACGAATCGAA TGAAAACACA
ATCGAGGAGA ATAACATCTC GAGATGCAAC AGCGGGATCA CGCTCTCACG CTCCTGGAGG
AACGTTCTGA TGATGAATAA CATCTCATAC AACACCAACG GCCTGGTGCT GGATGAGGGC
ATGTATCCCA TAGAGTCAAG CGAGAATCAG ATCTACCTCA ACAGCTTCGT TCAGAACAGA
AACGATGTCG TCTCATACAT ATCGAGCAAT TTCTGGAGCT CTCCCACGAG CATTCGATAC
ATCTATCGCG GCAGGAGCTT CGAGAGCCGG ATGGGCAACT ACTGGCACGG TTTGGAGGGG
GAGGACAGGA ACGGCGATGG GATACTGGAC AGTGGCAGGA GCGTTGGCCT CGAGGACGAC
CCCTATCCGC TCGCAGAGCC TCCGGAGAGG TACAGGGTGC TCTCAGGCGT GTGA
 
Protein sequence
MRLVLCAIML ILLIPSEGRV IRVDDDGGAD FSSIQDALAL AGDGDIIEVM SGVYAESVNI 
TKSVCVRGVD SGEGLPVLDG GGLDGFVASS DGVNVSEFII TNASAWGAAG IRVLSENALI
SRCNVSDSFA GVLLLGGNGS VVDCNVTECH FAGIVAASSR NTVKMNRAHE NNNAGILILR
SDENLVEGND ASYNLGSGIK VYKSNDCVVR NNTADTNDYG IVLSESHRCT VEGNILENND
YGIYPYLSST NTISRNTART NDYAVYLYSS WNNTITENDL RMNDAYGILI YYSDNNTISS
NDLRSNEIGG LSVRGSRGNR LCGNMMGDSP YSLHIEDAEF NIAEGNRISN GETGIILERS
MNNTLSGNTI SETLTGISVS SGRGNIIEGN NLTGCRLGIE ANASIGSIIS ANIIERCDLC
VHLLHSPASR LEGNIIRQGS SGVLVEASDG FAAVDNTFDG VPRAITITAS HGSRVSNSSI
TGADTAILVN SSEEVVVEGN SMSNSTEGIR LLASYDTEIA GNVITGAKNG IILDESNENT
IEENNISRCN SGITLSRSWR NVLMMNNISY NTNGLVLDEG MYPIESSENQ IYLNSFVQNR
NDVVSYISSN FWSSPTSIRY IYRGRSFESR MGNYWHGLEG EDRNGDGILD SGRSVGLEDD
PYPLAEPPER YRVLSGV