Gene Mthe_0502 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_0502 
Symbol 
ID4463249 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp514240 
End bp515403 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content54% 
IMG OID639699505 
Productmajor facilitator transporter 
Protein accessionYP_842933 
Protein GI116753815 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00880] Multidrug resistance protein 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGACC TCCCAAAGAG CATATTTCCC CTGTACCTGT CGGTCTTCGT CTCAGTTCTG 
GGTTTTGCAG TGGTGGCGCC GATATTCCCC CTCTACGTCC TTGATATGGG GGCCACACAC
CTCATGCTGG GGATGATAAT CTCCATTTAC GGCGCGGTTC AGCTGCTGAC ACAGATGCCC
GCTGGCAGGC TCTCAGACCA GAGGGGAAGA AAGCCAGTTC TCCTAATCGG TCTCCTGACA
TTCACAATAA TGCCGCTGCT CTACATTTAT GCATCAAATG CGTATCAGCT CCTCCTGATA
AGAATCTTCG GCGGCATAGG AGCTTCGATG GTCTGGCCAG TCACCATGGC TCTTATAGTC
GATTGCGTGG ATCCATCCCA CAGAGGTCTT GCGATGGGAT GGTACAATGC ATCATTCTAC
TCTGCCGTGG CTGTGGGTCC TGTCATCGGA AGCCTTCTCT ATGGGAGCTT TGGAATAAAT
GCGCCATTCA TATTCTGGAG TCTGTTTGCA GTCGCATCTC TTATCATGGT GACGTTTGTT
GTCAGAGAGC CGCCAGTCAG GGGAGAGGTG CTGTCCACAA ATACACCCAG AACACCCAAA
GCCCGGCTGA TTGTAGATGG ATCCATGATA ACGTTCATCA TATGCTGCAG TGTGGTTATG
GTACCTGGCA TAATCGGAGG GTTCAACATG ACCCTGCTGC CCGAGCTGGC GCTGAGCGTC
GGTGTGGGTG TATCCCAGCT GGGCATCCTG TATATGGCAT ACGCGGGCAG CAATGCACTG
GCCAATATCT ATTTCGGGAG GGTAGCCGAC CTTGGCCACC GCAGGCTTCT CATAAGCGGA
GGAAGTCTTG GATGTGTTCT TGGATTCCTG GTGCTTGGCC ACGGTCAGGG CATTCTCCCA
CAGCTCATCG CTCTGACTAT TCTAGGCCTG AGCTCTGGAA TATGCACACC AGCGGCTACT
GTTGTGGTCT CGTATATCAC AAGCCCGGAG AGGAGGGGGG AGATCTTCGG GATATTCAAC
ACCTCCAGAA TGCTCGGCGT CGTCATAGGT CCGATAATAG CCGGCCTCAC TGCGGATCTC
GGTGGGCTGG CAGGAGCTAT CACGGCATTC CTGGCTGTAT CCCTGCTGAT CTCTGCCATG
ACGCTGAGCC TTCGGGAAAT ATGA
 
Protein sequence
MADLPKSIFP LYLSVFVSVL GFAVVAPIFP LYVLDMGATH LMLGMIISIY GAVQLLTQMP 
AGRLSDQRGR KPVLLIGLLT FTIMPLLYIY ASNAYQLLLI RIFGGIGASM VWPVTMALIV
DCVDPSHRGL AMGWYNASFY SAVAVGPVIG SLLYGSFGIN APFIFWSLFA VASLIMVTFV
VREPPVRGEV LSTNTPRTPK ARLIVDGSMI TFIICCSVVM VPGIIGGFNM TLLPELALSV
GVGVSQLGIL YMAYAGSNAL ANIYFGRVAD LGHRRLLISG GSLGCVLGFL VLGHGQGILP
QLIALTILGL SSGICTPAAT VVVSYITSPE RRGEIFGIFN TSRMLGVVIG PIIAGLTADL
GGLAGAITAF LAVSLLISAM TLSLREI