Gene Mthe_1461 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_1461 
Symbol 
ID4462404 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp1563317 
End bp1564525 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content55% 
IMG OID639700480 
Productmajor facilitator transporter 
Protein accessionYP_843875 
Protein GI116754757 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.609311 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGATGTCAA AAGATCCGCA GGCTGAGGAT GGCAGCAGAA ATATTCTCTT CTTGGGGCTG 
GTCAGTCTCT TTAATGACAC CAGCAGTGAG ATCATACAGC CGATAATGCC TCTCTTCATA
ACATCGCTTG GAGGAGGCGC GCTTGCTGTG GGGCTGGTGG GAGGTCTCAG TGAGGGCATT
CCGAGCATTC TGAAGATCCT ATCAGGATAT GCGGGTGATA TAATGGGGAG AAGAAAGCCG
TTTGTCCTGG CGGGCTATGG CCTCTCTGCA GCGGCCAAGA CGCTTCTTCC TTTATCGTTA
ACATGGCAGC ATGTTGTCCT GCTTAAGAGC ATCGAGCGAT GTGGCAAGGG TGTGAGGGCA
GCGCCGAAGG ATGCGATCAT CGCGGATTCG GCCGAGCCCT CACGCATCGG ACGCGGGTTT
GGGACCGTCA GGGCTCTCGA CACCTTCGGA GCGGTGCTGG GATCCGTTAT AGCTTATCTC
CTGTGGAGCG CAGGCCTGAG CTTCAGGGAT ATTCTGGCGG TAGCGGCTGC ACTCTCATTG
ATGGCATTTG TCCCTCTTTT TTATGTCAGG GATATAAGGA GATCTCCAGT GGTAGTCCTC
AGGCCTGGCA TCTCCTCTCT ATCGCCACGG CTCAGGTGGT TCATTCTCGT GGCATCCGTA
TTTTCGCTTG CCAACTTCAG CTACATGTTC TTCATGCTCA GGGCGCAGGA GCTCTTCACA
GGGGCTCTCG CAGTTGGAGC TCCTCTTCTG CTTTACATCC TCTTCAACGT GGTCTACTCG
GGAATGGCCA TACCGAGCGG AGCGGTATCC GATCTCATAG GGAGAAGATG GATCCTCGCG
ATCGGATACG CGCTCTTCTC CTTGGTTGCC CTCGGATTCG TCTTCGTATC GTCTTCAGAG
TGGCTCGTGC TGCTCTTCGT CATGTACGGT CTTGTCTTCG CGGTGGTCGA TGGGGGGCAG
AGCGCATATG TTTCGGATCT GAGCTGTGAT GAGATAAGAT GTACAGCACT GGGCGCATAT
CATGGAATGG TTGGGATCGC CTCGATAGCT TCTGGCCTGA TCGCCGGCAC GGTCTGGCAG
ATATATGGAC CGGCGGCGAC CTTCCTCCTC AGCGCCCTGC TGGCCGCGGT AGCATCATGT
GGCATGATCC TCGGAGGGGT GATTACGGGT GGAGAGAGTC GACAGCTATC ATGCTCAGGT
CGAGAGTAA
 
Protein sequence
MMSKDPQAED GSRNILFLGL VSLFNDTSSE IIQPIMPLFI TSLGGGALAV GLVGGLSEGI 
PSILKILSGY AGDIMGRRKP FVLAGYGLSA AAKTLLPLSL TWQHVVLLKS IERCGKGVRA
APKDAIIADS AEPSRIGRGF GTVRALDTFG AVLGSVIAYL LWSAGLSFRD ILAVAAALSL
MAFVPLFYVR DIRRSPVVVL RPGISSLSPR LRWFILVASV FSLANFSYMF FMLRAQELFT
GALAVGAPLL LYILFNVVYS GMAIPSGAVS DLIGRRWILA IGYALFSLVA LGFVFVSSSE
WLVLLFVMYG LVFAVVDGGQ SAYVSDLSCD EIRCTALGAY HGMVGIASIA SGLIAGTVWQ
IYGPAATFLL SALLAAVASC GMILGGVITG GESRQLSCSG RE