Gene Mpal_1478 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_1478 
Symbol 
ID7270083 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp1527891 
End bp1529090 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content51% 
IMG OID643570101 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002466523 
Protein GI219852091 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.311538 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAAATT GGAAGAGAAA CTTAATCGTC TGCTGGTTCG GGATGTTCAT GACCGGTATG 
GGGCTGAGCC AGATCGCACC GGTATTGCCG CTGTACATCC AGCATCTCGG CGTCGACAAC
ACCGCGTTGA TCGAACAATT TTCTGGCATA GCATTTGGTG TAACATTTAT CATTTCAGCG
ATCTTCTCAC CGATCTGGGG TCTTGCCGCC GACAAATTCG GACGAAAATC GATGCTGTTA
CGAGCAAGTC TCGGCATGGC GATCGTGATC GGGTGCATGG GGTTTGCACA GAACGTGTAC
CAGCTGATCG GACTGCGGCT ATTGCAGGGC GTGATAACCG GGTACAGTTC GGCCTGTACT
GCATTGATTG CAACGCAGAC GGACAGAGAA CACGCGGGCT GGGCGTTGGG TACCCTTTCG
ACATCTTCGA TCGCAGGAAC TCTGCTCGGA CCCATGATTG GCGGGTACAT CGCAGAGAAC
CTGGGTTTTC AGGAGGTTTT TTTTATAACC GGTGCACTGC TGCTGATCGC ATTTATTGCA
ACCGCCCTCT TTGTGAACGA ATCATTCACC CGTCAGGACA GGATGGTGCT CAGTATCAAA
GAGACCTGGG GGACTGTTCC ACACAAAAGT TTGACACTTA TTCTGTTTGT GAGCTCTTTT
GTCATGACTC TGGCACTATA CTCCGTGCAG CCCATCTTGA CCATATATAT TACCCAGTTA
TCCAGTACCA CCAGTCATGT TGCTCTGCTG GCAGGTATGA CATTCTCGGC CTCAGGGCTG
GCCAGTATCG TTGCGGCTTC ACAATTGGGG AAACTCTCTG ATAAGATTGG CCCCCAGAAG
GTCATGCTTG CCGCACTGAT TGTAGCCGGA CTCATCTTTA TCCCCCAGGC CTTTGTAACC
GACCCCTGGC AGTTGATGGC CCTTCGATTT GTACTGGGAT TGGCGATCGC GGGATTGATT
CCGTCTGTCA ATACCCTGCT CAAGAGGATC ACACCGGATT CTCTGACCGG CAGGGTCTTC
GGTTTCAACA TGTCTGCAGG GTATCTGGGT GTATTTGGAG GATCAGTCCT GGGCGGGCAG
GTGGCAGCCT ATCTGGGTAT CAGATCGGTA TTCTTCATTA CCGGGGCATT GTTACTGGTA
AATGCAGTCT GGGTCTATTT CAAGGTGTAT AAAAATATCC GTATCGCAGA ATATGCATAA
 
Protein sequence
MQNWKRNLIV CWFGMFMTGM GLSQIAPVLP LYIQHLGVDN TALIEQFSGI AFGVTFIISA 
IFSPIWGLAA DKFGRKSMLL RASLGMAIVI GCMGFAQNVY QLIGLRLLQG VITGYSSACT
ALIATQTDRE HAGWALGTLS TSSIAGTLLG PMIGGYIAEN LGFQEVFFIT GALLLIAFIA
TALFVNESFT RQDRMVLSIK ETWGTVPHKS LTLILFVSSF VMTLALYSVQ PILTIYITQL
SSTTSHVALL AGMTFSASGL ASIVAASQLG KLSDKIGPQK VMLAALIVAG LIFIPQAFVT
DPWQLMALRF VLGLAIAGLI PSVNTLLKRI TPDSLTGRVF GFNMSAGYLG VFGGSVLGGQ
VAAYLGIRSV FFITGALLLV NAVWVYFKVY KNIRIAEYA