Gene Mpal_1052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_1052 
Symbol 
ID7271786 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp1081710 
End bp1083200 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content52% 
IMG OID643569688 
Productdrug resistance transporter, EmrB/QacA subfamily 
Protein accessionYP_002466122 
Protein GI219851690 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.923754 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.334038 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAGTA CCAGTCAGGG AACGGATCAG AAAGGATTGA ACCTTCTGAT CCTCTCCATC 
TCCCTCGCAA CGTTCATGTC GTCACTGGAC GGAACGATCG TCAATATCGC CCTGCCGACC
ATCTCTTCGG TGTTCAATAT CTCTTCGAGT ACCGTGAGTT GGGTTGCGAC CATTTACCTG
CTGGTGATGG CTGGCTGCGT CCTGATCTTC GGTAAGTTGT CGGACAGTGT CGGGTTTAAA
AGAATGTTCC TATCAGGATT TGTAATCTTT ACCCTGGGAT CGTTCTTATG CGGTCTGCTC
CCGGACCTCC TATCTTCATT CTTCGCGCTC ATCGGTTCAC GTGCATTCCA GGCAATAGGC
GGTGCCATGA TAACAGCGAT TGCGCCGGCG ATGATCGCTG CATACATCCC CATGAAGCAG
AAAGGAAAAG CGATGGGAAT CGTTATGACC GTCGCTGCAC TCGGGACCGC CATCGGACCG
ACCATCGGTG GAGTCCTCAC CCAGTACATC TCCTGGCACT GGATATTTTT CATCAACGTA
CCGGTGGGAA TCTGTGCAAT CATACTGGGG TTACGTGTTA TTCCCACCAC TCAGCCCCAC
AATAAAAATG CCGGCTTTGA CAGAGCCGGC GCGTTGTTGA TCTTCACCGG CCTCGCTGCA
CTGCTCTTTG CGGTTTCAGA AGGGCAGTCG CTTGGGTGGG ATTCCCCGGT GATCCTCGGT
TCCCTTGCCC TCGCTCTCAT TACACTCGGT TACTTTGTAT GGCACGAACT CAGGACCGCT
GACCCTCTGC TGGAACTCCG TCTCTTTAAA AACAAGAACT TCCTGATGAC CAATCTTGTC
CTTTCGCTGG TCTTCTTCAG TTTTGCCGGT ATCAGTTACC TGCTCCCGTT CTATCTTCAG
TACATCAAAG GGTTCAGTTC CTCAGATGCA GGGATGATAA TTACCGCCCT ATCGGTTGCC
ATGATGGTCT CCGGCCTTCT TTCAGGAGCG CTGTATAACC GGGTTGGTGG CAGGATACTC
TGCATCGCTT CGGGGATCTT CCTGGTTGCC GGTTATTTTA TGATGACCCT CCTCCGGGTC
GACACCTCAA TCGGATTTGT GATTCTCTGT CTGCTCGTGC TCGGTTTCAG CCTTGGCCTG
ATGATCACAC CGGCATCGAA TATGATCATG AACTCGGTTG CTAAGCGATA CCAGGGGATG
GTCTCCAGCC TCACGAGCCT TGAACGATTC GCACCGTTGA CCCTGGGGAT TGCTTTTGCA
AACCTGGTCT TTATTCAGGG GATCACAGCA ATTGCTGACA ACCGGGGGAT CACGGAGAGC
GCACCGGTTA ACATCAAACT GCACCTGATT ACTGCTGGTT TTGACCTTGC CTTCTTCTTC
TCACTGGTTA TTGCGGTCAT CATCCTCATC CTCACCCTGC TCGCACGACA GGAAGTGCAC
CCGGACTACC AGTCAGGCAC CGATGAGGAT GCTCTGAATA GTACAATCTA A
 
Protein sequence
MESTSQGTDQ KGLNLLILSI SLATFMSSLD GTIVNIALPT ISSVFNISSS TVSWVATIYL 
LVMAGCVLIF GKLSDSVGFK RMFLSGFVIF TLGSFLCGLL PDLLSSFFAL IGSRAFQAIG
GAMITAIAPA MIAAYIPMKQ KGKAMGIVMT VAALGTAIGP TIGGVLTQYI SWHWIFFINV
PVGICAIILG LRVIPTTQPH NKNAGFDRAG ALLIFTGLAA LLFAVSEGQS LGWDSPVILG
SLALALITLG YFVWHELRTA DPLLELRLFK NKNFLMTNLV LSLVFFSFAG ISYLLPFYLQ
YIKGFSSSDA GMIITALSVA MMVSGLLSGA LYNRVGGRIL CIASGIFLVA GYFMMTLLRV
DTSIGFVILC LLVLGFSLGL MITPASNMIM NSVAKRYQGM VSSLTSLERF APLTLGIAFA
NLVFIQGITA IADNRGITES APVNIKLHLI TAGFDLAFFF SLVIAVIILI LTLLARQEVH
PDYQSGTDED ALNSTI