Gene Mpal_1264 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_1264 
Symbol 
ID7271542 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp1291897 
End bp1293297 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content54% 
IMG OID643569898 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002466322 
Protein GI219851890 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.543529 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAAT CCAACTCACC TCAACCCACA AAAAGTGAAA AAGCGTTCGA TTGGCGTTTT 
GTAACACCTC TCTATATCGG CTCCGCACTA AATCCTGTCA ACAGTTCGTT CATTGCCACT
GCGCTGGTGC CAATAGCAGC GGCCATCAAC GTTCCGGTCG GACAGACTGC TGTTCTGGTC
GCAGCACTCT ATATGGCATG TATCGTTGCC CAGCCGGCAG CCGGAAAATT ATCTGAGGCA
TGTGGGCCAC GGCGGGTGTT CCTTGCAGGT ATTCTCGCTG TACTGGTTGG AGGAGTGCTG
GGTGGCTCAG GCCATGACCT CGCAACGTTG ATCATATCGC GGGTCCTGAT TGGTGTGGGC
ACCTCGACCG GCTATCCTTC GGCAATGCTT TTGATCCGAC AGCGGGCCGA ATCGGCCGGG
CTGACCGAGC CCCCGGGAGG AGTGCTTGGC GGCCTTGTGA TAGCGGGAAT GGCGACTGCG
GTTATAGGTC TGCCCATTGG CGGATTCCTC GTCGGCGCCT GGGGCTGGCA GAGCGTGTTT
TTTATTAACG TCCCGCTGGC TCTCGTAGCA CTCATTATGG CTGTGTCCTG GATCCTCCGG
GACCCGCCAA GCAGGAGTAC AAAGACGCTC CGTACCCTGG CAGCCCGCAT CGATCTGGCC
GGCATCACGG TCTTTAGTGG CGTGATGCTT TCCCTCCTGG TCTTTCTCAT GTCATTGCCA
GATCCGGATT GGGTTGTTTT AGGCGTAGCT GTTCTGCTCG TTTTGGCCTT TGTCTGGTGG
GAAGGACAGG TGAGTCAACC TTTTATTGAC CTCCGCCTGT TGGCAACGAA CCGGCCATTG
ATACTCACCT ATGTGCGCTT TGCCCTTGCA TCGCTGTGCG TCTACACCGT AATGTATGGT
GTCACGCAAT GGCTTCAGAT CGACAAAAAT ATTCCATCCG CTGATGTCGG ATTCATCATT
TTGCCAATGA GTCTCATATC CATTGTGCTT GCGTGGCCGG TATCGCGGCT GAACCTCGTG
CGCACTCCCC TTATTGCGTC CGCCGTTGCC TGCTTGATAG GGTCTGTGGG TGTACTTTTA
TTTACCACGG CGACTCCACT AATCTGGATA GTTGTAGTCA CTGCGATCTT CGGGATTACC
ATGGGGATGT GCACCAGTGC GAACCAGACA GCCTTTTACA CCCAGGTCAC CGCAGATCAG
ATCGGTACCG CTTCAGGCCT GTTCCGTACC TTTGGGTATT TGGGCTCGAT TACATCGTCG
GCCCTTATCG CGATATTCTT TAATCCAAAT GTCAGCGATC AGAGCCTGCA TTCAATTGCT
GCCGTTATGG TGATCCTGAG CGTTGTGGGG CTGCTTATTG TCATTGTCGA CAGGAAAATC
ATGGTGCTGG CAAAAGTATA G
 
Protein sequence
MNKSNSPQPT KSEKAFDWRF VTPLYIGSAL NPVNSSFIAT ALVPIAAAIN VPVGQTAVLV 
AALYMACIVA QPAAGKLSEA CGPRRVFLAG ILAVLVGGVL GGSGHDLATL IISRVLIGVG
TSTGYPSAML LIRQRAESAG LTEPPGGVLG GLVIAGMATA VIGLPIGGFL VGAWGWQSVF
FINVPLALVA LIMAVSWILR DPPSRSTKTL RTLAARIDLA GITVFSGVML SLLVFLMSLP
DPDWVVLGVA VLLVLAFVWW EGQVSQPFID LRLLATNRPL ILTYVRFALA SLCVYTVMYG
VTQWLQIDKN IPSADVGFII LPMSLISIVL AWPVSRLNLV RTPLIASAVA CLIGSVGVLL
FTTATPLIWI VVVTAIFGIT MGMCTSANQT AFYTQVTADQ IGTASGLFRT FGYLGSITSS
ALIAIFFNPN VSDQSLHSIA AVMVILSVVG LLIVIVDRKI MVLAKV