Gene Msed_1095 details


Gene Information

Locus tag: Msed_1095
Symbol:
ID: 5103569
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1021515
End bp: 1022948
Gene Length: 1434 bp
Protein Length: 477 aa
Translation table: 11
GC content: 50%
IMG OID: 640506990
Product: major facilitator transporter
Protein accession: YP_001191183
Protein GI: 146303867
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
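
The start, end, gene length and protein length reported above can be cross-checked with a little arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the last codon is a stop codon that is not counted in the protein length; neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1021515
end_bp = 1022948

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1434 477
```

Under these assumptions the computed values agree with the listed 1434 bp and 477 aa.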


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAACTC TTCTCTCCCT AACCCTTATG TTAATGCTTG TGAACTACGT GGAGACCATG 
GTGATTCCAG CACTGCCTAA GATAGAGGAC CAATTCTCCA CAACCGCGAC CACCGTGGCG
TGGGTAACCT CGGCATACCT CATTGTGGGG GCGGTCGCGT CTCCGATTTT CGGCAAACTG
GGCGACAGAT ACGGGAAAAA GAAGGTTTAC CTGATCTCAA TCGGGTTCTA CTCGCTTGCG
GTTTTGATGG CAGGTTTCTC TCCGAACATT TACTTCCTGA TCTTCTCTAG GGGAGTTCAG
GGAATAGGAT ATTCCACATT TCCCTTGGCT ATTGCCATCA TCACTGACCT GTTCCCCAAG
GAAAGGGTGG CATGGGCACA GGGCATACTT AGCGCAACCT TGGCTGCTGG TCCTGCACTA
GGTCTCCTTG TGGGATCCTA TATAGTCCAG GACTTGGGGT GGCCGTACGC CTTTCACACG
GCTTTCATCC TCTCCTTGAT TTTGCTGGGC ATCTCGGCCA AGTACATCGT GGAAATCCCT
GAGAAGACTA GGGAAAAGAT TGACTACCTT GGTGCTACCT TCCTCATGTT AACAGTGGTG
CCACTCCTAG TTTATCTTTC CAATGGGCCC AACGTGGGGT GGACCACCTT GAGCCAGATA
GCCCTCATCG TGGTGTCAGT GGTAGCGTTC CCTATCTTCT TGATCGTGGA GAGGAGAACC
TCGGAGCCCT TGATGAGGCT TGACCTCTTT AGGGTAAGGA ACCTCATGGT GGCAAACGTG
GCTGGTCTCA TTTCCGGTAC GGGTATGTTC CTGATGTTCA CTGGATTGGT TTACTACCTT
CAGCTACCCA GACCCTACGG ACTAGGTTTA ACTATTATCG AGTCAGGTCT CCTCATGGCA
CCCGTTGCCC TGGTCATGAT GACCTTGGGT CCTATTGTGG GTAGAGCTAT AAATGTGATT
GGCCCGAAAC CTCTGCTCGT CGTAGGATCA TCGGTCAGCA TGTTGGGCTA CTTCCTACTG
GACACCTTTA GGTATAGCGA GTACGAGGTT CTATTTGACG TGATAGTGAC AGCTGCAGGA
TTGGTTAACC TGATTATCCC CCTAGTTAAC ATGGTCGCCT TGGCTTTACC TGAGGAGCAA
AGGGGAATAG GGATCGGAAT GAACACCTTG ATAAGGACCA TAGGAAGCGC AATCGGTCCA
GTGATATCAA CAGTGTTCAT GGACACTTAT GTCACGTGGT TGCTATATGA CGTTAATGGA
CAGTTCATTC CCGTGGCACA GGTACCTGAC TACTCGGCCT TCCACTACAT GTACATGGTA
GCAATTGCCC TTATGTTCCT GAGTTTGATA GCCTCATTGT TCACGAAAAA CTATGTTATA
AAGGCCAGAC AGGAGGTGAA AAGGGAAGTA GTTGAGGCCA AACATCCTGG ATGA
 
Protein sequence
MRTLLSLTLM LMLVNYVETM VIPALPKIED QFSTTATTVA WVTSAYLIVG AVASPIFGKL 
GDRYGKKKVY LISIGFYSLA VLMAGFSPNI YFLIFSRGVQ GIGYSTFPLA IAIITDLFPK
ERVAWAQGIL SATLAAGPAL GLLVGSYIVQ DLGWPYAFHT AFILSLILLG ISAKYIVEIP
EKTREKIDYL GATFLMLTVV PLLVYLSNGP NVGWTTLSQI ALIVVSVVAF PIFLIVERRT
SEPLMRLDLF RVRNLMVANV AGLISGTGMF LMFTGLVYYL QLPRPYGLGL TIIESGLLMA
PVALVMMTLG PIVGRAINVI GPKPLLVVGS SVSMLGYFLL DTFRYSEYEV LFDVIVTAAG
LVNLIIPLVN MVALALPEEQ RGIGIGMNTL IRTIGSAIGP VISTVFMDTY VTWLLYDVNG
QFIPVAQVPD YSAFHYMYMV AIALMFLSLI ASLFTKNYVI KARQEVKREV VEAKHPG
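
The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a block might be cleaned, translated and checked against the protein sequence is shown below. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical and not part of any database tool. The codon table is built from the standard amino acid ordering, which translation table 11 shares for elongation codons; the special handling of alternative start codons is ignored here, which is harmless for this gene because it begins with ATG.

```python
from itertools import product

# Codons in TCAG order paired with their amino acids
# (same assignments as the standard code, also used by table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block: str) -> str:
    """Remove spaces and line breaks from a formatted sequence block."""
    return "".join(block.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above.
first_line = "ATGAGAACTC TTCTCTCCCT AACCCTTATG TTAATGCTTG TGAACTACGT GGAGACCATG"
print(translate(clean(first_line)))  # MRTLLSLTLMLMLVNYVETM
```

The printed peptide matches the first twenty residues of the protein sequence listed above. Passing the full cleaned gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the reported GC content.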