Gene Amir_6047 details


Gene Information

Locus tag: Amir_6047
Symbol:
ID: 8330257
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Actinosynnema mirum DSM 43827
Kingdom: Bacteria
Replicon accession: NC_013093
Strand:
Start bp: 7098670
End bp: 7100217
Gene Length: 1548 bp
Protein Length: 515 aa
Translation table: 11
GC content: 76%
IMG OID: 644946480
Product: major facilitator superfamily MFS_1
Protein accession: YP_003103700
Protein GI: 256380040
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
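
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS, excluding the final stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(7098670, 7100217)
print(length)                  # 1548
print(protein_length(length))  # 515
```

Both results agree with the Gene Length and Protein Length fields of this record.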


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGACGGG TGACCCCACG TGACACCCGA CCCGACGGCG ACCCCCACGA CCTCCCCGCC 
ACCGCCCTCG GGGCCGCCGC CGACGCGAGC GCCGCCGCGC GCGGGTCCGC AGGACCGGGT
GCCGCCTCCG CGACCACCAC CGGGGCCTCC GAGGACGCCA CCGAGGACGC CACCGGGGCC
GCCACCGGGG CCAAGTCCGC CGCCACCACC ACCGAACCCG GTCTGTGGGC CCCCGAGAAC
CGGGGCCCCG TCACCGGCAT GGTCCTGCTG ATCACCCTGC TCGCGTTCGA GGCCATGGGC
GTCAGCACCG CGATGCCCCG CATGGTCGCC GACCTCGACG GCCAGGCGTT CTACTCGTGG
CCGTTCCTCG GGTTCCAGGC CGCCAGCGTC GTCGCGGTCG TGCTGTCCGG CCGGGTCTGC
GACCGGATCG GCCCGCGCCT GCCGCTGCTC GTCGGCCCCG CCCTGTTCGT CGTCGGGCTC
GCCGTCGCCG GGATCGCCCA GGACATGACC CTGCTGATGG CCGGTCGGGT GCTCCAGGGC
CTCGGCGCGG GCGCCCAGAT CGTGGCCGTC TACGTCCTGA TCGGCCTGGT CTACCCGGAG
CGGCTGCGGC CCGCCGTGTT CGGCGCGCTG TCCGCCGCCT GGGTGGTGCC CTCGCTCGTC
GGGCCCGCCG TCGCGGGCTG GCTCACCGAG AACCTGAGCT GGCGGTGGGT GTTCCTCGGC
CTGGTCCCGC TGGTCGCGAT CGGGTTCGCG CTGGTCCTGC CGGTGCTGCG CGCGCTGCCG
CCGCACCGGG GCGAGGAGCC CGCCCGCCGA GGGCTGCCGC TGGCCGCGTT CGGCGCGGGC
GGCGGCGTCG CCGGGCTCAG CTGGGCCGCG CAGCACCCCG GCTGGGCCAG CCTCGCGCTC
GGCGCCGCCT CGCTGGCCGT GCTGGCGCCG TCGCTGCGCG TCCTGCTGCC CAAGGGCACG
CTCACCGCCC GGCGCGGCCT GCCCGTCACG ATCCTGGCCA GGGGCCTGCT CGCGGGCACG
TTCTTCGCCG TCGAGGCGTT CATCCCGCTC ACCCTGACCA CCGTGCACGG CTACTCGGCC
ACCGCCGCGG GCATCCCGCT CACGCTCAGC GCCATCGGCT GGTCCGCCGC GTCCATGTGG
CAGTCCCGCC GCCCGGACAT CCCGCGCGAG ACCCTGGTGC GCTGGGGCTT CACCGTCAGC
GCCACAGGCA TCGCCTCCGT GACCCTCATC GCACCGAGCT GGGGGCCCGC GTGGCTGACC
TCCGTGCTGT GGGGCGTCGC CGGGCTCGGG GTCGGCATGG CCATGTCCAG CCTGAGCGTG
CTCACCCTCG CCGCGTCCAC CGACTCCGAC CGGGGCTTCA ACTCCTCGGC CCTGCAGGTG
AGCGACATGC TCGGTTCGGC CCTGCTGGTC GGCCTCGGCG GCGTCGTGCT CGCCGCCGCA
CCGGACCTGA CCACCGCCGT CATCCCCCTG GACCTGCTCA TGGCCGGTCT CGCGGTGCTC
GGTGCCGTGC TCACCGGACC GCGCTGCCGG GCTACCCTGG ACGACTGA
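
The GC content reported for this gene is the fraction of G and C bases in the gene sequence. A minimal sketch of that calculation follows, assuming the sequence is supplied as plain text in the 10-base blocks shown here; `gc_content` is a hypothetical helper name. Running it on the full gene sequence should give a value that can be compared with the GC content field of the record.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C among the A/C/G/T characters of a DNA string.

    Whitespace and any other characters (such as the spaces between
    10-base blocks) are ignored.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)


# First 10-base block of the gene sequence: 6 G + 1 C out of 10 bases.
print(gc_content("ATGGGACGGG"))  # 0.7
```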
 
Protein sequence
MGRVTPRDTR PDGDPHDLPA TALGAAADAS AAARGSAGPG AASATTTGAS EDATEDATGA 
ATGAKSAATT TEPGLWAPEN RGPVTGMVLL ITLLAFEAMG VSTAMPRMVA DLDGQAFYSW
PFLGFQAASV VAVVLSGRVC DRIGPRLPLL VGPALFVVGL AVAGIAQDMT LLMAGRVLQG
LGAGAQIVAV YVLIGLVYPE RLRPAVFGAL SAAWVVPSLV GPAVAGWLTE NLSWRWVFLG
LVPLVAIGFA LVLPVLRALP PHRGEEPARR GLPLAAFGAG GGVAGLSWAA QHPGWASLAL
GAASLAVLAP SLRVLLPKGT LTARRGLPVT ILARGLLAGT FFAVEAFIPL TLTTVHGYSA
TAAGIPLTLS AIGWSAASMW QSRRPDIPRE TLVRWGFTVS ATGIASVTLI APSWGPAWLT
SVLWGVAGLG VGMAMSSLSV LTLAASTDSD RGFNSSALQV SDMLGSALLV GLGGVVLAAA
PDLTTAVIPL DLLMAGLAVL GAVLTGPRCR ATLDD
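
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this with a hand-built codon table. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs only in the set of permitted start codons; since this gene starts with ATG, no special handling of alternative start codons is included. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Amino acid assignments are
# those of the standard code, which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace between sequence blocks is removed before translation.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence above.
print(translate("ATGGGACGGG TGACCCCACG TGACACCCGA"))  # MGRVTPRDTR
print(translate("GCTACCCTGG ACGACTGA"))               # ATLDD
```

The two fragments reproduce the first ten and the last five residues of the protein sequence listed in this record.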