Gene Amir_4647 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmir_4647 
Symbol 
ID8328845 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameActinosynnema mirum DSM 43827 
KingdomBacteria 
Replicon accessionNC_013093 
Strand
Start bp5532742 
End bp5533992 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content72% 
IMG OID644945093 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003102325 
Protein GI256378665 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGGCA CTCCCAGGAT CGCGGGGGGC ATCACCGCGT TCGCGATCTT CGCCACCGCG 
GTCAGCTTCG TGATCTCGAT GGCGGGCTCG TCGATGAAGA GCACCGTGCA GGTGCTGTTC
CTGCCGATGG TCGACAGCTT CGACGTCACC AGGGGCACCC TGGCGGTGGG CACGACGCTG
TTCGCGGTGG TCACCGCGCT CGCCTCGTCG GCGGTGGGGC ACCTGGCGGA CCGGATCGGC
GCGGTGCCGG TGCTGGCGAT GGGCGCCGGG ATCGTCGGGT GCGTGCTGCT GATCTGCGGG
ACGGTCACCG ACATCCGGCT GTTCGTGCTG GCCTACGGCG TGCTCGGCGC GATCGGCTGC
ACGATGCTGT CGTTCGTGCC GCTGGGCGTG CTGGCCGACC AGCTGTTCGC GGGGCGCAAC
GCGGGCGTGC TGTACGCGGT GCTGACCAAC GGCGCGGCGG TCGGCTTCAT GGTGCTGGTG
CCGCTGTGGA CGTACCTGGG CGGCATCACC GACTGGCGGC AGATCCTGCT GGGCGCGGGC
GCGGTGTTCC TGGTGGTGCT GCTGCCGCTG TCGCTGCTGC TGGTGCGCTC CTCCACCCGC
CAGCCCAAGC CGCCCGCCGC GCCCGCCGAG CACGGGTTCC TGGCCGGGGT GCGCACCGCG
TTCGCCGACC GGCGGGTGCG CGGGCTGATC CTGCCGTTCT TCGCCTGCGG CACCACGATG
GCGTTCGTCG ACGTGCACCT GTTCCCGCAC ATGCACGACC ACGGCGTGGC CCCGGTGACC
AGCTCGGTGG CGTTCGTGCT GCTGGGCGCG ACCGAGATCG CCGGGTCGCT GGTGGCGGGC
AGGCTGTGCG ACCGGGGCCG GATCAGGGCC ACGCTGGTCG GCGGCTACCT GATGCGCGCG
GGCGCGATGG TGCTGACCCC GTTCTTCTCC GCCGAGTTCA CCGTCCTGGT GTTCGGCGCG
GTGTTCGGGG CGAGCTACCT GGTGACCGTG GTGGCCACCA CGATGTGGAT CGCGAAGATC
CTGCCGCGCG GGCGCAAGGG CACCGCGATC GGCGTGCTGT GGGCGCTGCA CATGGTGGCG
GTGGCGGTGA GCAGCCAGCT GGGCGCGGTG ATCGCGGACC GGTTCCACAG CTACCTGCCG
GTGATCCTGC TCAGCGCGGT CATGACGGTC GGCGCGGCCC TGCTGGTGTC GCTGCAGCCC
GACCCGGACG CGGTCGGGCC CGAGGTGAGC CGGACGCCCG CCGCGGCGTG A
 
Protein sequence
MSGTPRIAGG ITAFAIFATA VSFVISMAGS SMKSTVQVLF LPMVDSFDVT RGTLAVGTTL 
FAVVTALASS AVGHLADRIG AVPVLAMGAG IVGCVLLICG TVTDIRLFVL AYGVLGAIGC
TMLSFVPLGV LADQLFAGRN AGVLYAVLTN GAAVGFMVLV PLWTYLGGIT DWRQILLGAG
AVFLVVLLPL SLLLVRSSTR QPKPPAAPAE HGFLAGVRTA FADRRVRGLI LPFFACGTTM
AFVDVHLFPH MHDHGVAPVT SSVAFVLLGA TEIAGSLVAG RLCDRGRIRA TLVGGYLMRA
GAMVLTPFFS AEFTVLVFGA VFGASYLVTV VATTMWIAKI LPRGRKGTAI GVLWALHMVA
VAVSSQLGAV IADRFHSYLP VILLSAVMTV GAALLVSLQP DPDAVGPEVS RTPAAA