Gene Amir_4704 details

Gene Information

Locus tag: Amir_4704
Symbol:
ID: 8328902
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Actinosynnema mirum DSM 43827
Kingdom: Bacteria
Replicon accession: NC_013093
Strand:
Start bp: 5610713
End bp: 5612149
Gene Length: 1437 bp
Protein Length: 478 aa
Translation table: 11
GC content: 71%
IMG OID: 644945148
Product: major facilitator superfamily MFS_1
Protein accession: YP_003102380
Protein GI: 256378720
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
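
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; the variable names are illustrative and are not part of the record.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
start_bp = 5610713
end_bp = 5612149

# An inclusive interval spans end - start + 1 bases.
gene_length_bp = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons. Assumption: the final codon is a
# stop codon, which contributes no amino acid to the protein.
codon_count, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1437
print(remainder)          # 0
print(protein_length_aa)  # 478
```

Under these assumptions the computed values agree with the Gene Length (1437 bp) and Protein Length (478 aa) fields.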


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.391324
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTCCGCGA CCCCGAACAG CCCATCCTTA GACGGGCCCG TGCGCATGAC CGCGCGCGAC 
GCCGCACTGC TCTTCGTGCT GTGCGGCTCG ATCTTCCTCG AAGGCGTCGA CATCTCCATG
CTCGGCGTCG CACTGCCCTC CATCAAGGAC GAGCTCGGCA TGTCGGCGGG CGAGCTCCAG
TGGGTCGTCA GCGCCTACGT GCTGGCCTAC GGCGGGTTCA TGCTCCTGGG CGGCCGTGCC
GCCGACCTGC TGGGCCGGCG CAAGATGTTC GTGCTGTGGC TGGCGGTCTT CGTCGTGTTC
TCCGGTCTCG GCGGCCTGGC CACCGAGGGC TGGATGCTCA TCGTCTCCCG CGCCATCACC
GGCCTCGCCG CCGCGTTCAT GACCCCCGCG GGCCTGTCGC TGATCACCAC GAGCTTCGCC
GAGGGCCCCA AGCGCAACCG GGCGCTGCTC TGGTACGCGG GCACCGCCGC CGGTGGCTTC
TCGCTGGGCC TGGTCGTCGG CGGCCTGCTG ACCGCCATCG GCTGGCGCTG GGTCTTCTTC
GCCCCGGTCA TCCTCGCCGC GATCATCTTC GGCTTCGCGG TCAAGCTGAT CGAGCACGAC
ACCCCGCCGC CGCGCGTCCC CGGCCAGAAG TACGACGTGT TCGGCGCCGC CACGATGACC
CTCGGCGCGG TCGGCGCGGT CTACACCGTC GTCATGGCGC CCGAGGTGCC CGTCAGCCGG
ACCGTGCTGA CCGCCGCGAT CAGCGCCGTG CTGCTGGTGG CGTTCGTGAT CGCGGAGAGG
CGCTCGAAGG AGCCGCTGGT GCGCCTGGGC ATCTTCCGCA GCGCCAACCT GGTGCGCGCC
AACATCGCCA CCGTGCTGTT CGCAGGCTCG TTCTTCGGCT TCCAGTTCAT CACCGTGCTC
TACATGCAGG ACCTGCGCGG CTGGGGCGCG CTGGAGACCG GCATCGCGCT GCTGGTCATC
GGCATCGACT CGGTGCTCGC GCCGACCCTC ACGCCCAAGC TGGTCGACCG GTTCGGCAAC
CCCGTCGTGC TGTTCGCGGG CTTCGTGTCG CTGGCGCTGG CCTACTTCCT GTTCCTGGAC
ATCCCGGCCG ACGCGAACTA CTGGACCGGC ATGTTCCCGA CGATGCTGCT CATCGGCATC
GCCTTCACCC TGGTCTACGG GCCGCTGACG ATCGTCGCGG TGGAGGGCAT CGCCGACGAG
GAGCAGGGCC TGGCGGGCGG CATCTGGAAC ACCTCGTTCC AGTTCGGCGC GGCGCTCGGC
CTCGCCGTGG CCGCCTCGCT GACCGCCACC TCGGGCACCG CCCTCGACGG CTACCACAAG
GCGCTGCTCG TGCCGTTCGT CGCCACGCTC GTCGCGCTCG TCGTGGTGGC CACGGGCCTG
CGCTCCCGCG CCTCCGCCGA GCCCGTCGCG GTGGAGGCCC AGCCCACCGC GGGCTGA
 
Protein sequence
MSATPNSPSL DGPVRMTARD AALLFVLCGS IFLEGVDISM LGVALPSIKD ELGMSAGELQ 
WVVSAYVLAY GGFMLLGGRA ADLLGRRKMF VLWLAVFVVF SGLGGLATEG WMLIVSRAIT
GLAAAFMTPA GLSLITTSFA EGPKRNRALL WYAGTAAGGF SLGLVVGGLL TAIGWRWVFF
APVILAAIIF GFAVKLIEHD TPPPRVPGQK YDVFGAATMT LGAVGAVYTV VMAPEVPVSR
TVLTAAISAV LLVAFVIAER RSKEPLVRLG IFRSANLVRA NIATVLFAGS FFGFQFITVL
YMQDLRGWGA LETGIALLVI GIDSVLAPTL TPKLVDRFGN PVVLFAGFVS LALAYFLFLD
IPADANYWTG MFPTMLLIGI AFTLVYGPLT IVAVEGIADE EQGLAGGIWN TSFQFGAALG
LAVAASLTAT SGTALDGYHK ALLVPFVATL VALVVVATGL RSRASAEPVA VEAQPTAG
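
The relationship between the gene sequence and the protein sequence can be illustrated with a short sketch. It assumes the codon assignments and start codons of NCBI translation table 11 (the table named in this record), under which an initiator codon such as GTG is read as methionine. The helper names `clean`, `gc_percent`, and `translate` are hypothetical, and this is not necessarily how the database derived its own values.

```python
from itertools import product

# Codon table in the conventional TCAG order; table 11 uses the same
# amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for translation table 11 (assumption stated above).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS, reading a leading start codon as M and stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_codons = "GTGTCCGCGA CCCCGAACAG CCCATCCTTA"
print(translate(first_codons))  # MSATPNSPSL
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` in the same way gives values that can be compared with the Protein Length and GC content fields.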