Gene Acel_2081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_2081 
Symbol 
ID4485165 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp2356000 
End bp2357586 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content66% 
IMG OID639730881 
Productmajor facilitator transporter 
Protein accessionYP_873839 
Protein GI117929288 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCTC CGACAACATC CCGCTGGTGG ACCCTGCTCG TCGTCGGCAT TGGCACCTTC 
ATGCTGATGC TCGACATCAG CGTGGCGAGT ATTGCGCTGC CGCAGATCCG CGCGTCATTG
CACGCGAGCT TCGCCGAGCT GCAGTGGGTT TTTGACGCGT ATGCCCTGAC ACTCGCCGCC
TTTCTCGTGA CCGCCGGCTC AATCGCCGAC CGGCGGGGGC GCCGCGGTCT CTTTTTTCTC
GGCCTTCTCG TGTTCACCGC CGCCTCGCTG AGCTGCGGTC TCGCCCCCAA TGCGGTGGCG
CTCAACGTGA GCCGCGGCTG TCAGGGCGTG GGGGCTGCAA TTCTCTTTGC CGTGGGTCCG
GCGCTGCTCG GCCAGGCGTT TCACGGCAAG GAGCGGGCGA TGGCATTCGG TGTCTTCGGC
GCCGTCACGG GCATCGCTGT CGCGTCCGGG CCGCTCATCG GCGGTGGTCT CACCTCGGGC
GTGAGTTGGC GGTGGATCTT TTTGCTCAAC GTCCCGCTCG GTGTGTCGGC TGCGGTCATC
ACCCGGCTGC GCGTACAAGA GTCCCGGGAC CCACGCGCTC GGGGAGCGGA TTGGGCAGGC
ATGCTGACCT TCACCGTCGC CCTCGCTGCC ATCGTCTACG CACTCATCCG CGGCAATGAG
ATCGGCTGGA CGAGTCCGGA GATCCTCGCA ATGTATGGCA TTTCGGCTGT CATGCTTGTT
GCCTTCGTTG TGACGGAGCG GCGGCTCGGC GAGCGGGCGA TGTTTCCGCT TTCTTTCTTT
AGGAACGTCA TATTTGTCGG CATCTCGCTT GTCGCTCTGA TTGCCAACGG CTCGGCACTG
CCCGCGATTT TCCTCGAGAC GAATTACGTG GAGAACATCA TGCACCTGTC GGCGTTCAGC
ACGGGCGTTC GCTTTCTTCC GCTGACCCTG GCGCTGTTCG TGTTCGGCGC CGTAGCGGGC
GCACTGACTG GTCGGGTGCC GTTCCGTCTT CTCATGGGCG CCTCCTGTGT CGCCCTCGGC
ATCGGCCTGC TGCTCGCCCG GACGACGACC GCTGATTCAC GGTGGACGGC GCTTGTCCCG
AGCATGATTG GAATGGGCGT TGGAATGGGC ATTTTCAACC CGACGCGCGC CGCACTCGCG
ATCGGGGTCG CCGAACCGCG GGACGCCGGC GTCGCCTCCG GTATCAATGA GACCTTCCAG
CAGGTGGGCA TTGCGGTCGG CATCGCCGGC ATCGGCGCGC TTTTCCAACA CCGCGTCGTG
TCGCTGTTCG CGGATTCGCA AGCGGGACAC CTGCTCGGCG GCCAGGCTGC CTCGGGGGCG
GCCCGGGGCA TCAGCGCCGG TTCCCTGGAC TCCGTCGCCG CCGCGTTCAG TGGGCTGCGC
GACATGGTCC TCCGTGACGG CCGAGCCGCC TTTGTGGCCG CATTTCACGA CGCCATGCTG
GCCTGCGCGA CGTGCGCTCT TGCCGCAGCC GCTCTCGCCG CCCTGCTCCT GCGTACCAAG
GACCTCCACG CCTCCGCGCT CTCGCTTGTG CCGCCGGAGA CCGAGACGGA CGCCGCGGAG
CGAACGGCGG TCGCTGCCCG CTCCTAG
 
Protein sequence
MTAPTTSRWW TLLVVGIGTF MLMLDISVAS IALPQIRASL HASFAELQWV FDAYALTLAA 
FLVTAGSIAD RRGRRGLFFL GLLVFTAASL SCGLAPNAVA LNVSRGCQGV GAAILFAVGP
ALLGQAFHGK ERAMAFGVFG AVTGIAVASG PLIGGGLTSG VSWRWIFLLN VPLGVSAAVI
TRLRVQESRD PRARGADWAG MLTFTVALAA IVYALIRGNE IGWTSPEILA MYGISAVMLV
AFVVTERRLG ERAMFPLSFF RNVIFVGISL VALIANGSAL PAIFLETNYV ENIMHLSAFS
TGVRFLPLTL ALFVFGAVAG ALTGRVPFRL LMGASCVALG IGLLLARTTT ADSRWTALVP
SMIGMGVGMG IFNPTRAALA IGVAEPRDAG VASGINETFQ QVGIAVGIAG IGALFQHRVV
SLFADSQAGH LLGGQAASGA ARGISAGSLD SVAAAFSGLR DMVLRDGRAA FVAAFHDAML
ACATCALAAA ALAALLLRTK DLHASALSLV PPETETDAAE RTAVAARS