Gene Acel_1894 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_1894 
Symbol 
ID4486151 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp2142518 
End bp2143813 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content68% 
IMG OID639730684 
Productmajor facilitator transporter 
Protein accessionYP_873652 
Protein GI117929101 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACGACG TCGGGCGGGC TGACTCATCG CGCAACGCCC GCCGTTCTCG TCTGCCGATC 
GGCGCCCTCC CCCGGCCCGT GCTTGTTCTC TCCGTGGTCG CGTTCTGCGT CGCCGTCGGT
TTCGGTCTCG TGGTGCCTGC GTTACCGCTT TTTGCCCGGG ATTTCGGCGC AAACAAGGCA
GCGGTGGGCG CGGTCATTTC TACCTTTGCG GCGATGCGAC TGGCCGCCGC GCTCGGCGTC
GGCAAGGTCG TGGATATTCT CGGGGAACGC GTCGTCCTCG CCCTCGGCAT CGCCATCGTC
GCCGTGAGTT CCGCTGCTAC GGGTTTCTCG GAGAACTACC CCCAACTGCT GGTGCTCCGC
GGTGCCGGCG GCATCGGATC GGCGATGTTC ACGGTGAGCG CACTCGCGCT CGTGCTGCGC
GTCTCCGACG TCACCGTCCG TGGACGGGCC GCCGGTATAT TCCAGGGCGG GTTCCTGCTG
GGCGGTATTT TCGGCCCGGT TCTGGGCGGG CCACTCATCA GCTGGTCGAT TCGTGCACCT
TTTTTCTGTT ACGCGGGCAC CCTAGTGGTG GCCGGCGGCG TCGGACTCAT CGGACTTCGT
CAGGTGGAGC GGCCGCCAGG GATCTCACCG CATCGGCGGT CGACGCCCTC CAATGGCGGA
CGGCGTTCGC CCGTGACCGG TCCTGCCCCA CGAATGACGA TCGGCGCCGC CCTGCGACAG
CGGCCCTATC AAGCGGCACT CGCCGCCAAT GCGGCGATTT CGTGGGCGGC TCTCGGTGTG
CGGAATTCGC TGATCCCGCT TTTCGTGATC GAAGCCCTGC ACGCACCGGC GGTCTGGATC
GGTGTGGGAC TCACGTTGAT GGCGGCGGCC AATGCCGCGG TGCTTCTTCC GGCCGGCCGG
TCCGCGGATC GTCGGGGCAG GCGCAGCCTT CTCGTGGCGG GTTGCGCGGT CAGTGGCGTG
GCACTGGTGA TGCTCGCGGT GATGGGTCAT ATTGCCGGGT ATCTCGCGGC GATGGTGGTG
TTCGGCGTAG GTTCCGGCCT ACTGGACGTC GCTCCGGCCG CCATCGTCGG GGACATCGCG
GGCGGCCGCG GCGGGACGGT CGTCGCCGGC TATCAAATGG CCGGGGACCT TGGGTCGGTG
CTTGGGCCGG TCACCGCAGG CTGGATTGCC GACGCCGCCG GCGATCGTGC GGCGTTCTGG
ACGACCGCCG TCGTCCTGCT CGGCGCCGCG CTGCTCGGGG TTTCCGCTTC CGAAACGCGG
AAAATCTCAT CGGGTCACGC CATGGAAACA CACTGA
 
Protein sequence
MDDVGRADSS RNARRSRLPI GALPRPVLVL SVVAFCVAVG FGLVVPALPL FARDFGANKA 
AVGAVISTFA AMRLAAALGV GKVVDILGER VVLALGIAIV AVSSAATGFS ENYPQLLVLR
GAGGIGSAMF TVSALALVLR VSDVTVRGRA AGIFQGGFLL GGIFGPVLGG PLISWSIRAP
FFCYAGTLVV AGGVGLIGLR QVERPPGISP HRRSTPSNGG RRSPVTGPAP RMTIGAALRQ
RPYQAALAAN AAISWAALGV RNSLIPLFVI EALHAPAVWI GVGLTLMAAA NAAVLLPAGR
SADRRGRRSL LVAGCAVSGV ALVMLAVMGH IAGYLAAMVV FGVGSGLLDV APAAIVGDIA
GGRGGTVVAG YQMAGDLGSV LGPVTAGWIA DAAGDRAAFW TTAVVLLGAA LLGVSASETR
KISSGHAMET H