Gene Acel_0607 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0607 
Symbol 
ID4486389 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp643884 
End bp645632 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content64% 
IMG OID639729374 
Productmajor facilitator transporter 
Protein accessionYP_872366 
Protein GI117927815 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0958846 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.0536193 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACGCCG TGGCGGAGGC CACCTGGGCG CTGCGCAGAA CGATCGACGC CGACCATCCG 
CATTACCGCT GGGTGGCGCT GTCCAACACC ACGCTTGGCA TGTTGATGGC CACGATCAAC
TCATCGATCG TGATGATCTC CCTACCGGCC ATCTTCCGCG GCATTCACAT CGATCCGCTC
GCCCACGGCA ATGTCAGCTA TCTGCTGTGG ATGATTATGG GATTTCTCCT GGTCACGGCT
GTCCTGGTCG TGAGCGTGGG ACGCTTGGGC GACATCTACG GTCGGGTTCG GATTTACAGC
ATGGGCTTCG CCATCTTCAC GGCGGCATCC ATCGCGCTGG CCCTCGATCC GCTCACCGGC
GGCGGCGGAG CGCTCTGGTT AATCATCTGG CGGGTCGTGC AGGGCGTCGG GGCGGCCATG
CTGTTCGCGA ATTCAGCGGC CATTCTCACC GACGCTTTTC CGCCGGATCG ACGCGGCATG
GCCATGGGCA TCAACCAAGT CGCCGCCATT GCCGGATCGT TCATCGGCCT GATCGCCGGT
GGACTGCTCT CCGAGGTGGA CTGGCGCCTG GTGTTCTTCG TCTCCGTACC GTTCGGCCTC
CTCGGCACGA TCTGGTCGTA CGCGAGCCTG CGGGAGATCG GTGAACGCTC GCCCGCCCGC
ATCGACGTCT GGGGCAACCT CACGTTTGGC CTCGGTCTCA CCGCGATTCT CGTCGCCATC
ACCTACGGCA TTCAGCCGTA CGGCGGCTCG ACCATGGGCT GGACAAACCC GTGGGTCGAT
GCCGGACTCG TCGGCGGAGC CGCGCTGCTC GCGGTGTTCT GCATCATCGA AACCAAGGTT
CCCGAGCCGA TGTTCCACAT GCAGTTGTTC CGCATCAAGG CCTTCAGTGC GGGCAATCTC
GCCGGGTGGC TTGCGTCCAT TGCGCGCGGC GGCATGCAAT TCATGCTCAT CATCTGGCTG
CAGGGCATCT GGCTTCCGTT GCATGGCTAC AAGTTCGCGG ACACCCCCTT ATGGGCGGGC
ATCTACCTTC TTCCCCTGAC CATTGCGTTT CTCGCTGCGG GGCCCATTTC GGGCTATCTC
TCGGACCGGT TCGGCGCTCG AGCATTCGCC ACCGGCGGCC TGCTCATCGT CGCGGCCAGT
TTCGCCGGTC TGATGGCGCT GCCGACGAAT TTCCCGTACT GGGAATTCGC TGCTCTCCTG
GCGATAAACG GAATCGGTTC CGGGCTGTTC TCCTCGCCGA ACACGACCGC CATCATGAAC
GCCGTACCGG CCAGCACCCG TGGCGCGGCA TCCGGAATGC GCTCGACCTT CTTCAACTCC
GGGACGTCGC TGTCGATCGG CATTTTCTTC TCCTTGATGA TCCTCGGATT GGCGCACAAC
CTGCCGCACA CGCTGACCAG CGGCCTCGAA GCACACCACG TGCCTTCGAA CATCGCCACA
GCCATCGGAA ACGCCCCGCC GGTGGCAAGC CTGTTCGCCG CATTCCTCGG CTACAACCCG
ATCCAGACCT TGCTGGGTCC GCACGTGTTG TCGACCCTCC CGCCCACGGA TGCCGCGGTG
CTCACCGGCC GGACATTCTT CCCGTCGCTC ATCGCCGGGC CGTTCCACGA CGGACTGGTC
GTCGTCTTCG CCATGGCCAT TGCGATGTCG GTCATCGGCG CGGTCGCCTC GCTGTTCCGT
GGCGCCCGAT ACGTGCACGA CGAGGCGGAG CGGCGACCGA CAGCGACGCC GGCCGCCCAG
GGTGCGTGA
 
Protein sequence
MDAVAEATWA LRRTIDADHP HYRWVALSNT TLGMLMATIN SSIVMISLPA IFRGIHIDPL 
AHGNVSYLLW MIMGFLLVTA VLVVSVGRLG DIYGRVRIYS MGFAIFTAAS IALALDPLTG
GGGALWLIIW RVVQGVGAAM LFANSAAILT DAFPPDRRGM AMGINQVAAI AGSFIGLIAG
GLLSEVDWRL VFFVSVPFGL LGTIWSYASL REIGERSPAR IDVWGNLTFG LGLTAILVAI
TYGIQPYGGS TMGWTNPWVD AGLVGGAALL AVFCIIETKV PEPMFHMQLF RIKAFSAGNL
AGWLASIARG GMQFMLIIWL QGIWLPLHGY KFADTPLWAG IYLLPLTIAF LAAGPISGYL
SDRFGARAFA TGGLLIVAAS FAGLMALPTN FPYWEFAALL AINGIGSGLF SSPNTTAIMN
AVPASTRGAA SGMRSTFFNS GTSLSIGIFF SLMILGLAHN LPHTLTSGLE AHHVPSNIAT
AIGNAPPVAS LFAAFLGYNP IQTLLGPHVL STLPPTDAAV LTGRTFFPSL IAGPFHDGLV
VVFAMAIAMS VIGAVASLFR GARYVHDEAE RRPTATPAAQ GA