Gene Acel_0434 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0434 
Symbol 
ID4485669 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp464708 
End bp466066 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content61% 
IMG OID639729201 
Productextracellular solute-binding protein 
Protein accessionYP_872194 
Protein GI117927643 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.396307 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGAGGT CCACCTTGCC CAGGAAATCC GTCGTTGCCG CGCTTGTCGT GACCGCGCTC 
GCTGTCGCGG CATGCTCCAG CGGCACAAAG AGCCAGACCA ATACCAGCAA CTCCGCGACC
TCGAGCGGCG CCGGCGGCGC GTCATCGAAC GCGAACACCG CACCGGTCAC GCTGACGCTC
TGGCACAACT ACGGCACGGA ACAGAACGCC ACAGCCACGC AGAACCTCGT CAACGCCTAC
GAAAAGCTTC ATCCCAACGT GACGATCAAA GTGGTGAGCC AACCGGCGGA CAACTACTTC
TCCCTGCTGC AGGCTGCAGC GATCTCCAAG ACCGGGCCTG ACATCGCCGT CATGTGGACC
GGCTTATTCA CCCTGCAGTA CAAGGATTTC CTCACGCCGC TCAAGGGACT CGTCCCCGAT
GCGGCGCTCA CCAAGCAGCA GGGTCTTGAA TGGATGACGG ACAATTTCAA CCCGAACGGC
AATCCCTACG TCATGCCGCT CGAGACCCAG TTCTATATCG GTTTCTATAA CAAGGCCGCA
TTCGCGAAAG CCGGCATCAC CCAGGTTCCG CAGACCTGGA ATGAGCTTTA CGCGGCATGC
GACAAGCTCA AGGCGGCCGG TTACACGCCG CTGGTGTACG GGAACGGCGG GCAGAGTCTC
GGTGCGGAGT TCTACCCGTG GTACGATGCG AGCTACATCG AGATCGGCCT TCTCCCCGTC
GATCAGTGGC GCAACCTCTA CGACGGGAAA ACGCCGTGGA ATTCTCCGGA AGAAATCGCC
GCCTTCAGCA AGTGGGCTGC GCTCCACGAC AAGGGTTGCA CCAATCCGGA CGTGCTGACC
AAGACGAACA ACATCGATGA TTTCGTCAAC GGGAAGGCGG CGATGATTAT CGACGGCACC
TGGGACACCC AGAAGTTCAC CGACGCGATG AAGGACAACG TCGCCGCGTT CGTCCCGCCG
TTCTCAGACA CACCGATCAA GGGTGTCGTC AACTACCCCG GGGACGGTTT CAGCATCATG
AGCTACTCCA AGCACAAGGC GGAGGCGGCG GACTTCCTCG CCTTCATCGC GTCGCCGGAA
GGGCAGGCGG CCATCAACGC CGCCGGTCTG ATCCCCGACA CCGAGGGTGC GACCACCTCG
AATCCGGTGA ACCAGCAGAT GCTGGACTTT GTCAGCAAGG ACGGGATGAC CCCGTACCCG
ATGCTCGACA ATGTCGTCCA GGGTGAGATC GTCGATGCGG GCAACAAGAT TCTGCCGTCC
ATTCTTGCCG GCAAGATTTC GCCGTCCGAC GGTCTCGGCC AACTGCAATC GACGTGGGCG
CACCTGAGCC CGGACAAGAG GTCCAACGTT TACCAGTGA
 
Protein sequence
MRRSTLPRKS VVAALVVTAL AVAACSSGTK SQTNTSNSAT SSGAGGASSN ANTAPVTLTL 
WHNYGTEQNA TATQNLVNAY EKLHPNVTIK VVSQPADNYF SLLQAAAISK TGPDIAVMWT
GLFTLQYKDF LTPLKGLVPD AALTKQQGLE WMTDNFNPNG NPYVMPLETQ FYIGFYNKAA
FAKAGITQVP QTWNELYAAC DKLKAAGYTP LVYGNGGQSL GAEFYPWYDA SYIEIGLLPV
DQWRNLYDGK TPWNSPEEIA AFSKWAALHD KGCTNPDVLT KTNNIDDFVN GKAAMIIDGT
WDTQKFTDAM KDNVAAFVPP FSDTPIKGVV NYPGDGFSIM SYSKHKAEAA DFLAFIASPE
GQAAINAAGL IPDTEGATTS NPVNQQMLDF VSKDGMTPYP MLDNVVQGEI VDAGNKILPS
ILAGKISPSD GLGQLQSTWA HLSPDKRSNV YQ