Gene Acel_0072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0072 
Symbol 
ID4484664 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp76150 
End bp77826 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content66% 
IMG OID639728834 
Productbeta-N-acetylhexosaminidase 
Protein accessionYP_871834 
Protein GI117927283 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3525] N-acetyl-beta-hexosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGATCAA AGCTGGCATG GTTCCGGATG GGGCTCTGCG CAGTCGGCGT CGCGGTGACC 
GGGTGCACGC CGCAACCGGT GACCAACCCT GATCACACGC TCCCGCCGGC TTCGGCCGCG
CCGAGCTCGC CCGCGGTTAC CGCGTCGCCC CTGCCGTGGC CCGGTCCCGC GGTGAGCGGC
GGGGCAATCA CCAAAGGGCT TGTGCCTCTG CCGGCGCTCG CGGAGAGCGA CCCGGCGGAG
ACGTTTCAGC TCAGCCCAGC AACCCGGATC TGGATCGGCC CAGACCACAC CCTCGAGCCG
ATAGCTGACG ACCTCGCCGC CGCACTCCGC CCAGCCACCG GTTTTTCGCT TCCGATTGAC
ACCGCGCCGA CCGCGCCGGC GAACGCGTTC CTGCTGGCGC TTGACAACAC CGAACCGCAG
CTCGGCACGG AAGGCTATGA CCTCTCCATC ACCCGCGACG CGGTACGCCT CGTCGCCCGT
ACCCCGGAGG GGCTCTTCCA CGGCATCCAA ACCATTCGCC AACTGCTGCC CGCACGCATT
GAGGCACGCA CGCCGCAGCC CGGTCCGTGG CACATGATCG GCGGCCGCAT CGTCGACTAC
CCCAGATTCG CGTATCGCGG CGCGATGCTT GATGTGGCCA GACATTTCTT CCCGGTTGCA
GACGTCGAGC GGTACATCGA CGAACTGGCT TTGTACAAGG TCAATGTGCT CCACCTGCAC
CTCAGCGACG ACCAGGGATG GCGGATCGCC ATTGACAGCT GGCCGAAACT TGCTCCAGTC
GGTGGAAAGA CCGAAGTCGG CGGCGGGCCT GGCGGTTATT ACACCCAGGC GGACTACCGC
GCGATCGTCG CCTACGCGCA AGCGCATTTC ATAACGGTGG TCCCGGAAAT TGAGACACCC
GGCCACGTCA ACGCCGCCCT CGTCGCATAT CCGCAATTGG CTTGCTCCGG TAAGCCGATC
CGCCCCTACA CCGGCACCGG CGTCGGTTTC AGTTCGCTCT GCATCAGCAA CCCGACGGTG
TATCAATTCG TCGACGATGT GGTCGGCGAG CTGGCCGCAC TCACCCCCGG ACCGTACATC
CACCTCGGCG GTGATGAAGC GATGAGCACA CCGCCGTCCG AGTATGCGGC GTTTGTGCAG
AAAGCTCAGG CGATCGTTGA GGCGCATGGC AAGACTCTCA TGGGTTGGGC GGAAATCGCC
AAGGGCTCAC TCGACGCATC GGCGGTAGCG GAGTACTGGA ATTTCCGCGA CGGCATGGCG
TCCGCCCGCC AAGCTCTGGC CCGTGGCATG CGCCTTGTTG CGGCACCAGC GGATCACGCC
TACCTTGACC AGAAATACAC CGCGACGAGC CGGCTCGGAC TCACGTGGGC CGGGCCGGTG
AGCGTCGCAG AAGCCTATGC CTGGGACCCA ACCATGATCG CTCCAGACGG CGATGTGCTG
GGGGTGGAGG CGCCGTTGTG GAGCGAGACC ATCCGGACGA TGGCCGATAT CGAGTACCTC
GCCTGGCCGC GGATGGCCGG CATTGCGGAA ATCGGTTGGA CGCCGCAATC CGAGCGATCC
TGGCAGGAGT ACCGGCTCCG GTTGGCGGCC CAGGGTCCGC GCTGGCAGGA GCTGGGCGTG
AACTTCTACC CCTCGCCGGA AGTGCCCTGG CCTACGGCGT CGACGAACGC GTCGTAA
 
Protein sequence
MRSKLAWFRM GLCAVGVAVT GCTPQPVTNP DHTLPPASAA PSSPAVTASP LPWPGPAVSG 
GAITKGLVPL PALAESDPAE TFQLSPATRI WIGPDHTLEP IADDLAAALR PATGFSLPID
TAPTAPANAF LLALDNTEPQ LGTEGYDLSI TRDAVRLVAR TPEGLFHGIQ TIRQLLPARI
EARTPQPGPW HMIGGRIVDY PRFAYRGAML DVARHFFPVA DVERYIDELA LYKVNVLHLH
LSDDQGWRIA IDSWPKLAPV GGKTEVGGGP GGYYTQADYR AIVAYAQAHF ITVVPEIETP
GHVNAALVAY PQLACSGKPI RPYTGTGVGF SSLCISNPTV YQFVDDVVGE LAALTPGPYI
HLGGDEAMST PPSEYAAFVQ KAQAIVEAHG KTLMGWAEIA KGSLDASAVA EYWNFRDGMA
SARQALARGM RLVAAPADHA YLDQKYTATS RLGLTWAGPV SVAEAYAWDP TMIAPDGDVL
GVEAPLWSET IRTMADIEYL AWPRMAGIAE IGWTPQSERS WQEYRLRLAA QGPRWQELGV
NFYPSPEVPW PTASTNAS