Gene Acel_0035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0035 
Symbol 
ID4484528 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp39359 
End bp40534 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content69% 
IMG OID639728795 
Productputative aminotransferase 
Protein accessionYP_871797 
Protein GI117927246 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCACCC GGGAGATATC CGACTCCGCA CCGCGACCCG GCGCTCGACC TTTCCCACGC 
GCCGACCTGC GAGGCATCCC GACGTACAAG CCCGGCCGGC GGCCGGCCAC CGGCCGGCGC
GCGTACAAGC TCTCCTCGAA CGAATCCCCC TACCCGCCGC TGCCCAGCGT GCTCGACGCG
ATCGCCCAGG CGAGCGACAC CATCCACCGG TACCCGGACC TGCTCAGCTC GGATCTGGTC
GCCGCGATCG CGCACCGGTT CGGCGTCCCG GAAAGCCACG TCGTTGTCGG ATGCGGTTCG
GTGGGGCTCG CCACGCAGAT CGTGCAGGCG TTCGCTGGAC CCGGCGACGA AGTGGCCTAC
GCCTGGCGTT CCTTCGAGGC CTACCCGATC ATCGTGCAGG TTGCCGGTGC GGTGAGCATC
CAGATACCTC TGCGCCCCGA CGGCGTACAC GATCTCCCTC GCCTGGCCGC CTCAATCACG
CCGAAGACCC GCGTCGTCTT CATCTGCAAC CCCAACAACC CCACCGGAAC CGTCGTCGGC
GCCGACGCCC TGCTTCGGTT CCTCGACGCC GTACCCGCCG GTTGCCTGGT CGTCCTCGAC
GAGGCGTACC GCGAATTCGT CACCAACCCC GACAGTCCGG ACGGCATCAC CCTCTACCGT
GACCGTCCGA ACGTCGTCGT ACTCCGCACG TTCTCCAAGG CGTACGGGCT GGCCGGCCTG
CGTGTCGGAT ATGCGATTGC CCAGCCTGAG ATCGTCGACT CCATCCGGAT CACCGACGTC
CCGTTCTCCA CCAATGCCCT TGGGCAGGCA GCTGCGCTCG CCTCGCTCCA ACCGGCCGCG
GAAGCCGAGC TCATGGCCCG GGTACAGGCC ACAGTCTCCG AGCGGGAGCG GATCGTCGCC
GCATTACGGG CCGCCGGTTG GGACATTCCC CAGCCGGAAG GAAACTTCGT CTGGCTTCCC
ACCGGCGACC GAACCGAGAG CTTCGCGGCC GCATGCGAAG CCGCGGGAGT GATCGTACGG
CCCTTCGCCG GTGAGGGAGT ACGCGTCACC ATCGGCGAAC CCGAGGCCAA CAACCTCTTC
CTGGACGTCG CCCGCGCCCA CGGTCCCGCG CCCACCGCGC CAACGGCCCA TGGGGCTGCT
CAGCCCAGCC CGTCAGGCCC AGATGAACCA GCCTGA
 
Protein sequence
MTTREISDSA PRPGARPFPR ADLRGIPTYK PGRRPATGRR AYKLSSNESP YPPLPSVLDA 
IAQASDTIHR YPDLLSSDLV AAIAHRFGVP ESHVVVGCGS VGLATQIVQA FAGPGDEVAY
AWRSFEAYPI IVQVAGAVSI QIPLRPDGVH DLPRLAASIT PKTRVVFICN PNNPTGTVVG
ADALLRFLDA VPAGCLVVLD EAYREFVTNP DSPDGITLYR DRPNVVVLRT FSKAYGLAGL
RVGYAIAQPE IVDSIRITDV PFSTNALGQA AALASLQPAA EAELMARVQA TVSERERIVA
ALRAAGWDIP QPEGNFVWLP TGDRTESFAA ACEAAGVIVR PFAGEGVRVT IGEPEANNLF
LDVARAHGPA PTAPTAHGAA QPSPSGPDEP A