Gene Acel_0229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0229 
Symbol 
ID4485367 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp247492 
End bp248598 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content65% 
IMG OID639728992 
ProductDNA integrity scanning protein DisA 
Protein accessionYP_871989 
Protein GI117927438 
COG category[R] General function prediction only 
COG ID[COG1623] Predicted nucleic-acid-binding protein (contains the HHH domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.743855 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCAGCCA ACGATCGACC TGACCGCAGC GACCGGGGCG CGGATGAGCG GCTGCGGGCC 
ACGCTGGCTG CGATCGCCCC GGGGACGCAG ATGCGCGACG CGCTCGAGCG GATTCTTCGT
GGCAACACCG GTGCGCTCAT CGTCCTCGGC TACGACAAGA CCGTCGAGTC CATTTGCAGT
GGTGGTTTCA ACCTCGACGT CGAGCTCTCC GCTCCGCTGA TGCGCGAGCT CGCGAAGATG
GACGGCGCCA TCATTCTCGA CGAGAAGGCG ACCCGCATCA TCAAGGCCAA CGTTCATCTC
CAGCCGGATC CGTCAATCCC TACCAACGAG TCCGGCACCC GGCATCGGTC GGCGGAGCGG
ACCGCCCGGC AGACCGGTTT CCCGGTCATT TCCGTCAGCC AATCCATGCG GATCATTGCG
CTGTACGTGG ACGGCCGCCG GCACGTTCTC GATGAACCAA GCGCAATCTT GTCCCGGGCC
AACCAGGCGC TGGCCACCCT CGAGCGCTAC AAGCTTCGTC TCGACGAGGT TTCCGGCACG
CTCTCGGCAC TGGAAATCGA GGATTTGGTC ACCGTGCGGG ACGCGGCGGT CGTCGCCCAG
CGGCTGGAAA TGGTCCGGCG GATCGCCGAC GAAATTCAGG GGTACGTCGT CGAGTTGGGC
ACCGACGGCC GCTTGCTCAG CCTGCAACTG GATGAATTGC TCGCGGGTGT GGAGCCGGAG
CGCGACCTCA TCGTGCGGGA TTATCTGCCG GCAGCGGCCG GTAAGCGCGG CCGGAGTGTC
GATGACGTGC TCCGGGATCT CGATGCGCTC ACGCCTGAGG AACTTCTGGA CCTCGGGACC
GTCGCCCGGG TCATCGGGTG CGGCGGGACG GAGAATCTCG ACAACCCGGT GAGTCCACGC
GGTTACCGGC TGCTCGCGAA AATTCCACGG CTGCCGTGTG CGGTCATCGA GCGGTTGGTC
GAACATTTTG GCACCTTGCA GAAACTCTTG GCGGCCAGTG TCGACGACCT GCAGGCCGTC
GAGGGTGTCG GCGAGAGCCG CGCTCGCAGC GTCCGGGAGG GACTGTCCCG GCTTGCGGAA
TCGTCGATCC TCGAGCGATA CGTGTGA
 
Protein sequence
MAANDRPDRS DRGADERLRA TLAAIAPGTQ MRDALERILR GNTGALIVLG YDKTVESICS 
GGFNLDVELS APLMRELAKM DGAIILDEKA TRIIKANVHL QPDPSIPTNE SGTRHRSAER
TARQTGFPVI SVSQSMRIIA LYVDGRRHVL DEPSAILSRA NQALATLERY KLRLDEVSGT
LSALEIEDLV TVRDAAVVAQ RLEMVRRIAD EIQGYVVELG TDGRLLSLQL DELLAGVEPE
RDLIVRDYLP AAAGKRGRSV DDVLRDLDAL TPEELLDLGT VARVIGCGGT ENLDNPVSPR
GYRLLAKIPR LPCAVIERLV EHFGTLQKLL AASVDDLQAV EGVGESRARS VREGLSRLAE
SSILERYV