Gene Acel_0221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0221 
Symbol 
ID4486079 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp236387 
End bp237394 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content74% 
IMG OID639728984 
ProductHhH-GPD family protein 
Protein accessionYP_871981 
Protein GI117927430 
COG category[L] Replication, recombination and repair 
COG ID[COG1194] A/G-specific DNA glycosylase 
TIGRFAM ID[TIGR01084] A/G-specific adenine glycosylase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.230318 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.112611 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGAAGGA ACACTCCCGC CCAGACCCGG CCGGGCGCCG CTTCGACCCG GCTCATCACG 
GGGAGCGCGG GTCCAGACCC ACGAGGCAGG CGTGCGGCTT TATCGGATCT GACGCAGGCT
CCGGCGCAGC GCGTGGTCCA CCGGGTGCTC GCGTGGTACC GCCGGCACGG CCGCCGCGAC
CTGCCATGGC GTCGGTCGGA CGTGACGCCC TGGCAGGTCC TCGTGAGCGA GGTGATGCTG
CAGCAAACGC CGGTCAGCCG CGTACTGCCG GTTTACGCGG TGTGGACGGC CCGGTGGCCG
ACGCCGCAAT CGCTGGCCGC CGCCACTCCA GCGGACGCCG TGCGGGCGTG GGGCCGGCTC
GGTTATCCGC GACGCGCGCT CTGGCTGCAC CAAGCGGCAC GTGCGATCGT GGATCGGTTC
GGTGGGATCG TCCCCGATGA GCCCGGCGTC CTCGCCACGC TCCCGGGAAT CGGCCGTTAC
ACCGCGGCGG CGGTCGCCGC GTTCGCCTAC CGCCGCCGCG TCGCCGTCCT CGACACGAAT
GTGCGGCGCG TCCTCGCTCG GTTCCTCACC GGCGTGCCAC ACCCAACCGG CACTCCCCGG
GCCGCCGAGC ACCGAAGCCT CGACGCGTTG CTGCCCAAGA ATGCCGACCG CGCCGCGCAG
TTCTCCGTGG CGCTCATGGA ACTCGGCGCG CTCATCTGCA CCAGCCGCAG CCCCGGCTGT
GCCCGCTGTC CCCTCACCAC GGACTGCGCG TGGCACCGGG CCGGTCGGCC AGCCGGCACA
CGCCGACCCC GGGCGCCGTA CACCGGCAGC GATCGGCAGG CCCGTGGCGC GCTCCTGGCG
GCGCTTCGGG AATACCCGCA TCCCGTCAGC ACCGCGGACC TGGCCCGCGC CTGGCCGGAC
CCAACCCAGC GCAGGCGGGC GCTGGCCAGC CTTGTGTCCG ATGGGCTCGT GTCCTGCGCA
CCGGGAGGCC GCTACCAGCT CGGGCCGGCA ACAGCCGATG CGACGTAG
 
Protein sequence
MRRNTPAQTR PGAASTRLIT GSAGPDPRGR RAALSDLTQA PAQRVVHRVL AWYRRHGRRD 
LPWRRSDVTP WQVLVSEVML QQTPVSRVLP VYAVWTARWP TPQSLAAATP ADAVRAWGRL
GYPRRALWLH QAARAIVDRF GGIVPDEPGV LATLPGIGRY TAAAVAAFAY RRRVAVLDTN
VRRVLARFLT GVPHPTGTPR AAEHRSLDAL LPKNADRAAQ FSVALMELGA LICTSRSPGC
ARCPLTTDCA WHRAGRPAGT RRPRAPYTGS DRQARGALLA ALREYPHPVS TADLARAWPD
PTQRRRALAS LVSDGLVSCA PGGRYQLGPA TADAT