Gene Acel_1107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_1107 
Symbol 
ID4485770 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp1226809 
End bp1228059 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content67% 
IMG OID639729882 
Producthypothetical protein 
Protein accessionYP_872865 
Protein GI117928314 
COG category[R] General function prediction only 
COG ID[COG4552] Predicted acetyltransferase involved in intracellular survival and related acetyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.80545 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.383897 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCTGGTG ACGTATCCCG TCCCGGCGAC ATCGTGTTCC GTCCGGTCAC GGACGAGGAG 
TTCCCCCGCT TCCTGGCGAC CGGCATGCGC GCCTTCGGCA CCGCCTGCCG AGACGAGCAC
GTCACCCGGG AGCGGCTGGT ATTCGAACTT GACCGCACCA CCGGGATGTT CGACGGTGAC
CGGCTCGTTG GCACCGGCGG GATTTTCTCC TTGCAGATGT CGGTGCCTGG CCGGGTCGTC
CCCGTGGCGG GAGTGACGTT CGTCTCCGTA GCCGCCTCGC ACCGGCGTCG CGGGCTGCTC
ACCCGGCTGA TGCGCCACCA GCTGCATGGG TTGCACGCCG ACGGTCGTGA GCCGATCGCG
GCGTTGTGGG CGTCGGAATC GTCGATTTAC CGGCGCTTTG GCTATGGGCC GGCAAGCCAG
ACGGTCTCTG TCGAGATACG CCGTGACGCC CGAGCCATCG ACGATGACCG GGTGGCGCAG
CGGGCCGATT TCCGCCTGTC ATTGCGCGAT GAGCCTGCTG AGACGGTGCT TCCGGAGATA
GCGGCGATTC ATGCCGCGCA CCTGCCGGAC GCTCCCGGTG AATTCGCGCG CGACAACCGG
TGGTGGCAAT ACCGATTGTC CGAGGCGACG GTGGAACGCC GGAGCGGTTG GACACCACTG
TCGGCAGTGG TCATTCCAGA CACCGGGTAC GCGCTCTACC GCACCAAGAG TGAATGGTCA
GACGGCGTTG CCGCCGGCGA AGTCGAGATT CTCGAAATCG TCGCAGCCGA GCCGACTGCC
CAGCTCGTGT TGTGGGATTA CCTGTTGCGC CGTGACCTCA TGGCAACGTG GCGGGCATGG
CTTGCGATCG ACGACCCGAT TCTGCTGGCG GCAGCGGATT CCCGCCGCCT CCACGCCCGC
CTTGCCGATA ATCTCTGGGT GCGTATCGTC GATGTCGACC GTGCGCTCGC CGCTCGCCGC
TACTCGGTCG ACATTGACCT GGTACTCGAG GTTGCCGACG AGTTCTGCCC GTGGAACAAC
GGCCGATGGC GTCTGATCGG CGGGCCGGAC GGAGCGTCGT GCGAGCCGAC ACGGCAATCT
GCGCACCTCG CCGTACCGGC GTGGGCGCTC GGCGCCGCTT ATCTCGGCAC GACATCGCTG
TATCAGCTTG CCCGAGGCGG TTGGGTTTCC GCGTCAAGCT CCGCGGCGCT CGCTCAGGCT
GCGTCGGCCT TCCGTTGGCC CGCCGCCGCG CACTGTTCGT TGGTTTTCTG A
 
Protein sequence
MAGDVSRPGD IVFRPVTDEE FPRFLATGMR AFGTACRDEH VTRERLVFEL DRTTGMFDGD 
RLVGTGGIFS LQMSVPGRVV PVAGVTFVSV AASHRRRGLL TRLMRHQLHG LHADGREPIA
ALWASESSIY RRFGYGPASQ TVSVEIRRDA RAIDDDRVAQ RADFRLSLRD EPAETVLPEI
AAIHAAHLPD APGEFARDNR WWQYRLSEAT VERRSGWTPL SAVVIPDTGY ALYRTKSEWS
DGVAAGEVEI LEIVAAEPTA QLVLWDYLLR RDLMATWRAW LAIDDPILLA AADSRRLHAR
LADNLWVRIV DVDRALAARR YSVDIDLVLE VADEFCPWNN GRWRLIGGPD GASCEPTRQS
AHLAVPAWAL GAAYLGTTSL YQLARGGWVS ASSSAALAQA ASAFRWPAAA HCSLVF