Gene Acel_0995 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0995 
SymboldnaE2 
ID4485938 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp1094717 
End bp1097989 
Gene Length3273 bp 
Protein Length1090 aa 
Translation table11 
GC content65% 
IMG OID639729770 
Producterror-prone DNA polymerase 
Protein accessionYP_872754 
Protein GI117928203 
COG category[L] Replication, recombination and repair 
COG ID[COG0587] DNA polymerase III, alpha subunit 
TIGRFAM ID[TIGR00594] DNA-directed DNA polymerase III (polc) 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGTGGC ACAATCCGTC CGTCTCCTGG CCGGAACTTG AACGTCGCCT CGCCGGCCGA 
CCCGCCCGTG CCGTCGAGCC GGCGGATGCG GAAGCGCCGA CATCGCGCCG GCGGCCGCCG
TACCGGCCGA AAGAATTGCC GAAACAACCG GATGAGACCG TTCGTTATGC GGAATTGCAC
TGTCATTCGA ATTTCAGTTT TCTCGATGGG GCAAGCCATC CGGAGGAACT CGTTGAGGAA
GCCGCCCGGC TGGGTCTTGA GACCCTTGCC CTCACCGATC ACGACGGATT CTACGGGGTC
GTCCGGTTTG CCGAAGCGGC TGCTGAACTC GGCGTTCGTA CCGTTTTCGG CGCGGAGTTG
TCCCTCTCGC TTGTCGAGCC GCGGGCCGGT GCCGTCGATC CGGATGCCGT TCACCTCCTC
GTCCTGGCCC GTGATCCGGA GGGATACCGG CGGCTTGCCG TTGCGATGAG TGCGGCGCAC
CTGAGAGGCG GCGGGAAAGG GCGGCCGCAG TACGACGAGA CGGAACTCGC GCATACCGCC
GGCGGGCATT GGCTGATCCT CACCGGATGC CGGAAAGGGC GGGTCCGCCG TGCCCTCGCC
GCCGAGGGTC CGGACGGCGC GGTACGGGAA ATCGACCGGC TCGTTGAGAT GTTCGGTCGG
GAGAACGTTG CGGTGGAACT CACCGACGAC GGGCAGCCCC TCGATTCAGC AGCCAACGAT
GTGCTTGCCG CGGCTGCTGT CCGCTGCGGA CTTCCGGTGG TGGCGACGAC GAACGCCCAC
TACGCCACCC CTGATCGAGG TCGATTGGCC GCCGCACTGG CCGCGGTCCG CGCTCGAAAA
AGTCTGGATG AGATAGCCGG ATGGCTGCCG GCCGCGCCCG TCCGCCATCT GCGCTCCGGA
GTCGAGATGG CTCAGCGTTT CGCCCGGTAC CCCGGGGCGG TGGCGAACGC GGCCCGGCTC
GGCGCCGCCT GCGCATTCGA TGTGCGCTTG ATTGCTCCGC GGCTTCCGGA TTTTCCGGTG
CCGCCTGGTC ACACCGAGAT GAGTTATCTC CGCGAACTGG CGATGCGCGG CGCGGCCGAG
AAATACGGAC CGCCGCATCA GCGCCCTGAT GTGTACCGCC AGCTTGATTA TGAACTTGAC
GTCATCGAGC AGCTCGGGTT TCCCGGATAT TTTCTCGTCG TCTGGGAGAT CGTTGAATTC
TGCCGGTCCC GCGGCATCCT CTGCCAAGGC CGGGGATCGG CCGCTAATTC CGTCGTCTGC
TACGTTCTCG GTATCACCAA GGCCGACCCC ATCGCCTTCC GGTTGCTCTT CGAACGATTT
CTCAGTCCGG AGCGGGACGG ACCGCCGGAT ATTGACGTCG ACATCGAAAG CGGCCGCCGG
GAGGAGGTCA TCCAATACGT CTACTCTCGA TATGGCCGGG AGCGGGCCGC GCAGGTCGCC
ACGGTCATCA CCTACCGGCC CCGATCCGCC GTCCGGGACA TGGCCCGCGC CCTCGGATAT
TCGCCGGGCC AACAGGACGC ATGGTCGAAA CAGCTTGACC GGTATGGAGA GATTCCCCGC
GGTTCGGAGG CTGACCACGA TGTCCCGGAT GACGTGGTCC ATCTCGCCCG TCAGGTGCTT
GGTTTTCCCC GTCATCTGGG CATCCATTCC GGAGGAATGG TGATCTGTGA CCGGCCGGTC
GCCGAGGTCT GTCCGGTCGA GTGGGCGCGG ATGCCCGGGC GGAGCGTGCT GCAGTGGGAC
AAAGACGATT GCGCCCGGAT CGGTGTGGTG AAATTCGACC TTCTCGGACT GGGCATGCTG
TCGGCGTTGC GGGAGTGCTT CGATCTCATC GAGACCTATC ACGGTGTGCG GCTGTCTCTG
GAGACGATTC CGCCGGAAGA CCCGGCGGTG TACGACATGC TCTGCGCAGC GGATTCGGTG
GGCGTCTTCC AAGTGGAGAG TCGAGCGCAG ATGGCGACGC TTCCGCGATT GCAACCGAGA
AATTTCTACG ACCTCGTCAC CGAGGTGGCG ATCATTCGTC CCGGTCCGAT TCAAGGCGGT
TCCGTGCACC CGTTCATCCG CCGCCGCCGG GGACGGGAAC CCGTTCGTTA CGCCCATCCG
CTGCTCCGCC ATTCGCTGGA ACGCACGTAC GGCGTCCCGC TCTTCCAAGA GCAGATCATG
CAAATGGCCA TCGATGTCGC CGGGCTCACT CCCGCCGAAG CAGACCACCT GCGTCAGGCG
ATGGGGGCTA AACGTTCCGT CGAACGCATG GAGCGTCTGA AAGCGAAACT CTACGCGGGC
ATGGCCCGTA ACGGCATTAC CGGAGCGGTG GCGGACGAAA TCTACGAGAA ACTCCAGGCG
TTTGCGCATT TCGGTTTTCC TGAGAGTCAC GCCATCAGCT TTGCGTTTCT TGTTTATGCG
AGCGCGTGGC TGAAAAAGTA CTACCCGGCG GCGTTCTGCG CGGCGCTTCT CAACGCTGCT
CCGCTTGGCT TTTACTCGCC GCAGACGGTG GTGGCTGACG CCCGTCGTCA CGGGGTGGTG
ATCCGCCGTC CCTGTGTCGC CTCCAGTGCA ACGAAAACCA TTCTGGAACC GATGGACGGC
GGCTGGGCGG TGCGATTGGG TCTTTCCTTG GTGAAAGGAG TGAGCGCCGA GACGGCGGAT
CGCATCGTTG CCCGTCGGCC GTACCGTGAC ATGGCGGATC TGGCACGCCG CTGCGAACTC
TCCGTCACGC ACCTGGAGGC GCTGGCTGCG GCAGGGGCGT GTGAGTGTTT CGGTCTCGAC
CGGCGGCAGG CCTTGTGGCT CGCCGGTTCC GCCGCCGAGA CGCGGCCCGG ACAGCTTGAG
GGTTTTGGCG TCGAGGCCAC GCCGCCGCGG CTTCCACCGA TGACGGAGAT GGAGACGACG
GTGACGGACC TGCGGTTCCT GGGAATCAGC CCGGACGTCT ACCCGACGGC ACACGTTCGG
GATCAGTTGA CCGAGATGGG TGTCGTTCCG GCGGCGCAAT TGCATCGAGT GGCGCCGGAT
TCTCGAGTTC TCGTCGGGGG AGTGGTCACC CATTGGCAAC GCCCGGCGAC GGCGCGGGGA
ACGACGTTCC TCAACGTGGA GGACGAGACG GGGATGGTCA ACGTGATCTG CTCACCGGGA
GTCTGGGTGC GGTATCGGCG GACGGCACGG ACGGCGGCCG CCCTTCTGGT ACGCGGCCGG
CTGGAACGAG CGGACGGTGT GGTCAATGTT ATTGCTGACC GGCTTGAGCC GCTTCCGCTG
GTAATTCGGC GTTCCGCTCG AGATTTCCGG TGA
 
Protein sequence
MGWHNPSVSW PELERRLAGR PARAVEPADA EAPTSRRRPP YRPKELPKQP DETVRYAELH 
CHSNFSFLDG ASHPEELVEE AARLGLETLA LTDHDGFYGV VRFAEAAAEL GVRTVFGAEL
SLSLVEPRAG AVDPDAVHLL VLARDPEGYR RLAVAMSAAH LRGGGKGRPQ YDETELAHTA
GGHWLILTGC RKGRVRRALA AEGPDGAVRE IDRLVEMFGR ENVAVELTDD GQPLDSAAND
VLAAAAVRCG LPVVATTNAH YATPDRGRLA AALAAVRARK SLDEIAGWLP AAPVRHLRSG
VEMAQRFARY PGAVANAARL GAACAFDVRL IAPRLPDFPV PPGHTEMSYL RELAMRGAAE
KYGPPHQRPD VYRQLDYELD VIEQLGFPGY FLVVWEIVEF CRSRGILCQG RGSAANSVVC
YVLGITKADP IAFRLLFERF LSPERDGPPD IDVDIESGRR EEVIQYVYSR YGRERAAQVA
TVITYRPRSA VRDMARALGY SPGQQDAWSK QLDRYGEIPR GSEADHDVPD DVVHLARQVL
GFPRHLGIHS GGMVICDRPV AEVCPVEWAR MPGRSVLQWD KDDCARIGVV KFDLLGLGML
SALRECFDLI ETYHGVRLSL ETIPPEDPAV YDMLCAADSV GVFQVESRAQ MATLPRLQPR
NFYDLVTEVA IIRPGPIQGG SVHPFIRRRR GREPVRYAHP LLRHSLERTY GVPLFQEQIM
QMAIDVAGLT PAEADHLRQA MGAKRSVERM ERLKAKLYAG MARNGITGAV ADEIYEKLQA
FAHFGFPESH AISFAFLVYA SAWLKKYYPA AFCAALLNAA PLGFYSPQTV VADARRHGVV
IRRPCVASSA TKTILEPMDG GWAVRLGLSL VKGVSAETAD RIVARRPYRD MADLARRCEL
SVTHLEALAA AGACECFGLD RRQALWLAGS AAETRPGQLE GFGVEATPPR LPPMTEMETT
VTDLRFLGIS PDVYPTAHVR DQLTEMGVVP AAQLHRVAPD SRVLVGGVVT HWQRPATARG
TTFLNVEDET GMVNVICSPG VWVRYRRTAR TAAALLVRGR LERADGVVNV IADRLEPLPL
VIRRSARDFR