Gene Acel_1972 details


Gene Information

Locus tag: Acel_1972
Symbol:
ID: 4485100
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidothermus cellulolyticus 11B
Kingdom: Bacteria
Replicon accession: NC_008578
Strand:
Start bp: 2244225
End bp: 2246921
Gene Length: 2697 bp
Protein Length: 898 aa
Translation table: 11
GC content: 69%
IMG OID: 639730765
Product: DNA topoisomerase I
Protein accession: YP_873730
Protein GI: 117929179
COG category: [L] Replication, recombination and repair
COG ID: [COG0550] Topoisomerase IA
TIGRFAM ID: [TIGR01051] DNA topoisomerase I, bacterial
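
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that does not contribute a residue. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2244225
end_bp = 2246921

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not counted in the protein.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 2697 0 898
```

The results match the listed Gene Length (2697 bp) and Protein Length (898 aa).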


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCAACGA CAGCAGCGCG GAGCGGACGA CGCTCAGGCA CGCCGGCCGA CGGCGCGACA 
CGGGCGGCGT CCGGGACTCG CCGAGGGCCG GCCTCGCAGG GCGGCACCCG CCTGGTGATC
GTCGAGTCGC CGGCCAAGGC GCGGACGATC GCCGGCTTCC TCGGACCCGG CTACGTCGTG
GAATCCAGCG TCGGCCACAT CCGGGACTTG CCGGCGTCCG CGGCGGAGAT TCCCGAGAAA
TATAAGAAAG AGCCCTGGGC CCGGCTGGGG GTCAACGTCG ACCGGAATTT CGAACCGCTG
TACGTCGTCG CCGCTGACAA GCGCAAACAG GTCACCAAGC TGAAGCAGGC GCTGGCCCAG
GCCGAGGAGC TCTATCTCGC AACAGATGAG GATCGCGAAG GAGAGGCCAT TGCCTGGCAC
CTCGTCGAGG TGCTGAAGCC CCGCGTGCCG GTCCGCCGGA TGGTCTTCCA CGAGATCACC
CGCCAGGCGA TCGAGGAGGC GGCCCGCAAT CCACGGGAAA TTGACCAAGC GCTGGTGGAC
GCACAAGAGA CCCGCCGCAT CCTCGACCGG CTGTACGGCT ACGAGATCAG CCCGGTGTTG
TGGAAGAAGG TCATGCCGGC GCTCTCCGCC GGCCGGGTGC AGAGCGTCGC CACCCGCATG
GTCGTCGAGC GGGAGCGGGA GCGGATCGCG TTCCGGCCGG CTGCGTACTG GGATGTCATC
GCCGAGATTG CGACCACCCC GTCGTTCTCC GCCACCCTCG TGGCGATTGA CGGCAAGCGG
GTGGCGGTCG GCCGGGACTT CGACGCCCAG GGCCGGTTGA AGAACCCGGA CGTCGTCCAT
CTCGGGGAAG CCGACGCGCA GCGGCTGGTG CGCGGCTTGG ACGGCGCGGA GTTCGTCGTC
CGGAGCGTTG AGCGCAAGCC GTACCGGCGC GCGCCGGCTG CACCGTTCAT CACCAGCACC
CTCCAGCAGG AGGCGAGCCG CAAACTCGGC CTCAGCTCGC AGGCCACGAT GCGGATTGCG
CAGCGGCTGT ACGAAGCCGG CTACATCACC TACATGCGGA CCGACTCCAC CACGCTCTCC
GAGACGGCGC TTGCTGCGGC CCGAGCGCAG ATCGCCGAGT TGTACGGTGA CCAGCTGCTC
CCGGCTGCGC CGCGGCGCTA CGAGAAGAAG GTCAAGAACG CCCAGGAGGC GCATGAGGCG
ATCCGCCCGG CGGGCGACCG GTTCCGGACG CCGGCTGACC TGGCCGGGGA GCTCTCCGGT
GACGAATTGC GGCTCTACGA ATTGATCTGG CAGCGCACCC TAGCCTCCCA GATGATCGAT
GCCACCGGAT ACACGGTGAG CATCCGGATC GCCGCCGTGA CGGACGCCGG CCGGGAGGCG
GTATTCGCCG CCACCGGAAC GGTGATCACG CAACCCGGGT TCCTCCGGGT GTACGTGGAG
AGCAGCGACG AGGAGGACGA CTCCGGCTCG GGCCGCACCC TCCCCGACCT CACCGAGGGT
GAGACGGTGC CCGTCGCGGC GCTCACTCCG GCGGAGCACA CGACGACACC GCCGCCGCGG
TACACCGAGG CCAGCCTCAT CCGGGCGTTG GAGGAGCGCG GCATCGGTCG GCCGTCCACC
TTCGCCAGCA TCGTGAGCAC CATCATCGAG CGCGATTACG TCTTCAAACG CGGTCAGGCG
TTGGTGCCGA CGTTCCTCGC CTTTGCCGTG GTGCGGCTGT TGGAGAAGCA CTTCGGCAAC
CTCGTCGACT ACGACTTCAC CGCCGAAATG GAGGAGAACC TTGACCGGAT CGCCAACGGC
GAGGCGCACC GCCTGGAGTG GTTGACGCGG TTCTACTTCG GTTCACCGAC CGACGGCGGC
AGTGCCGCCG GGCTCAAGCA GCTGGTCACC GAACGGTTGG CGGAGATCGA CGCCCGGGAC
ATCAGCACCT TCCCGCTCGG TGAGTCCGGG ATCGTCGTCC GGGTCGGCCG TTACGGCACG
TACGTCGAAC GGGACGGGCA GCGGGCGTCC CTTCCGCCGG GCACGGCACC CGACGAGGTC
ACGGTCGAGT TCGCCACCCG GTTGCTGGAA CAGCCGACCG GCGAGCGGGA ACTTGGCGTC
GACCCGGCGA CGGGTCACAC CATCGTCGCT CGGGTCGGCC GGTATGGGCC GTACGTCACC
GAGGTCCTGC CCGAGGGCGC CAAGGGCAAG CCGCGGACGG CGTCCCTGCT CTCCTCGATG
ACCGTTGAAT CGATCACCCT GGACGATGCG CTGAAACTGC TCAGCCTGCC GCGGGTGCTT
GGCGAGGTCG ACGGCGAGCA GGTGATCGTG AGCAACGGGC GGTTCGGTCC GTTCGTGAAG
AAGGGCGCCG AGACCCGGTC GCTGGGGTCA GAGGAGGAGC TCTTCACCCT CACCCTTGAC
GAGGCGCTGC AGCTGCTGGC GCAGCCGAAG CAGCGCGGCC GTCGTCTGCC GGCCGCGGCC
GCGCAGGTCC GGGAGCTGGG GGTGGACCCC GCGACCGGAC GGACCGTCAC GGCGCGGACC
GGGCGGTATG GTCCCTACGT GACGGACGGC GAGACCAACG CAACGCTCCG CCATGGTGAC
ACCCTGGAGA ACGTGACCCT GGAACGTGCC AGCGAGTTGC TCGCGGATCG GCGCAGTCGT
GGACCCACGA CGCCGCGTCC ACGACGGGGG CGCCGGGCCG CCGTCCGTTC GGAGTAG
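
The gene sequence is laid out in space-separated blocks of ten bases, so the whitespace has to be removed before computing anything on it. A minimal sketch follows, with hypothetical helper names `clean_sequence` and `gc_fraction`. Only the first line of the sequence is used as sample input, so the printed GC fraction describes that line alone and not the whole gene.

```python
def clean_sequence(block_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, copied verbatim (sample input only).
first_line = "GTGGCAACGA CAGCAGCGCG GAGCGGACGA CGCTCAGGCA CGCCGGCCGA CGGCGCGACA "

seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing all of the sequence lines through `clean_sequence` should give a string of 2697 bases if the listing is complete, which can then be compared against the Gene Length and GC content fields.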
 
Protein sequence
MATTAARSGR RSGTPADGAT RAASGTRRGP ASQGGTRLVI VESPAKARTI AGFLGPGYVV 
ESSVGHIRDL PASAAEIPEK YKKEPWARLG VNVDRNFEPL YVVAADKRKQ VTKLKQALAQ
AEELYLATDE DREGEAIAWH LVEVLKPRVP VRRMVFHEIT RQAIEEAARN PREIDQALVD
AQETRRILDR LYGYEISPVL WKKVMPALSA GRVQSVATRM VVERERERIA FRPAAYWDVI
AEIATTPSFS ATLVAIDGKR VAVGRDFDAQ GRLKNPDVVH LGEADAQRLV RGLDGAEFVV
RSVERKPYRR APAAPFITST LQQEASRKLG LSSQATMRIA QRLYEAGYIT YMRTDSTTLS
ETALAAARAQ IAELYGDQLL PAAPRRYEKK VKNAQEAHEA IRPAGDRFRT PADLAGELSG
DELRLYELIW QRTLASQMID ATGYTVSIRI AAVTDAGREA VFAATGTVIT QPGFLRVYVE
SSDEEDDSGS GRTLPDLTEG ETVPVAALTP AEHTTTPPPR YTEASLIRAL EERGIGRPST
FASIVSTIIE RDYVFKRGQA LVPTFLAFAV VRLLEKHFGN LVDYDFTAEM EENLDRIANG
EAHRLEWLTR FYFGSPTDGG SAAGLKQLVT ERLAEIDARD ISTFPLGESG IVVRVGRYGT
YVERDGQRAS LPPGTAPDEV TVEFATRLLE QPTGERELGV DPATGHTIVA RVGRYGPYVT
EVLPEGAKGK PRTASLLSSM TVESITLDDA LKLLSLPRVL GEVDGEQVIV SNGRFGPFVK
KGAETRSLGS EEELFTLTLD EALQLLAQPK QRGRRLPAAA AQVRELGVDP ATGRTVTART
GRYGPYVTDG ETNATLRHGD TLENVTLERA SELLADRRSR GPTTPRPRRG RRAAVRSE
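
The gene begins with GTG while the protein begins with M. Under translation table 11, GTG can serve as a start codon, and an initiating codon is read as methionine rather than valine. The sketch below illustrates this on the first five codons only. `PARTIAL_CODON_TABLE` is deliberately incomplete and covers just the codons needed here, `START_CODONS` lists only a subset of the table 11 initiators, and `translate_prefix` is a hypothetical helper name.

```python
# Partial codon table: only the codons that occur in the first 15 bases.
PARTIAL_CODON_TABLE = {"GTG": "V", "GCA": "A", "ACG": "T", "ACA": "T"}

# Subset of the start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_prefix(dna: str) -> str:
    """Translate a short CDS prefix, reading a start codon at position 0 as M."""
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    residues = []
    for index, codon in enumerate(codons):
        if index == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
        else:
            residues.append(PARTIAL_CODON_TABLE[codon])  # KeyError if not covered
    return "".join(residues)


prefix = "GTGGCAACGACAGCA"  # first 15 bases of the gene sequence above
print(translate_prefix(prefix))  # MATTA
```

The output agrees with the first five residues of the protein sequence listed above. A complete translation would need the full 64-codon table rather than this partial one.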