Gene Athe_2181 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2181 
Symbol 
ID7408374 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2308857 
End bp2310938 
Gene Length2082 bp 
Protein Length693 aa 
Translation table11 
GC content39% 
IMG OID643716546 
ProductDNA topoisomerase I 
Protein accessionYP_002574029 
Protein GI222530147 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial
[TIGR01057] DNA topoisomerase I, archaeal 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0024842 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAAAAAC TTGTCATTGT AGAGTCACCT GCAAAGGCAA AAACAATTGC AAAGTATCTT 
GGTAAAGAGT TTAAAGTAGA AGCTTCAATG GGGCATGTAA GGGACCTTCC CAAGAGTGAT
TTGGGCGTTG ATATAGAAAA TGGTTTTGTC CCTAAGTATA TAAACATCAG AGGTAAGGCA
GATGTAATAA ACAAGCTAAA AAAATATGCA CAAGAAGCAG AGAAGGTATA CCTTGCAACA
GACCCCGACA GGGAAGGCGA GGCAATCTCA TGGCATTTAG CAACTATTTT AGGGCTTGAT
ACAAACGATA ATGTGAGAAT TACATTCAAT GAGATAACAA AAAAGGCTGT ACAGGAATCT
TTGAAAAATG CAAGGCCAAT TGACCAGAAC TTAGTTAATG CCCAGCAAGC CAGAAGAATT
TTAGACAGAC TTGTCGGCTA CAAGCTAAGT CCATTTTTGT GGGAAAAGGT CAAGGGTGGA
CTTTCTGCAG GAAGAGTTCA GTCTGTTGCA ACAAGGCTTG TGGTTGAAAG AGAAGAAGAG
ATAGAAAATT TTAAGCCTGA AGAGTACTGG ACCTTAGAAG CTGTATTTAA AAAAGATGTC
CAAGAGTTTA AGGCAAAGTT CTATGGAGAT AAGAAAGGGA AGATAGAGCT AAAAAATCAA
GATCACGTTC AAAAAATTGA AGAAAAGATA AAAAATAAAG AATTCAAGGT TGTAAAGATA
AAGGTGTCAG AGAAGAAGAA AAATCCGCCC CCACCTTTTA TAACAAGCAC ACTTCAGCAG
GAGGCATCAA GAAAACTGAG ATTTACTCCT GCAAAGACAA TGGCAGTTGC GCAGATGCTG
TATGAAGGTG TTGAGATAAA AGGTGAGGGA AGTGTTGGAC TTATAACATA TATGAGAACA
GATTCAACAA GGGTTTCTGA AGAGGCACAG CAGGCAGCAA GAAGTCTTAT CGTACAGAAG
TTTGGCAAAG AATATCTTCC TGAAAAGCCG AGGGTTTACA AGACAAAAAA AGATGCGCAG
GACGCTCATG AGGCTATAAG ACCTACTTAT TTGGATATGG ACCCTGAGAG TATAAAAGAT
TCTCTGACTC TTGATCAGTA CAAGCTGTAC AAACTCATTT ATGACAGATT TTTAGCGTCG
CAGATGGAAA GCAGCGTATA TGAGGTTCTT TCAGCCGAGC TTGAAGTTGA GGGTTATATT
TTTAAACTCA CAGGTTCAAA GCTCAAGTTT GCAGGGTTTA TGGAAGTATA TGTTGAAGGT
AAGGATACAG AAGATGAAGA GGAGGAAAAT CAGCTTCCAG AAATTAGAGA AGGAGAGGCT
TTAAAGCCCA TAAAACTTGA GAGCAAACAG CATTTTACTC AACCGCCTTC TCGCTATACT
GAAGCAACCT TAATAAAGGC TTTAGAAGAA AATGGGATAG GAAGACCCAG CACATACGCT
CCAACAATCC AGACAATTCT GGAGAGAGGA TATGTTGCCA AAGAAGATAG GTTTTTAAAA
CCAACCGAAT TGGGCAGAAT TGTAACAAAT ATACTTAAAG AATATTTCAA AGACATAATA
GACATTGAAT TTACTGCAGA GCTTGAGAGC AACCTTGACA AAATTGAGGA AGGAAAACTT
GAGTGGACAG AGGTGGTAAA AAAATACTAC CAGCCACTTG AAAAAGAACT TGAGATAGCA
CGAGCTACTT TGCTAAAGGT TAAGGTTGAG GATGAGGAGA CAGACATTGT ATGCGAAAAC
TGTGGAAGAA AAATGGTGAT AAAAAAAGGT AGATACGGAA AGTTCTTGGC ATGTCCAGGA
TATCCTGAAT GCAAAAACAC AAAACCTTAT TACGATTACC TTGATGTGTT GTGTCCAAAG
TGCGGCAAGA GGATAATAGA AAAGAAGTCC AAGAAGGGCA AGAGATATTA CACGTGCGAG
GGGTATCCTG ACTGTGACCT AATTTTGTGG GAAAAACCAG TCAAAAACTG TCCGAAGTGT
GGCAGTCTCA TGTTTGAAAA GGGCAAGAAA GGGAATAAAA AGCTTGTATG TTCAAATGAA
AACTGTGCTT ACCAAGAAAA AACGGGGGAA AAAGGTGAGT AA
 
Protein sequence
MKKLVIVESP AKAKTIAKYL GKEFKVEASM GHVRDLPKSD LGVDIENGFV PKYINIRGKA 
DVINKLKKYA QEAEKVYLAT DPDREGEAIS WHLATILGLD TNDNVRITFN EITKKAVQES
LKNARPIDQN LVNAQQARRI LDRLVGYKLS PFLWEKVKGG LSAGRVQSVA TRLVVEREEE
IENFKPEEYW TLEAVFKKDV QEFKAKFYGD KKGKIELKNQ DHVQKIEEKI KNKEFKVVKI
KVSEKKKNPP PPFITSTLQQ EASRKLRFTP AKTMAVAQML YEGVEIKGEG SVGLITYMRT
DSTRVSEEAQ QAARSLIVQK FGKEYLPEKP RVYKTKKDAQ DAHEAIRPTY LDMDPESIKD
SLTLDQYKLY KLIYDRFLAS QMESSVYEVL SAELEVEGYI FKLTGSKLKF AGFMEVYVEG
KDTEDEEEEN QLPEIREGEA LKPIKLESKQ HFTQPPSRYT EATLIKALEE NGIGRPSTYA
PTIQTILERG YVAKEDRFLK PTELGRIVTN ILKEYFKDII DIEFTAELES NLDKIEEGKL
EWTEVVKKYY QPLEKELEIA RATLLKVKVE DEETDIVCEN CGRKMVIKKG RYGKFLACPG
YPECKNTKPY YDYLDVLCPK CGKRIIEKKS KKGKRYYTCE GYPDCDLILW EKPVKNCPKC
GSLMFEKGKK GNKKLVCSNE NCAYQEKTGE KGE