Gene Information |
Locus tag | Athe_2181 |
Symbol | |
ID | 7408374 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2308857 |
End bp | 2310938 |
Gene Length | 2082 bp |
Protein Length | 693 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643716546 |
Product | DNA topoisomerase I |
Protein accession | YP_002574029 |
Protein GI | 222530147 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial; [TIGR01057] DNA topoisomerase I, archaeal |
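
The Start bp, End bp, Gene Length and Protein Length fields above can be checked against each other with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is counted in the gene length but not in the protein length.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the record for Athe_2181.
gene_len, protein_len = cds_lengths(2308857, 2310938)
print(gene_len, protein_len)  # 2082 693
```

Under these assumptions the result agrees with the listed 2082 bp and 693 aa.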
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0024842 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAAAAAC TTGTCATTGT AGAGTCACCT GCAAAGGCAA AAACAATTGC AAAGTATCTT GGTAAAGAGT TTAAAGTAGA AGCTTCAATG GGGCATGTAA GGGACCTTCC CAAGAGTGAT TTGGGCGTTG ATATAGAAAA TGGTTTTGTC CCTAAGTATA TAAACATCAG AGGTAAGGCA GATGTAATAA ACAAGCTAAA AAAATATGCA CAAGAAGCAG AGAAGGTATA CCTTGCAACA GACCCCGACA GGGAAGGCGA GGCAATCTCA TGGCATTTAG CAACTATTTT AGGGCTTGAT ACAAACGATA ATGTGAGAAT TACATTCAAT GAGATAACAA AAAAGGCTGT ACAGGAATCT TTGAAAAATG CAAGGCCAAT TGACCAGAAC TTAGTTAATG CCCAGCAAGC CAGAAGAATT TTAGACAGAC TTGTCGGCTA CAAGCTAAGT CCATTTTTGT GGGAAAAGGT CAAGGGTGGA CTTTCTGCAG GAAGAGTTCA GTCTGTTGCA ACAAGGCTTG TGGTTGAAAG AGAAGAAGAG ATAGAAAATT TTAAGCCTGA AGAGTACTGG ACCTTAGAAG CTGTATTTAA AAAAGATGTC CAAGAGTTTA AGGCAAAGTT CTATGGAGAT AAGAAAGGGA AGATAGAGCT AAAAAATCAA GATCACGTTC AAAAAATTGA AGAAAAGATA AAAAATAAAG AATTCAAGGT TGTAAAGATA AAGGTGTCAG AGAAGAAGAA AAATCCGCCC CCACCTTTTA TAACAAGCAC ACTTCAGCAG GAGGCATCAA GAAAACTGAG ATTTACTCCT GCAAAGACAA TGGCAGTTGC GCAGATGCTG TATGAAGGTG TTGAGATAAA AGGTGAGGGA AGTGTTGGAC TTATAACATA TATGAGAACA GATTCAACAA GGGTTTCTGA AGAGGCACAG CAGGCAGCAA GAAGTCTTAT CGTACAGAAG TTTGGCAAAG AATATCTTCC TGAAAAGCCG AGGGTTTACA AGACAAAAAA AGATGCGCAG GACGCTCATG AGGCTATAAG ACCTACTTAT TTGGATATGG ACCCTGAGAG TATAAAAGAT TCTCTGACTC TTGATCAGTA CAAGCTGTAC AAACTCATTT ATGACAGATT TTTAGCGTCG CAGATGGAAA GCAGCGTATA TGAGGTTCTT TCAGCCGAGC TTGAAGTTGA GGGTTATATT TTTAAACTCA CAGGTTCAAA GCTCAAGTTT GCAGGGTTTA TGGAAGTATA TGTTGAAGGT AAGGATACAG AAGATGAAGA GGAGGAAAAT CAGCTTCCAG AAATTAGAGA AGGAGAGGCT TTAAAGCCCA TAAAACTTGA GAGCAAACAG CATTTTACTC AACCGCCTTC TCGCTATACT GAAGCAACCT TAATAAAGGC TTTAGAAGAA AATGGGATAG GAAGACCCAG CACATACGCT CCAACAATCC AGACAATTCT GGAGAGAGGA TATGTTGCCA AAGAAGATAG GTTTTTAAAA CCAACCGAAT TGGGCAGAAT TGTAACAAAT ATACTTAAAG AATATTTCAA AGACATAATA GACATTGAAT TTACTGCAGA GCTTGAGAGC AACCTTGACA AAATTGAGGA AGGAAAACTT GAGTGGACAG AGGTGGTAAA AAAATACTAC CAGCCACTTG AAAAAGAACT TGAGATAGCA CGAGCTACTT TGCTAAAGGT TAAGGTTGAG GATGAGGAGA CAGACATTGT ATGCGAAAAC TGTGGAAGAA AAATGGTGAT AAAAAAAGGT AGATACGGAA AGTTCTTGGC ATGTCCAGGA 
TATCCTGAAT GCAAAAACAC AAAACCTTAT TACGATTACC TTGATGTGTT GTGTCCAAAG TGCGGCAAGA GGATAATAGA AAAGAAGTCC AAGAAGGGCA AGAGATATTA CACGTGCGAG GGGTATCCTG ACTGTGACCT AATTTTGTGG GAAAAACCAG TCAAAAACTG TCCGAAGTGT GGCAGTCTCA TGTTTGAAAA GGGCAAGAAA GGGAATAAAA AGCTTGTATG TTCAAATGAA AACTGTGCTT ACCAAGAAAA AACGGGGGAA AAAGGTGAGT AA
|
Protein sequence | MKKLVIVESP AKAKTIAKYL GKEFKVEASM GHVRDLPKSD LGVDIENGFV PKYINIRGKA DVINKLKKYA QEAEKVYLAT DPDREGEAIS WHLATILGLD TNDNVRITFN EITKKAVQES LKNARPIDQN LVNAQQARRI LDRLVGYKLS PFLWEKVKGG LSAGRVQSVA TRLVVEREEE IENFKPEEYW TLEAVFKKDV QEFKAKFYGD KKGKIELKNQ DHVQKIEEKI KNKEFKVVKI KVSEKKKNPP PPFITSTLQQ EASRKLRFTP AKTMAVAQML YEGVEIKGEG SVGLITYMRT DSTRVSEEAQ QAARSLIVQK FGKEYLPEKP RVYKTKKDAQ DAHEAIRPTY LDMDPESIKD SLTLDQYKLY KLIYDRFLAS QMESSVYEVL SAELEVEGYI FKLTGSKLKF AGFMEVYVEG KDTEDEEEEN QLPEIREGEA LKPIKLESKQ HFTQPPSRYT EATLIKALEE NGIGRPSTYA PTIQTILERG YVAKEDRFLK PTELGRIVTN ILKEYFKDII DIEFTAELES NLDKIEEGKL EWTEVVKKYY QPLEKELEIA RATLLKVKVE DEETDIVCEN CGRKMVIKKG RYGKFLACPG YPECKNTKPY YDYLDVLCPK CGKRIIEKKS KKGKRYYTCE GYPDCDLILW EKPVKNCPKC GSLMFEKGKK GNKKLVCSNE NCAYQEKTGE KGE
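
The gene sequence begins with TTG rather than ATG, while the protein sequence begins with M. Under translation table 11, TTG is an accepted alternative initiation codon and is read as methionine at the start position. The sketch below illustrates this on the first 30 nucleotides of the record only. The codon dictionary is a deliberately partial subset of the standard code covering just those codons, the start-codon set is a subset of those allowed by table 11, and the name `translate_prefix` is hypothetical.

```python
# First 30 nt of the gene sequence from the record, with spaces removed.
GENE_PREFIX = "TTGAAAAAACTTGTCATTGTAGAGTCACCT"

# Partial codon table: only the codons that occur in GENE_PREFIX.
CODONS = {
    "TTG": "L", "AAA": "K", "CTT": "L", "GTC": "V", "ATT": "I",
    "GTA": "V", "GAG": "E", "TCA": "S", "CCT": "P",
}

# Subset of the initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "TTG", "GTG"}


def translate_prefix(dna: str) -> str:
    """Translate a coding-strand prefix, reading a valid start codon as M."""
    usable = len(dna) - len(dna) % 3
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    residues = [CODONS[c] for c in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"  # initiator codon is translated as methionine
    return "".join(residues)


print(translate_prefix(GENE_PREFIX))  # MKKLVIVESP
```

The output matches the first ten residues of the listed protein sequence. A full translation would need the complete codon table, for example from a bioinformatics library.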