Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sterm_1820 |
Symbol | |
ID | 8597289 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sebaldella termitidis ATCC 33386 |
Kingdom | Bacteria |
Replicon accession | NC_013517 |
Strand | + |
Start bp | 1953739 |
End bp | 1955979 |
Gene Length | 2241 bp |
Protein Length | 746 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | |
Product | DNA topoisomerase I |
Protein accession | YP_003308609 |
Protein GI | 269120432 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000828965 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGCAAAAA AATTAGTTAT TGTTGAGTCT CCCTCGAAGG CTAAAACAAT AGAGAAGATA CTGGGAAAGG GTTATGAGGT AGAAGCATCA TACGGACACG TTATTGATCT CCCAAAAACT AAAATCGGCG TAGATATAGA AAATAAATTT GAGCCTCACT ATCAGGTAAT AAAGGGTAAG GGAGATATAT TAAAAAAGCT GAAAGATAAA GCAAAAAAAG CAGATGCTGT ATATCTGGCA TCGGATAAGG ACCGTGAAGG TGAAGCTATA GCATGGCATA TCTCTAATTA TATCAAGGTT CCTGCTAAGA CAAAAAGAAT AGAGTTTAAC GAAATTACCA AAAGTGCAAT AAATAATGCA ATAAAGCATC CGAGAAACAT AGATGAAAAT CTGGTGAATG CCCAGCAGGC AAGAAGAATA CTGGACAGGA TAGTGGGTTA TAAAATAAGC CCGCTTCTGT GGAAGATAAT AAATAAGAAT GCAAGTGCAG GAAGAGTACA GTCTGTGGCT TTGAAGCTTA TCTGCGATCT CGAGGATGAA ATCAGAGAGT TTATTCCGCA GAAATACTGG GAAGTAACGG CAATTACAGA TAAAGGAATA GAACTGGGAA TATACGAAAT AGCTGGAAAA AAAGTAGACA GAATTTTTGA CGAAAAAGTA ATGAAGAAGC TGAAAAAGGA TCTGGCAAAG AAAAATCTTG AAGTATTTAA GATAAAAGTA ACAAAAAAGA CACAAAGACC GCCGCTGGTA TTCAAAACAA GCACGCTGCA GCAGCTGGCA TCTTCATATC TCGGTTTTGC CACATCAAAA ACAATGAGAG TGGCTCAGCA GCTTTATGAG GGACTTTCGA TAAACGGTGA AAATGTGGGA CTGATTACTT ATATGAGAAC TGATTCCACA AGAATTTCCA ATGAATCTAT GGCAGATGCA GGAAAGTACA TAGAAAAAAA CTTCGGAAAA GAATATGTAG GGAAGTATAT GCCGGCTAAA GCCAAAGGAA ATGTACAGGA TGCCCATGAG GGAATAAGAC CTACAGATAT TAATCTTTCT CCTGACAGCA TAAAAGACTC ATTGAGTGCG GAGCAGTATA AATTATACAA GCTGATATGG GAAAGATTTC TGGTATCACA GTTTTCGGCA ATGAAATATG ATCAGATGCA GATAAATGCC AAAAATGGGG ACTATGTATT CAGAGGAACT ATAAATAAAG TAACTTTTGA CGGATATTAT AAAGTATTTA AAGATGAAGA CGAAATAAAA ACAGCTGATT TTCCTGAAAT AAAAGAAGGA GACGAGCTTG TAGTAAAAGA GCTGAATATA AAAGATGGAA TGACAAAGCC GCCTGCAAGA TTTTCAGAGT CTTCACTGGT AAAAAAGCTG GAAGCGGAAG GAATAGGAAG ACCGTCTACA TATGCTTCCA TTATAGAAAC ACTGAAAACA AGAAATTATG CAGAGCTCGT GGATAAAAGA TTTATTCCGA CAGTTCTGGG CTATGAAGTA AAGTCTGAGC TGGAAAAGCA CTTTGAAAAG ATAATGAATA TAAAGTTCAC CGCTAATATG GAAGAGGAAC TTGATGAAAT AGAGAACGGA AGCATAAAGT GGGAAGAGCT TATGGCTAAT TTTTATAAAG GGCTGGAGGT AGATCTTACC AAGTTTGAAA AAGAGATTCA GGATCTTCAG GACAGAAGAA TAGAAGCAGA TATTATGTGT GCTAACGGAA CAGAGGCTAT GATACTGAAA ACCGGGAGAT TCGGAAAGTA TCTTATCTGT GAGTCGAATC CTGACGAGAA GGTTTCTCTG AAGGGTATTC AGATACCGAA GGAGGAGCTG GAGGCAGGTA AAATAATAGT AAAGGACAAG GTAGCCGAAA AGGAATCAGA GAAAAAAGGA GTTCCTACTG ATCATTTTAC TAAAGACGGA GCAAGAATTT TTGTAAAAAA GGGAAGATAC GGAGAATATC TTGAAAGTGA AGATTATGAA AATGATGAGA TAAGAATGCC GCTTCCTTAT AAGATAAAAC AGGAAATAAA AAAAGGTACT GCAAAAATGC AGGACGGGAT GTATATCATA CATGAAGAGC TTGAGAAAAT GCTGGCTGAG GATCAGAGAA TTATAGAAGA AGCAGGATTG TGTGAAAAAT GCGGCAGACC TTTTGAAATA AAAATAGGGA GATTTGGAAG ATTTTTGGCA TGTACGGGAT ATCCGGACTG CAAAAATATA AAGAAAATTC CGAAAAAGTA A
|
Protein sequence | MAKKLVIVES PSKAKTIEKI LGKGYEVEAS YGHVIDLPKT KIGVDIENKF EPHYQVIKGK GDILKKLKDK AKKADAVYLA SDKDREGEAI AWHISNYIKV PAKTKRIEFN EITKSAINNA IKHPRNIDEN LVNAQQARRI LDRIVGYKIS PLLWKIINKN ASAGRVQSVA LKLICDLEDE IREFIPQKYW EVTAITDKGI ELGIYEIAGK KVDRIFDEKV MKKLKKDLAK KNLEVFKIKV TKKTQRPPLV FKTSTLQQLA SSYLGFATSK TMRVAQQLYE GLSINGENVG LITYMRTDST RISNESMADA GKYIEKNFGK EYVGKYMPAK AKGNVQDAHE GIRPTDINLS PDSIKDSLSA EQYKLYKLIW ERFLVSQFSA MKYDQMQINA KNGDYVFRGT INKVTFDGYY KVFKDEDEIK TADFPEIKEG DELVVKELNI KDGMTKPPAR FSESSLVKKL EAEGIGRPST YASIIETLKT RNYAELVDKR FIPTVLGYEV KSELEKHFEK IMNIKFTANM EEELDEIENG SIKWEELMAN FYKGLEVDLT KFEKEIQDLQ DRRIEADIMC ANGTEAMILK TGRFGKYLIC ESNPDEKVSL KGIQIPKEEL EAGKIIVKDK VAEKESEKKG VPTDHFTKDG ARIFVKKGRY GEYLESEDYE NDEIRMPLPY KIKQEIKKGT AKMQDGMYII HEELEKMLAE DQRIIEEAGL CEKCGRPFEI KIGRFGRFLA CTGYPDCKNI KKIPKK
|
| |