Gene Information |
Locus tag | TBFG_13675 |
Symbol | |
ID | 5224365 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium tuberculosis F11 |
Kingdom | Bacteria |
Replicon accession | NC_009565 |
Strand | - |
Start bp | 4097456 |
End bp | 4100260 |
Gene Length | 2805 bp |
Protein Length | 934 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640608445 |
Product | DNA topoisomerase I |
Protein accession | YP_001289602 |
Protein GI | 148824848 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
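The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a single stop codon; neither convention is stated in the record. The helper names (`span_length`, `expected_protein_length`) are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4097456
END_BP = 4100260
GENE_LENGTH_BP = 2805
PROTEIN_LENGTH_AA = 934


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks agree with the record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                 # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

Under these assumptions the reported gene length (2805 bp) and protein length (934 aa) are consistent with the start and end coordinates.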
| |
Plasmid Coverage information |
Num covering plasmid clones | 140 |
Plasmid unclonability p-value | 0.000348117 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 179 |
Fosmid unclonability p-value | 0.0874905 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGCTGACC CGAAAACGAA GGGCCGTGGC AGCGGCGGCA ATGGCAGCGG CCGGCGACTG GTCATCGTCG AGTCGCCCAC CAAGGCGCGC AAGCTGGCCT CCTACCTGGG CTCTGGCTAC ATCGTCGAGT CCTCCCGGGG GCACATCCGT GACTTGCCGC GGGCCGCGTC GGATGTACCC GCAAAGTACA AGTCGCAGCC GTGGGCGCGG CTCGGGGTCA ACGTCGACGC CGACTTCGAA CCGCTCTACA TCATCAGCCC GGAGAAACGG AGCACCGTCA GCGAGCTCAG GGGCCTGCTC AAAGACGTGG ACGAGCTGTA TCTGGCCACG GATGGGGACC GTGAGGGCGA AGCTATTGCC TGGCATCTGC TGGAAACCCT CAAACCGCGC ATACCGGTAA AGCGGATGGT CTTCCACGAG ATCACCGAAC CGGCGATCCG CGCCGCCGCC GAGCACCCCC GCGACCTAGA CATCGACCTG GTCGACGCGC AGGAGACCCG GCGCATCCTG GACCGGCTGT ACGGCTACGA AGTCAGCCCA GTGCTGTGGA AGAAGGTCGC CCCCAAGTTG TCGGCGGGCC GGGTGCAGTC GGTGGCCACC CGCATCATCG TGGCGCGCGA ACGCGACCGC ATGGCGTTCC GCAGCGCGGC CTACTGGGAC ATCCTTGCCA AGCTGGATGC CAGCGTGTCC GACCCGGACG CCGCGCCGCC CACCTTCAGC GCCCGGCTGA CGGCCGTGGC TGGCCGGCGG GTGGCCACTG GCCGCGATTT CGACTCGCTG GGCACGCTGC GCAAAGGCGA CGAAGTCATT GTGCTCGACG AGGGGAGCGC GACCGCGTTG GCCGCGGGCC TGGATGGCAC GCAGCTGACC GTGGCCTCGG CCGAGGAGAA GCCCTACGCC CGGCGCCCGT ACCCGCCGTT CATGACCTCC ACGCTGCAGC AAGAGGCCAG CCGCAAGCTG CGGTTCTCCG CCGAGCGGAC GATGAGCATC GCCCAGCGGC TGTACGAAAA CGGCTACATC ACCTATATGC GTACCGACTC CACCACGCTG TCGGAGTCGG CGATCAACGC CGCACGTACC CAGGCGCGCC AGCTCTACGG CGACGAGTAC GTCGCGCCGG CGCCGCGCCA ATACACCCGC AAGGTGAAGA ACGCCCAGGA AGCGCACGAG GCTATCCGGC CCGCCGGTGA AACGTTTGCC ACCCCGGACG CGGTGCGTCG CGAACTCGAC GGTCCCAACA TTGATGATTT CCGGCTCTAT GAGCTGATTT GGCAACGCAC CGTAGCCTCG CAGATGGCCG ATGCGCGGGG CATGACGCTG AGCCTGCGGA TCACTGGCAT GTCGGGGCAC CAGGAGGTGG TGTTCTCCGC GACCGGACGC ACCTTGACGT TCCCGGGCTT CCTCAAGGCC TACGTGGAGA CCGTGGACGA GCTGGTCGGC GGCGAGGCTG ACGATGCCGA GCGGCGACTG CCCCATCTGA CCCCGGGTCA ACGGTTGGAC ATCGTCGAGT TGACCCCAGA CGGCCATGCC ACCAACCCGC CGGCCCGCTA CACCGAGGCG TCGCTGGTCA AAGCGCTCGA GGAGCTGGGC ATCGGCCGCC CGTCGACCTA CTCGTCGATC ATCAAGACCA TCCAGGATCG CGGCTACGTG CACAAGAAGG GCAGTGCACT GGTGCCGTCA TGGGTGGCGT TCGCGGTAAC CGGTCTGCTC GAGCAGCATT TCGGTCGGCT CGTCGACTAC GACTTCACCG CGGCGATGGA AGACGAGCTC GACGAGATCG CCGCCGGCAA CGAGCGCCGC 
ACCAACTGGC TCAACAACTT CTACTTTGGT GGCGATCACG GTGTGCCCGA TTCGGTAGCC CGATCGGGTG GCCTCAAGAA GCTTGTCGGG ATCAATCTCG AGGGCATCGA CGCACGAGAA GTAAACTCTA TCAAGCTTTT TGACGACACC CACGGACGCC CCATATATGT TCGGGTGGGC AAGAACGGTC CCTACCTGGA ACGTTTGGTG GCCGGCGACA CCGGTGAGCC CACGCCGCAG CGGGCCAACC TCAGCGACTC GATTACCCCG GACGAGCTGA CTCTACAGGT GGCCGAAGAG CTCTTTGCCA CACCGCAACA GGGACGGACT TTGGGCTTGG ACCCAGAAAC CGGCCACGAG ATCGTGGCCA GGGAAGGCCG GTTTGGGCCG TATGTGACCG AGATCCTGCC GGAGCCTGCG GCTGATGCGG CCGCGGCCGC TCAGGGAGTC AAGAAACGCC AGAAGGCCGC CGGGCCCAAA CCGCGCACCG GTTCGTTGCT GCGGAGCATG GACCTACAGA CGGTCACCCT CGAAGACGCG CTGAGGCTGC TGTCACTGCC GCGCGTGGTC GGAGTGGACC CCGCCTCGGG TGAGGAGATC ACCGCGCAGA ACGGGCGCTA CGGACCGTAT CTAAAGCGCG GCAACGATTC TCGATCACTG GTCACCGAAG ACCAGATATT CACCATCACG CTCGACGAAG CCCTGAAGAT CTACGCAGAG CCGAAACGTC GTGGCCGGCA AAGCGCTTCG GCTCCGCCGC TGCGCGAGCT GGGAACAGAT CCGGCGTCGG GCAAGCCAAT GGTCATCAAG GACGGCCGAT TCGGGCCGTA CGTCACCGAC GGTGAGACCA ATGCCAGCAT GCGTAAGGGC GACGACGTGG CTTCCATAAC CGACGAGCGC GCCGCCGAGC TGTTGGCCGA TCGCCGAGCC CGGGGTCCGG CAAAACGGCC AGCCAGGAAA GCTGCCCGGA AGGTGCCGGC GAAGAAGGCA GCCAAGCGCG ACTAG
|
Protein sequence | MADPKTKGRG SGGNGSGRRL VIVESPTKAR KLASYLGSGY IVESSRGHIR DLPRAASDVP AKYKSQPWAR LGVNVDADFE PLYIISPEKR STVSELRGLL KDVDELYLAT DGDREGEAIA WHLLETLKPR IPVKRMVFHE ITEPAIRAAA EHPRDLDIDL VDAQETRRIL DRLYGYEVSP VLWKKVAPKL SAGRVQSVAT RIIVARERDR MAFRSAAYWD ILAKLDASVS DPDAAPPTFS ARLTAVAGRR VATGRDFDSL GTLRKGDEVI VLDEGSATAL AAGLDGTQLT VASAEEKPYA RRPYPPFMTS TLQQEASRKL RFSAERTMSI AQRLYENGYI TYMRTDSTTL SESAINAART QARQLYGDEY VAPAPRQYTR KVKNAQEAHE AIRPAGETFA TPDAVRRELD GPNIDDFRLY ELIWQRTVAS QMADARGMTL SLRITGMSGH QEVVFSATGR TLTFPGFLKA YVETVDELVG GEADDAERRL PHLTPGQRLD IVELTPDGHA TNPPARYTEA SLVKALEELG IGRPSTYSSI IKTIQDRGYV HKKGSALVPS WVAFAVTGLL EQHFGRLVDY DFTAAMEDEL DEIAAGNERR TNWLNNFYFG GDHGVPDSVA RSGGLKKLVG INLEGIDARE VNSIKLFDDT HGRPIYVRVG KNGPYLERLV AGDTGEPTPQ RANLSDSITP DELTLQVAEE LFATPQQGRT LGLDPETGHE IVAREGRFGP YVTEILPEPA ADAAAAAQGV KKRQKAAGPK PRTGSLLRSM DLQTVTLEDA LRLLSLPRVV GVDPASGEEI TAQNGRYGPY LKRGNDSRSL VTEDQIFTIT LDEALKIYAE PKRRGRQSAS APPLRELGTD PASGKPMVIK DGRFGPYVTD GETNASMRKG DDVASITDER AAELLADRRA RGPAKRPARK AARKVPAKKA AKRD
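The gene sequence begins with TTG while the protein begins with M, which fits translation table 11, where certain alternative start codons are read as methionine at the first position. The following is a minimal sketch of such a translation, using only the Python standard library. The function name `translate_cds` is hypothetical, and the start-codon set is the one commonly listed for table 11; treat it as an assumption rather than as the method used to produce this record.

```python
from itertools import product

# Standard genetic code layout (shared by table 11 for elongation codons),
# ordered with bases T, C, A, G at each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: initiation codons commonly listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence given 5'->3' on its coding strand.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    A recognised start codon at position 1 is read as M, and one trailing
    stop codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if not codons:
        return ""
    protein = [CODON_TABLE[c] for c in codons]
    if codons[0] in START_CODONS:
        protein[0] = "M"
    if protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First seven codons of the gene sequence above.
print(translate_cds("TTGGCTGACC CGAAAACGAA G"))  # MADPKTK
# Last five codons, including the TAG stop.
print(translate_cds("GCCAAGCGCG ACTAG"))         # AKRD
```

The two fragments reproduce the first seven and last four residues of the protein sequence shown in the record.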
|