Gene Information |
Locus tag | Tpau_3876 |
Symbol | |
ID | 9158057 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 3999194 |
End bp | 4002100 |
Gene Length | 2907 bp |
Protein Length | 968 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | DNA topoisomerase I |
Protein accession | YP_003648790 |
Protein GI | 296141547 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGCAC GCGGGAAGGC AGCAGACGGC GACGGTCTAC AGCGTCTGGT GATCGTGGAG TCCCCCGCCA AGGGCAAGAA GATCGGCGAT TTCCTGGGGC CGAACTACAC GGTCCGGGCG TCGATGGGGC ACATCCGCGA TCTGCCCAGC CGCGATAATC CGCTCCCCGA GGCCGATCAG GGCAAGTCCT GGTCGCGTCT CGGTGTGGAC GTCGATCACG ACTTCGAAGC CCACTACGTG ACCTCCGCGA GTAAGCGCTC GACGGTGTCC GAACTGAAGT CACTGCTCAA GCAGGCCGAC GAGCTCTACC TCGCGACCGA CGGTGACCGC GAGGGTGAAG CCATCGCGTG GCACCTCCAA GAGGTGCTCA AGCCCAAGGT CCCGGTCAAG CGGATGGTGT TCCACGAGAT CACCCAGCAG GCCATCCAGG CCGCCGCCCA GGAACCGCGC GAGCTCGACA TGAACCTCGT CGACGCGCAG GAGACCCGCC GCATCCTGGA CCGCCTCTAC GGCTACGAGG TCAGCCCCGT GCTGTGGCGC AAGGTCAACC AGGGCCTGTC CGCCGGCCGT GTGCAGTCGG TGGCGACGCG CATCATCGTC GACCGTGAGC GCGAGCGCAT CGCCTTCCGC ACCGCCGGGT ACTGGGATAT CGCCGCCCAG CTGGACGCCG GCGCCGAAGC CAGCCCCCGC ACGTTCGGGG CGCGACTGGT CAGTGTGGAT GGCGACCGCG TGGCCACCGG CCGGGATTTC GATGCCCAGG GCCAGCTCAA GAAGCCCACC GGGATCACCG TGCTCGACGG TACGCGGGCG AACGCCCTGG TCGCGGGCCT GCAGGGCGCG AACCTCGCCG TCACCTCGGT CGAGGAGAAG CCGTACACGC GCAAGCCCTA CGCGCCGTTC ATGACGTCGA CCCTGCAGCA GGAGGCGAGC CGCAAGCTGC GGTTCAACAC CGATCGCACG ATGCAGATCG CGCAGCGCCT CTACGAAGGC GGCTACATCA CCTACATGCG TACCGACTCG ACCACGCTGT CGGAGACCGC GATCGCCGCT GCCCGCGACC AGGCGCGGCA GCTGTACGGC GCCGATTTCG TGCACCCGAC ACCGCGGCAG TACACGCGCA AGGTCAAGAA CGCGCAGGAG GCGCACGAGG CGATCCGTCC CGCGGGCGAG ACCTTCCAGA CGCCCGGCGC ACTGGCGTCC GTGCTGAACT CGGATGAGTT CCGTCTGTAC GAACTGATCT GGCAGCGCAC CGTCGCCTCG CAGATGGCCG ATGTCAAGGG CACCACCCTG AGCTTGCGCA TCGGGGGCAC CGCCTCCTCC GGCGAGAGCG TCGAGTTCGC CGCCTCGGGC CGCACCATCA CCTTCCCCGG CTTCCTCTCG GCCTACGTCG AGACGGTCGA CGAGCAGGCG GGCGGCGAGG CCGATGACGC CGAATCCCGA CTGCCGCAGC TCACCAAGGG CCAGCGGGTG ACCGCGGCCG AGCTCACCGC GGCGGACCAC GTCACCAGCC CGCCCGCGCG CTACACCGAG GCCTCGCTGG TCAAGACGCT CGAAGAGCTG GGCATCGGCC GCCCGTCGAC CTACGCCTCG ATCATCAAGA CCATCCAGGA CCGCGGCTAC GTGGTGAAGA AGGGCAACGC CCTCGTCCCG CAGTGGGTCG CCTTCGCCGT GATCGGACTG CTGGAGGGCC ACTTCGGTGG CCTCGTCGAC TACAACTTCA CCGCCTCGAT GGAGGACGAT CTCGACGAGA TCGCCGGCGG CCGTGAGGGC CGGGTCGACT GGCTCACCCG GTTCTACTTC 
GGCGACGCCG GTGACGCAGC CGCGGCGGCC GCCGATGGTC CGGGCCCCGA CAGCCCCGGC CTGAAGAACC TCGTCGCCGC CAACCTCGAT GCGATCGACG CCCGCGAGAT CAACTCGATC CCGCTCTACA CCGACACCGA CGGCAACGTC GTCTACGTAC GAGTCGGCCG CTACGGGCCG TATCTCGAGC GCACCGTCGC ACCGAAGGAC GGCGAGACCG AGCCCCAGGT GCAGCGGGCC AACATCCTCG CGTCGATGAC CCCGGACGAG CTGACCGAGG AGGTGGCGGA GAAGCTCTTC GCGACCCCGC AGGACGGCCG CCCGCTCGGC GTGGACCCGG CCACGGGGCA CGAGATCGTC GCCAAGGAAG GACGATTCGG TCCTTACGTC ACCGAGATCC TGCCGGAGCC CGAGACCGAC GACGACACCG AGAATCTCGA CGAGGCGGCG GCGAAGCCCA AGCGCAAGAA GAAGACCGAC GCGCCGAAGC CGCGCACCGG ATCACTGCTC AAGAGTATGG ATATCGAGAC GGTGACGCTC GACGATGCGC TGCGGCTCCT GTCGCTGCCC CGCGTCGTGG GTGTCGACCC CGAGTCCAAG GAAGAGATCA CCGCTCAGAA CGGCCGCTAC GGTCCGTACC TGAAGAAGGG CACCGACTCC CGGTCGCTGG CCACCGAGGA ACAGATGTTC ACGGTGACCC TCGAGGAGGC GCTCAAGCTG TACGCGGAGC CCAAGCGTCG GGGCCGCGGT GCGGCTGCCG CGCCGCCACT GCGCGAGTTG GGCAACGACC CGGTTTCCGG GAACGCCATG GTGATCAAGG ACGGCCGGTT CGGCCCGTAC GTCACCGATG GGGAGACCAA CGCCTCGCTC CGCAAGGGCG ACGAGGTCGC GTCGATCACC GACGAGCGCG CTTCCGAGCT GCTCGCGGAC CGTCGAGCGC GCGGCCCGGT GAAGAAGAAG GCCACCAAGA AGGCGCCCGC CAAGAAGGCG CCGGCGAAGA AGGCCGCCGC GAAGAAGGCG GTCGCGAAGA AGACCACCGC CACGAAGACG GCCGCGAAGA AGGCACCGGC GAAGAAGGCG GCCCCGGCCG CCGACGAGAC GGTCTAA
|
Protein sequence | MAARGKAADG DGLQRLVIVE SPAKGKKIGD FLGPNYTVRA SMGHIRDLPS RDNPLPEADQ GKSWSRLGVD VDHDFEAHYV TSASKRSTVS ELKSLLKQAD ELYLATDGDR EGEAIAWHLQ EVLKPKVPVK RMVFHEITQQ AIQAAAQEPR ELDMNLVDAQ ETRRILDRLY GYEVSPVLWR KVNQGLSAGR VQSVATRIIV DRERERIAFR TAGYWDIAAQ LDAGAEASPR TFGARLVSVD GDRVATGRDF DAQGQLKKPT GITVLDGTRA NALVAGLQGA NLAVTSVEEK PYTRKPYAPF MTSTLQQEAS RKLRFNTDRT MQIAQRLYEG GYITYMRTDS TTLSETAIAA ARDQARQLYG ADFVHPTPRQ YTRKVKNAQE AHEAIRPAGE TFQTPGALAS VLNSDEFRLY ELIWQRTVAS QMADVKGTTL SLRIGGTASS GESVEFAASG RTITFPGFLS AYVETVDEQA GGEADDAESR LPQLTKGQRV TAAELTAADH VTSPPARYTE ASLVKTLEEL GIGRPSTYAS IIKTIQDRGY VVKKGNALVP QWVAFAVIGL LEGHFGGLVD YNFTASMEDD LDEIAGGREG RVDWLTRFYF GDAGDAAAAA ADGPGPDSPG LKNLVAANLD AIDAREINSI PLYTDTDGNV VYVRVGRYGP YLERTVAPKD GETEPQVQRA NILASMTPDE LTEEVAEKLF ATPQDGRPLG VDPATGHEIV AKEGRFGPYV TEILPEPETD DDTENLDEAA AKPKRKKKTD APKPRTGSLL KSMDIETVTL DDALRLLSLP RVVGVDPESK EEITAQNGRY GPYLKKGTDS RSLATEEQMF TVTLEEALKL YAEPKRRGRG AAAAPPLREL GNDPVSGNAM VIKDGRFGPY VTDGETNASL RKGDEVASIT DERASELLAD RRARGPVKKK ATKKAPAKKA PAKKAAAKKA VAKKTTATKT AAKKAPAKKA APAADETV
|
| |
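The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only and is not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene sequence includes the terminal stop codon (the listed sequence ends in TAA). The function names `gene_length`, `protein_length`, and `gc_fraction` are hypothetical helpers introduced here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among A/C/G/T bases; spaces and other symbols are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return sum(b in "GC" for b in bases) / len(bases)


if __name__ == "__main__":
    length = gene_length(3999194, 4002100)   # Start bp, End bp from the record
    print(length)                            # compare with the Gene Length field
    print(protein_length(length))            # compare with the Protein Length field
    print(gc_fraction("ATGGCGGCAC GCGGGAAGGC"))  # first two 10-base blocks only
```

Under these assumptions the coordinates give 2907 bp and 968 residues, matching the Gene Length and Protein Length fields. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.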