Gene Information |
Locus tag | Namu_0738 |
Symbol | |
ID | 8446325 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 810703 |
End bp | 813639 |
Gene Length | 2937 bp |
Protein Length | 978 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 645039873 |
Product | DNA topoisomerase I |
Protein accession | YP_003200141 |
Protein GI | 258650985 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
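
The coordinate and length fields above are mutually consistent. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon that is not translated."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp / End bp rows of the record.
length = gene_length_bp(810703, 813639)
residues = protein_length_aa(length)
print(length, residues)  # 2937 978
```

Under these assumptions the computed values match the Gene Length (2937 bp) and Protein Length (978 aa) rows.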
|
|
Plasmid Coverage information |
Num covering plasmid clones | 44 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCACTG GCACCTCCGC GAGCACCCGC AACGGCGGCG TGCGGCTGGT CGTCGTCGAG TCACCGAGCA AGGCGAAGAC CATCTCCGGT TACCTCGGCG ACGGCTACAT CGTCGAGTCC TCGGTCGGGC ACATCCGCGA CCTGCCCCGC GGCGCCGCCG ACGTACCGGC CAAGTACAAG GGCGAGCCCT GGGCCCGGCT GGGCGTGGAC ACCGAGCACG GCTTCGAGCC GCTCTACGTC GTCTCCCCGG AGAAGAAGGC CCAGGTCGCT AAGCTCAAGT CGCTGCTGGC CGGGGCCGAC GAGCTCTACC TGGCCACAGA CGAGGACCGC GAGGGCGAGG CCATCGCCTG GCATCTGCTG GAGACCCTCA AGCCCAAGGT CCCGGTCAAG CGGATGGTCT TCCACGAGAT CACCCCGGCC GCGATCCGCG AGGCCGCCGC CAACCCGCGC GCCCTGGACG AGAACCTGGT CGACGCCCAG GAGACCCGCC GCATCCTGGA CCGGTTGTAC GGCTACGAGG TCTCCCCGGT GCTGTGGAAG AAGGTGATGC CCAAGCTGTC CGCCGGTCGC GTGCAGTCGG TGGCCACCCG CATCATCGTG CAGCGCGAAC GCGAGCGGAT GGCGTTCCGC TCGGGCACCT ACTGGGGCCT GGACGCGCTC ATGTCGCCGG CCGGCACCGG GGCCGAGCCG TTCAAGTCGG CGCTGAGCAC CGTGGACGGG CGCCGGCTGG CCGCCGGCCG CGACTTCGAC CCGGCCACCG GCGGGCTCAA GGCCGACGCC GACGTGCTGC TGCTGGACGA GGACGGGGCG CGCACCCTGG CCACCGCGCT GGCCGGCGGC ACCGCCACCG TCACCTCGGT GGAGGAGAAG CCCTACACCC GCAAGCCCTA CCCGCCGTTC ATGACCTCCA CCCTGCAGCA GGAGGCCGGC CGCAAACTGG GGTTCAACTC CGAGCGCACC ATGCGCACCG CGCAGCGGCT GTACGAGAAC GGCTTCATCA CCTACATGCG AACCGACTCG ACCACCCTGA GCTCGTCGGC CCAGGAGGCC GCCCGGGCCC AGGCCCGCGA GCTGTACGGG CCGGAGTACG TGCCGCCGAC CCCGCGGCAG TACACCCGCA AGGTCAAGAA CGCCCAGGAG GCGCACGAGG CCATCCGCCC GGCCGGCGAC AACTTCCGCA CCCCGGGTCA GGTCGCCAAC CAGATCTCCG GCGACGAGTA CCGGCTCTAC GAGCTGATCT GGCAGCGCAC CATCGCCTCG CAGATGGTCG ACGCCCGCGG CCTGACCCTG TCGGTCAAGA TCGCCGCGAC CGCGCGGGAG CAGGAGTGCG TGTTCAGCGC GTCCGGCCGC ACCATCACCT TCCCCGGGTT CCTGCGGGCC TACGTGGAGA CGGTGGACGC GGAGGCCGGC GGCGAGGCCG ACGACGCCGA ACGGCGGCTG CCCAAGCTGG AGACCGGGCA AAAGCTCGAC ATCCGTGACC TGATCCCGGC CAGCCACGTG ACCACCCCGC CGGCCCGGTA CACCGAGCCG TCGCTGATCG GCGCGTTGCA GGACCTGGGC ATCGGCCGTC CCTCCACCTA CACCTCGATC ATCCGCACGA TCATCGACCG CGGGTACGTG TGGAAGAAGG GGCAGGCGCT GGTCCCGTCC TGGATCGCGT TCGCCGTCAT CGGCCTGCTC GAGCAGCACT TCTCCCGGCT GGTGGACTAC AACTTCACCG CGGCGATGGA GGACGAGCTC GACGGCATCG CCGACGGGCG GATCGGCCGC ACCGACTGGC TGTCCGCGTT CTACTTCGGC 
GGCGACCTGG GCCCGGCCGG CTCGGTCGGC CGCTCCGGCG GCCTGAAGAA GCTGGTCGGC GAGCGGCTGG AGGACATCGA CGCCCGCGAG GTCAACTCGT TGCCGTTGCT GACCGACGCC GAGGGCCGGC AGGTGCTGGT CCGGGTCGGC CGGTACGGCC CGTACCTGGA GCGGATGGTG CAGGGCGAGG ACGGCGAGCC GACCGCCCAG CGGGCCAACC TGCCCGAGGA CCTGCCCCCG GACGAGGTCG ACGCCGAGGT CGCCGAGAAG CTGTTCAGCC AGTCCGGCGA CGGTGGCGAG ACCGAGCTCG GGGTGGATCC GGACACCGGG CACCTGATCG TCGCCAAGGA CGGCCGGTTC GGCCCCTACG TCACCGAGGT GCTGCCGGAG GCCGCTCCGG CGGCCACCGG GGCCGACGGG ACCGCCAAGA AGACGACCAA GGCCAAGGCG GCGGCCAAGC CGCGTACCGC GTCGCTGTTC AAGTCGATGA CCCTGGACAC CATCGACCTG CCCACCGCGC TACGGCTGCT GTCGCTGCCC CGGGTGGTCG GCGTCGATCC GGCCGACGGC CAGGAGATCA CCGCGCAGAA CGGCCGGTAC GGGCCCTACC TGAAGAAGGG CACCGACTCC CGGTCGCTGA CCAGCGAGGA CGCGCTGTTC GACGTCACCC TGGACGAGGC GCTGGCCCTG TACGCGCAGC CCAAGACCCG CGGCCGGTCC GCGGCGGCCG CGCCGCCGCT GCGGGAGGTC GGCATCGACC CGTCCGGCGG CAAACCGATG GTGATCAAGG ACGGCCGGTT CGGGCCGTAC GTCACCGACG GGGAGACCAA CGCCTCCCTG CGCAAGGGTG ACGAGGTCGA GACCCTCACC GTGGAGCGCG CGGCCGAGCT GCTGGCCGAT CGACGCGCCC GCGGGCCGGC CCCCAAGCGG GCGACCACCC GCAAGCCGGC GGCGGCCAAG GCCGGTGCCG CGGCCGGTGG CACCAAGACG GCCACCAAGA CTGCGGCGGC CAAGACGACG GCCACCAAGA CCGCCACCAA GACTGCTTCC AAGACCACGG CGGCGGCGAA GGCGCGGACC ACCAAGGCGG CCGGCACCAC CCGCAGCACC ACCCGTCGGA CCGGCCCCGC GGAGTGA
|
Protein sequence | MATGTSASTR NGGVRLVVVE SPSKAKTISG YLGDGYIVES SVGHIRDLPR GAADVPAKYK GEPWARLGVD TEHGFEPLYV VSPEKKAQVA KLKSLLAGAD ELYLATDEDR EGEAIAWHLL ETLKPKVPVK RMVFHEITPA AIREAAANPR ALDENLVDAQ ETRRILDRLY GYEVSPVLWK KVMPKLSAGR VQSVATRIIV QRERERMAFR SGTYWGLDAL MSPAGTGAEP FKSALSTVDG RRLAAGRDFD PATGGLKADA DVLLLDEDGA RTLATALAGG TATVTSVEEK PYTRKPYPPF MTSTLQQEAG RKLGFNSERT MRTAQRLYEN GFITYMRTDS TTLSSSAQEA ARAQARELYG PEYVPPTPRQ YTRKVKNAQE AHEAIRPAGD NFRTPGQVAN QISGDEYRLY ELIWQRTIAS QMVDARGLTL SVKIAATARE QECVFSASGR TITFPGFLRA YVETVDAEAG GEADDAERRL PKLETGQKLD IRDLIPASHV TTPPARYTEP SLIGALQDLG IGRPSTYTSI IRTIIDRGYV WKKGQALVPS WIAFAVIGLL EQHFSRLVDY NFTAAMEDEL DGIADGRIGR TDWLSAFYFG GDLGPAGSVG RSGGLKKLVG ERLEDIDARE VNSLPLLTDA EGRQVLVRVG RYGPYLERMV QGEDGEPTAQ RANLPEDLPP DEVDAEVAEK LFSQSGDGGE TELGVDPDTG HLIVAKDGRF GPYVTEVLPE AAPAATGADG TAKKTTKAKA AAKPRTASLF KSMTLDTIDL PTALRLLSLP RVVGVDPADG QEITAQNGRY GPYLKKGTDS RSLTSEDALF DVTLDEALAL YAQPKTRGRS AAAAPPLREV GIDPSGGKPM VIKDGRFGPY VTDGETNASL RKGDEVETLT VERAAELLAD RRARGPAPKR ATTRKPAAAK AGAAAGGTKT ATKTAAAKTT ATKTATKTAS KTTAAAKART TKAAGTTRST TRRTGPAE
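
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch, using only the Python standard library, of how such a block-formatted string might be normalized and its GC fraction computed. The helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, so its GC fraction is not the whole-gene value reported in the GC content row.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First two ten-base blocks of the gene sequence above (illustrative input only).
fragment = clean_sequence("ATGGCCACTG GCACCTCCGC")
print(len(fragment), fragment.startswith("ATG"), gc_fraction(fragment))
```

Applying the same two helpers to the full gene sequence would give a figure that can be compared with the 72% in the record; how that reported value was rounded is not stated in the source.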