Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_1387 |
Symbol | |
ID | 8012480 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 1374676 |
End bp | 1377330 |
Gene Length | 2655 bp |
Protein Length | 884 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644823972 |
Product | DNA topoisomerase I |
Protein accession | YP_002975218 |
Protein GI | 241204122 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.903307 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.17492 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGTTG TAGTGGTGGA ATCGCCTGCC AAGGCCAAGA CGATCAATAA GTATCTGGGT CCGGGATACA AAGTGCTCGC CTCCTTCGGC CATGTGCGCG ATCTGCCTGC CAAGGACGGC TCCGTTCTTC CTGATCAGGA TTTCGAAATG CTTTGGGAGG TCGATAGCGC CTCCGCCAAG CGGATGAAGG ACATTGCCGA CGCGGTGAAG TCCGCCGATG GCCTGTTTCT CGCGACCGAC CCGGATCGCG AAGGCGAAGC GATTTCCTGG CACGTTCTCG ACATGCTGAA CAAGAAGCGC GTGCTGAACG GCAAACCGGT CAAGCGCGTC GTCTTCAATG CGATCACCAA GAAAGCGGTG CTCGACGCGA TGGCCGATCC GCGCGACATC GACGTGCCGC TGGTCGATGC CTATCTCGCG CGCCGCGCGC TCGACTATCT CGTCGGCTTC AATCTTTCGC CGGTGCTGTG GCGCAAGCTG CCCGGCGCAC GTTCGGCCGG CCGCGTCCAG TCTGTGGCGC TGCGTCTCGT CTGCGACCGC GAATCCGAGA TCGAGCGCTT CATTTCCGAA GAGTACTGGA ACATCTCAGC GCTCCTGAAG ACGCCGCGCG GCGACGAGTT CGAGGCAAGA CTGGTTTCGG CGAATGGCAA ACGACTGCAG CCGCGCGCAA TCGGTAACGG CGAGGATGCC GGCCGGCTCA AGGCCTTGCT CGAAGGCGCG AGCTACGTCG TCGACTCTGT CGAGGCGAAA CCGGTCAAAC GTAATCCCGG ACCGCCCTTC ACGACCTCGA CGCTGCAACA GGCCGCCTCC TCCAACCTCG GCTTCTCGGC CTCGCGCACC ATGCAGGTTG CCCAAAAGCT CTATGAGGGC GTCGATATCG GCGGCGAGAC GGTCGGTCTG ATCACCTATA TGCGAACCGA CGGCGTGCAG ATGGCGCCCG AGGCGATCGA CGCGGCGCGC AGTGCCATCG TCGACCAGTT CGGCGAACGC TACATGCCGG AGAAGCCGCG CTTCTATTCG ACCAAGGCGA AGAACGCCCA GGAGGCGCAC GAGGCGATCC GTCCGACCGA TTTCGACCGC TCGCCCGACC GTGTCCGCAA ATTCCTCGAT GCCGACCAGA TCCGGCTCTA CGAGCTGATC TGGAAGCGCG GCATCGCCAG CCAGATGGCG TCGGCCGAGA TCGAGCGCAC GACGGCTGAA ATCACCGCCG ACAACAAGGG CGAGAAGGCC GGGCTTCGTG CCGTCGGTTC GGTCATCCGA TTCGATGGTT TCATCGCCGC CTATACCGAC CAGAAGGAAG ATGGCGAGCA GAGCGACGAT GGCGACGAGG ATGGCCGTCT GCCGGAGATC ATTGCGCGCG AGGCGCTCGC CAAGCAGAAG ATCAATTCGA CGCAGCATTT CACCGAACCG CCGCCGCGCT ATTCGGAAGC GACGCTGATC AAGAAGATGG AAGAGCTCGG CATCGGCCGC CCCTCCACCT ATGCCGCGAC GCTGGCAACG CTGCGCGACC GCGACTATGT GACGATCGAC AAGCGCAAGC TGATCCCGCA GGCCAAGGGC CGGCTGGTGA CGGCTTTCCT CGAGAGCTTC TTCACCAAAT ATGTCGAATA CGACTTCACC GCCGATCTCG AAGAGAAGCT TGACCGGATT TCCGCCGGCG AGTTGAACTG GAAGCAGGTG CTGCGCGATT TCTGGAAGGA TTTCTTCGCC CAGATCGAGG ACACCAAGGA ACTGCGCGTC ACCAACGTGC TGGATTCGCT GAACGAGGCG CTGGCACCGC TCGTCTTCCC GAAACGGGAG GACGGCAGCG ACCCCAGAAT CTGTCAGGTC TGCGGCACCG GCAACCTGTC GCTGAAGCTC GGCAAATACG GCGCCTTCGT CGGCTGCTCG AACTATCCGG ACTGCAACTA CACCCGCCAG CTCTCCTCCG AAAACGGCGG AGATGCGGAT GGTGCTGCGC TCAACGAGCC GAAGAACCTC GGCACCGATC CGACCACCGG CGAGGAACTG ACGCTGCGTT CCGGCCGCTT CGGCCCCTAT ATCCAGCGCG GCGACGGCAA GGAAGCCAAA CGGGCCTCGC TGCCGAAGGG CTGGAAACCC GAGGACATCG ATTATGAAAA GGCGATGGCG CTGATCTCGC TGCCGCGCGA TATCGGCAAA CATCCCGAAT CGGGCAAGAT GATCTCGTCA GGCATCGGCC GCTATGGGCC GTTCCTCCTG CATGACGGTT CCTATGCCAA TCTGGAAACC GTCGAGGACG TGTTCTCGGT CGGCCTCAAC CGCGCCGTGA CTGTTATCGC CGAAAAGGCG AACCAGGCAC CCGGCCGGGG CGCGCGCGGC ACACCGGCGG CACTAAAGAC GCTCGGCGAT CATCCTGATG GCGGCGCCAT TACCGTTCGC GACGGCAAAT ACGGCCCCTA TGTCAACTGG GGCAAGGTCA ACGCCACACT GCCGAAGGGC AAGGATCCCC AGGCGATCAC CGTCGAGGAG GCGCTCGCCT TGATCGCCGA GAAGGCCGGA AAGACACCTG TGGGCAAAGC GGCCAAGACG AAAGCCAAGC CGAAGGCAGC GGCCGCCGAA GCCAAGAGCA CCAAGACGGC CGCCAAACCG AAGGCAACGA AGGCGAAGGC GCCCGCGAAA TCGAAAAAGA GCTGA
|
Protein sequence | MNVVVVESPA KAKTINKYLG PGYKVLASFG HVRDLPAKDG SVLPDQDFEM LWEVDSASAK RMKDIADAVK SADGLFLATD PDREGEAISW HVLDMLNKKR VLNGKPVKRV VFNAITKKAV LDAMADPRDI DVPLVDAYLA RRALDYLVGF NLSPVLWRKL PGARSAGRVQ SVALRLVCDR ESEIERFISE EYWNISALLK TPRGDEFEAR LVSANGKRLQ PRAIGNGEDA GRLKALLEGA SYVVDSVEAK PVKRNPGPPF TTSTLQQAAS SNLGFSASRT MQVAQKLYEG VDIGGETVGL ITYMRTDGVQ MAPEAIDAAR SAIVDQFGER YMPEKPRFYS TKAKNAQEAH EAIRPTDFDR SPDRVRKFLD ADQIRLYELI WKRGIASQMA SAEIERTTAE ITADNKGEKA GLRAVGSVIR FDGFIAAYTD QKEDGEQSDD GDEDGRLPEI IAREALAKQK INSTQHFTEP PPRYSEATLI KKMEELGIGR PSTYAATLAT LRDRDYVTID KRKLIPQAKG RLVTAFLESF FTKYVEYDFT ADLEEKLDRI SAGELNWKQV LRDFWKDFFA QIEDTKELRV TNVLDSLNEA LAPLVFPKRE DGSDPRICQV CGTGNLSLKL GKYGAFVGCS NYPDCNYTRQ LSSENGGDAD GAALNEPKNL GTDPTTGEEL TLRSGRFGPY IQRGDGKEAK RASLPKGWKP EDIDYEKAMA LISLPRDIGK HPESGKMISS GIGRYGPFLL HDGSYANLET VEDVFSVGLN RAVTVIAEKA NQAPGRGARG TPAALKTLGD HPDGGAITVR DGKYGPYVNW GKVNATLPKG KDPQAITVEE ALALIAEKAG KTPVGKAAKT KAKPKAAAAE AKSTKTAAKP KATKAKAPAK SKKS
|
| |