Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_0932 |
Symbol | |
ID | 5321773 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 1007892 |
End bp | 1010636 |
Gene Length | 2745 bp |
Protein Length | 914 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640789872 |
Product | DNA topoisomerase I |
Protein accession | YP_001326622 |
Protein GI | 150396155 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.231933 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.891499 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGTGGC GTCGGTATTG GCGCGCATGG TCGCCAAAGC AATTGTTTCA GAGAATGTCG ATGAATGTTG TAGTGGTGGA ATCGCCTTCC AAGGCCAAGA CGATCAACAA GTACCTGGGC CCCGGCTACA AGGTGCTTGC CTCGTTCGGC CACGTGCGTG ACCTGCCTGC CAAGGACGGA TCGGTTCGCC CCGACGAAGA CTTCGAGATG TCCTGGGAGG TCGACGGCGC TTCCGCAAAG CGGATGAAAG ACATCGCGGA CGCGGTGAAA GCGTCCGACG GTCTCATTCT GGCAACCGAC CCCGACCGCG AGGGCGAGGC GATCTCCTGG CATGTGCTCG ACCTTCTCAG AAAAAAGAAG GTTCTCGGCG ACAAGCCGGT CAAGCGCGTC GTCTTCAATG CGATCACAAA AAAGGCCGTG CTCGATGCGA TGGCTCAACC TCGCGACATC GATGCCTCGC TGGTCGACGC ATATCTCGCC CGCCGCGCCC TCGACTACCT GGTCGGCTTC AACCTTTCGC CCGTGCTCTG GCGCAAGCTG CCGGGCGCGC GCTCGGCGGG CCGCGTGCAG TCGGTGGCGC TGCGCCTCGT CTGCGACCGC GAATCGGAAA TCGAGCGCTT CGTCACCGAA GAATACTGGA ATATTTCGGC ACTTCTGAAG ACGCCCCGCG GTGACGAATT CGAAGCGCGC CTCGTTTCGG CGGACGGCAA GCGGCTACCG GCAAAGGCAA TCGGCAACGG CGAGGAAGCC AACCGGTTGA AGACCCTGCT CGACGGCGCA AGCTACGTGG TCGAGAGCGT CGAGGCCAAG CCGGTCAAAC GCAACCCCTC GCCCCCCTTC ACGACTTCGA CGCTGCAGCA GGCGGCCTCC TCCAAGCTCG GCTTCTCGGC TTCGCGCACC ATGCAGGTGG CGCAGAAGCT CTACGAAGGC ATCGATATCG GCGGCGAGAC GGTCGGCCTC ATTACCTATA TGCGTACTGA CGGCGTGCAG ATGGCGCCGG AGGCGATCGA CGCCGCACGC CAGGCGATCG GCAGCCAGTT CGGCGAACGT TATCTGCCTG AAAAGCCGCG TTTCTATTCG ACCAAGGCAA AAAATGCCCA GGAAGCGCAC GAGGCCATTC GCCCCACCGA TTTCAACCGT ACCCCCGACC AGGTCCGCCG CTTCCTCGAT GGCGACATGC TGAGGCTCTA CGACCTCGTC TGGAAGCGCG GTATAGCGAG CCAGATGGCG TCCGCGGAGA TAGAGCGCAC CACCGCCGAG ATTGTCGCCG ACAATGCTGG CAAGAGAGCA GGTCTCAGGG CAACCGGCTC GGTCATCCGC TTCGACGGTT TCATCGCGGC CTATACCGAC ATGAAGGAAG ACGGCGAACA GACGGATGAT GGCGACGAAG ACGGCCGCCT TCCGGAAATC AACGCGCGTG AAAACCTCGC CAAGCAGAAG ATCAACGCAA GCCAGCACTT CACCGAACCC CCGCCGCGTT ATTCGGAAGC GACGCTCATC AAGAAAATGG AAGAGCTCGG TATCGGTCGC CCCTCCACCT ACGCGGCGAC CGTCACGACG CTTATCGACC GCGATTACGT GGAGATCGAC AAGCGCAAGC TGTTGCCGCA GGCCAAGGGC CGCCTTGTCA CGGCATTTCT CGAGAGCTTC TTTACCCGCT ACGTCGAATA CGATTTTACC GCATCGCTCG AAGAAAAGCT GGATCGGATT TCGGCCGGCG AACTCAACTG GAAGGACGTA CTGCGCGACT TCTGGAAGGA CTTCTTCTCC CAGATCGAAG ACACCAAGGA ACTGCGCGTC ACCAATGTGC TCGATGCGCT CAACGAGGAA CTGGCTCCGC TCGTCTTTCC GAAGCGGGAA GATGGTGGCG ATCCGCGCAT CTGTCAGGTC TGCGGCACCG GCAAACTCTC CTTGAAGCTC GGTAAATACG GCGCCTTCGT CGGGTGCTCG AACTATCCGG AATGCAACTA TACACGCCAG CTTTCCTCTG ACAGCAGCGG TGACGCGGAA GCAGCCGCCT CGAACGAGCC CCAAAGCCTC GGCAAGGACC CGCATACCGG CGAGGAAATC ACGCTGCGCA ACGGCCGCTT CGGCCCCTAT GTCCAGCGCG GCGACGGCAA GGAGGCGAAA CGCGCCAGTC TGCCGAAGGG CTGGACGCCG GCAACGATCG ATCACGAAAA GGCGCTTGCC CTTCTCTCCC TGCCACGCGA CCTCGGTCCG CATCCGGAAA CCGGCAAGAT GATCTCCGCC GGGATCGGCC GCTATGGCCC CTTCGTACTC CACAACGGAA CCTATGCCAA TCTGGAGTCG GTCGAGGACG TCTTCTCGAT CGGTCTCAAC CGCGCGACTT CCGTCCTGGC CGACAAGCAG TCCAAGGGCG CCGGCGGTGC GGGCGGCCGC ACCGGCGCGG CAGCCGTGAA GGAACTTGGG GAACATCCTG ACGGCGGGGC GATTACTGTT CGCGACGGCC GCTACGGACC CTATGTCAAC TGGGGCAAGG TGAATGCCAC CCTGCCTCGG GGCAAGGACC CGCAATCGGT CACGGTCGAG GAAGCGCTTG CGTTCATCGC CGAGCGAGCA GCGAAAGGCG GCGTGACGAA AGGCAAGACG GCCAAGGGGA AGTCGGCCGG GAGAAAACAA GCCGGAACGA AGACTGCCAA GGCGGCCGGA ACTGCGACAG CTGAGAAGCC GAAACGCGCA GCAAAGACGA AAACGAAGTC GGCCGCGAAG GCCAAGAAGG ACTGA
|
Protein sequence | MSWRRYWRAW SPKQLFQRMS MNVVVVESPS KAKTINKYLG PGYKVLASFG HVRDLPAKDG SVRPDEDFEM SWEVDGASAK RMKDIADAVK ASDGLILATD PDREGEAISW HVLDLLRKKK VLGDKPVKRV VFNAITKKAV LDAMAQPRDI DASLVDAYLA RRALDYLVGF NLSPVLWRKL PGARSAGRVQ SVALRLVCDR ESEIERFVTE EYWNISALLK TPRGDEFEAR LVSADGKRLP AKAIGNGEEA NRLKTLLDGA SYVVESVEAK PVKRNPSPPF TTSTLQQAAS SKLGFSASRT MQVAQKLYEG IDIGGETVGL ITYMRTDGVQ MAPEAIDAAR QAIGSQFGER YLPEKPRFYS TKAKNAQEAH EAIRPTDFNR TPDQVRRFLD GDMLRLYDLV WKRGIASQMA SAEIERTTAE IVADNAGKRA GLRATGSVIR FDGFIAAYTD MKEDGEQTDD GDEDGRLPEI NARENLAKQK INASQHFTEP PPRYSEATLI KKMEELGIGR PSTYAATVTT LIDRDYVEID KRKLLPQAKG RLVTAFLESF FTRYVEYDFT ASLEEKLDRI SAGELNWKDV LRDFWKDFFS QIEDTKELRV TNVLDALNEE LAPLVFPKRE DGGDPRICQV CGTGKLSLKL GKYGAFVGCS NYPECNYTRQ LSSDSSGDAE AAASNEPQSL GKDPHTGEEI TLRNGRFGPY VQRGDGKEAK RASLPKGWTP ATIDHEKALA LLSLPRDLGP HPETGKMISA GIGRYGPFVL HNGTYANLES VEDVFSIGLN RATSVLADKQ SKGAGGAGGR TGAAAVKELG EHPDGGAITV RDGRYGPYVN WGKVNATLPR GKDPQSVTVE EALAFIAERA AKGGVTKGKT AKGKSAGRKQ AGTKTAKAAG TATAEKPKRA AKTKTKSAAK AKKD
|
| |