Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_3847 |
Symbol | |
ID | 7092543 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | - |
Start bp | 4211872 |
End bp | 4214805 |
Gene Length | 2934 bp |
Protein Length | 977 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643467132 |
Product | DNA topoisomerase I |
Protein accession | YP_002364091 |
Protein GI | 217979944 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.0674028 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATGTCG TCATCGTCGA ATCGCCGGCG AAGGCTAAGA GCATTAATAA ATACCTCGGC AAGGGCTATG AGGTTTACCC GTCCTATGGC CATGTCCGCG ATTTGCCGGC CAAGGACGGA TCGGTCGATC CCGACGCCGA TTTCGCCATG CTGTGGGACG TCGACGTCAA ATCCGCCAAG CGGCTGAACG AGATCGCAAG GGCCGTCAAG GACGCCGACA AGATCATTCT GGCGACCGAC CCCGATCGCG AAGGCGAGGC GATCTCCTGG CATGTCCTCG AGGTTTTAAA GGCGAAGAAA GTCCTCAAGG ACAAGCCGGT CGAGCGCGTC GTCTTCAACG CCATCACCAA ATCCGCCGTG CTGGAGGCGA TGCAGCATCC GCGTCAGATC GACGTGGCGC TGGTCGACGC CTATCTGGCG CGCCGCGCCC TGGATTATCT GGTCGGCTTC AATCTCTCAC CGGTGCTGTG GCGCAAGCTT CCCGGCGCCC GCTCGGCGGG GCGGGTGCAG TCCGTCGCGC TGCGGCTTGT TTGCGACCGT GAGCTTGAGA TCGAGGCTTT CGTCGCACGA GAGTATTGGT CGATCGTCGC CCATCTCAGG ACGCAGGCGG ACGCCCCGTT CACGGCGCGG CTCGTTGGCG CCGACGGCAA GAAGATTACG CGCCTCGACG TCGGGACCGG GGCTGAGGCC GCCGCCTTCA AGACCGATCT CGAAAACGCG ACGTTCCGCG TCGCCGAGAT CGAGGCCAAG CCCGCCAAGC GCCACCCGTC CGCGCCTTTC ACGACCTCGA CGCTGCAGCA GGAGGCCTCG CGAAAGCTCG GCTTCGCGCC GGCCCGCACC ATGCAACTCG CGCAGCGGCT CTATGAGGGC GCCGACATCG ACGGCGAGAC GGTCGGACTC ATTACTTATA TGCGAACGGA CGGCGTCGAT CTCGCCCCCG AGGCGATCAC CAGCGCAAGG TCGGTGATCG CCCGCCAGTT CGGCGACGCC TATGTGCCGA AGGCGCCGCG CAAATATACG GTGAAGGCCA AGAATGCGCA GGAAGCGCAT GAGGCGATCC GTCCGACCGA CCTCGCCCGC CTGCCGCGCG CCGTCGCCCG CGCGCTGGAG CCGGATCAGG CCAAGCTCTA TGAGCTGATC TGGACCCGCA CCATCGCCAG CCAGATGGAA TCGGCCGAGC TTGAGCGCAC CACCGTGGAC ATTGAGGCGC TGGCCGGAGC GCGCAAGCTT GATCTGCGCG CAACCGGTCA GGTCGTGCGC TTCGACGGCT TTCTCAAACT TTATCAGGAA GGGCGCGACG ACGAAGACGA TGACGAAAGC GCCCGCCTGC CGGATATGGC CAAGGACGAG CGCCTGAAGC GCGAACGCAT CGAGGCGGAC CAGCATTTTA CCGAGCCGCC GCCGCGCTTC ACCGAGGCGA CGCTGGTCAA ACGCATGGAA GAGCTCGGCA TCGGCCGCCC CTCGACCTAC GCCTCGACGC TCGCCGTGCT GCGCGAACGC GATTACGTGC GGATCGAGAA GAAGCGCCTG CATCCGGAGG ACAAGGGACG TCTCGTCACG GCCTTCCTCG AGGCGTTCTT CGCCCGTTAT GTCGGCTATG ACTTTACCGC CGACCTTGAG GAAAAGCTCG ACCGGGTCTC GAACCATGAG ATCGACTGGA AGCAGGTGCT GCGCGATTTC TGGCTGGATT TTTCGGGCGC CCTCGCCGGC ACCAAGGATC TGCGCACGAC GGAGGTGCTC GACCGCCTGA ACGAGATTTT GGGGCCGCAT ATTTTTCCGC CGAAGGAAGA CGGCTCCGAT CCGCGCGGCT GCCCGTCCTG CGGCGAAGGC AAGCTGTCGC TAAAGCTCGG CAAATTCGGC GCCTTCATCG GCTGCTCCAA CTATCCCGAA TGCAAATTCA CGCGGACCCT CGCTGCGCCC GACGGCGCCG AGGCGAATGG AGGCGAGCGT CCGGGCGTCA AATCGCTTGG CGTCGATCCC GACAGCGGCG AAGAGATCAC CCTGCGCGAC GGCCGCTTCG GCGTCTATGT GCAGCAGGGC GAAGGCGAGA AGCCGAAACG CTCCTCCTTG CCGCCGACGA TCGCGCCCGC CGATCTGACG CTCGAACAGG CGATCGCACT CTTGTCGCTG CCGCGCGAAG TCGCCCGCCA CCCCGAATCG AAAGAGCCGA TCGTCGCCGG CATCGGCCGT TACGGGCCCT ATGTCCAGCA CGGCAAGACC TACGCCAATG TCGCCAAGGA CGAGGACATC ATCTCGATCG GCGCCAATCG CGCGATCGAC CTCATCGTCG CCAAGGAAAG CGGCCTCACC GGACGCCGGT TTGGCGCCTC CGAGAGCGCT CCGGCCCGCG TTTTGGGCGA ACATCCCGCC GGCGGCGCGG TCAGCGTCAA GGCCGGGCGC TATGGGCCTT ATGTTACGCA TGGCAAGATC AACGCGACCT TGCCGAAGGA GGCCGACCCC ACCACATTGA CGCTTGAGGG CGCCGTGGCG CTGCTCGCCG CCAAAGCGTC GGGCGGCGGG GCGCCGATGC AGGGCCGTCT TCTCGGCGAT CATCCGTCCG GCGGCCCCAT CACCGTGCGC GCCGGCCGTT TCGGCGCCTA TGTCAATCAC GGCAAGACCA ACGCCACTTT GAAGCGCGAC GCCTCCGCCG AAACCATCAC CCTCGAAGAA GCGATCCGGC TGATCGAGGA CAAGGAAGCG GCGGGCGGCG GGCCAAGAAA AGCGGCCGCG CCGAAGAAGG CGGCCAAATC TTCGGCCGCC GCCAAGAAGG CGGCGCCGAA TGCAAAAAAC AAAACCGCGC CGGCCAATGA CGAAGATCCG CCATTCGAGC CGTCGCCCAA CCGAAAACCC GCCGCCAAGG CGGCCGCGGC GGCGCACAAA AAACCCGCCG CCAAGGCCGC GGCGGCAAAA CTGAAACAGG CAAAACAAGC GTGA
|
Protein sequence | MYVVIVESPA KAKSINKYLG KGYEVYPSYG HVRDLPAKDG SVDPDADFAM LWDVDVKSAK RLNEIARAVK DADKIILATD PDREGEAISW HVLEVLKAKK VLKDKPVERV VFNAITKSAV LEAMQHPRQI DVALVDAYLA RRALDYLVGF NLSPVLWRKL PGARSAGRVQ SVALRLVCDR ELEIEAFVAR EYWSIVAHLR TQADAPFTAR LVGADGKKIT RLDVGTGAEA AAFKTDLENA TFRVAEIEAK PAKRHPSAPF TTSTLQQEAS RKLGFAPART MQLAQRLYEG ADIDGETVGL ITYMRTDGVD LAPEAITSAR SVIARQFGDA YVPKAPRKYT VKAKNAQEAH EAIRPTDLAR LPRAVARALE PDQAKLYELI WTRTIASQME SAELERTTVD IEALAGARKL DLRATGQVVR FDGFLKLYQE GRDDEDDDES ARLPDMAKDE RLKRERIEAD QHFTEPPPRF TEATLVKRME ELGIGRPSTY ASTLAVLRER DYVRIEKKRL HPEDKGRLVT AFLEAFFARY VGYDFTADLE EKLDRVSNHE IDWKQVLRDF WLDFSGALAG TKDLRTTEVL DRLNEILGPH IFPPKEDGSD PRGCPSCGEG KLSLKLGKFG AFIGCSNYPE CKFTRTLAAP DGAEANGGER PGVKSLGVDP DSGEEITLRD GRFGVYVQQG EGEKPKRSSL PPTIAPADLT LEQAIALLSL PREVARHPES KEPIVAGIGR YGPYVQHGKT YANVAKDEDI ISIGANRAID LIVAKESGLT GRRFGASESA PARVLGEHPA GGAVSVKAGR YGPYVTHGKI NATLPKEADP TTLTLEGAVA LLAAKASGGG APMQGRLLGD HPSGGPITVR AGRFGAYVNH GKTNATLKRD ASAETITLEE AIRLIEDKEA AGGGPRKAAA PKKAAKSSAA AKKAAPNAKN KTAPANDEDP PFEPSPNRKP AAKAAAAAHK KPAAKAAAAK LKQAKQA
|
| |