Gene Msil_3847 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_3847 
Symbol 
ID7092543 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp4211872 
End bp4214805 
Gene Length2934 bp 
Protein Length977 aa 
Translation table11 
GC content65% 
IMG OID643467132 
ProductDNA topoisomerase I 
Protein accessionYP_002364091 
Protein GI217979944 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.0674028 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATGTCG TCATCGTCGA ATCGCCGGCG AAGGCTAAGA GCATTAATAA ATACCTCGGC 
AAGGGCTATG AGGTTTACCC GTCCTATGGC CATGTCCGCG ATTTGCCGGC CAAGGACGGA
TCGGTCGATC CCGACGCCGA TTTCGCCATG CTGTGGGACG TCGACGTCAA ATCCGCCAAG
CGGCTGAACG AGATCGCAAG GGCCGTCAAG GACGCCGACA AGATCATTCT GGCGACCGAC
CCCGATCGCG AAGGCGAGGC GATCTCCTGG CATGTCCTCG AGGTTTTAAA GGCGAAGAAA
GTCCTCAAGG ACAAGCCGGT CGAGCGCGTC GTCTTCAACG CCATCACCAA ATCCGCCGTG
CTGGAGGCGA TGCAGCATCC GCGTCAGATC GACGTGGCGC TGGTCGACGC CTATCTGGCG
CGCCGCGCCC TGGATTATCT GGTCGGCTTC AATCTCTCAC CGGTGCTGTG GCGCAAGCTT
CCCGGCGCCC GCTCGGCGGG GCGGGTGCAG TCCGTCGCGC TGCGGCTTGT TTGCGACCGT
GAGCTTGAGA TCGAGGCTTT CGTCGCACGA GAGTATTGGT CGATCGTCGC CCATCTCAGG
ACGCAGGCGG ACGCCCCGTT CACGGCGCGG CTCGTTGGCG CCGACGGCAA GAAGATTACG
CGCCTCGACG TCGGGACCGG GGCTGAGGCC GCCGCCTTCA AGACCGATCT CGAAAACGCG
ACGTTCCGCG TCGCCGAGAT CGAGGCCAAG CCCGCCAAGC GCCACCCGTC CGCGCCTTTC
ACGACCTCGA CGCTGCAGCA GGAGGCCTCG CGAAAGCTCG GCTTCGCGCC GGCCCGCACC
ATGCAACTCG CGCAGCGGCT CTATGAGGGC GCCGACATCG ACGGCGAGAC GGTCGGACTC
ATTACTTATA TGCGAACGGA CGGCGTCGAT CTCGCCCCCG AGGCGATCAC CAGCGCAAGG
TCGGTGATCG CCCGCCAGTT CGGCGACGCC TATGTGCCGA AGGCGCCGCG CAAATATACG
GTGAAGGCCA AGAATGCGCA GGAAGCGCAT GAGGCGATCC GTCCGACCGA CCTCGCCCGC
CTGCCGCGCG CCGTCGCCCG CGCGCTGGAG CCGGATCAGG CCAAGCTCTA TGAGCTGATC
TGGACCCGCA CCATCGCCAG CCAGATGGAA TCGGCCGAGC TTGAGCGCAC CACCGTGGAC
ATTGAGGCGC TGGCCGGAGC GCGCAAGCTT GATCTGCGCG CAACCGGTCA GGTCGTGCGC
TTCGACGGCT TTCTCAAACT TTATCAGGAA GGGCGCGACG ACGAAGACGA TGACGAAAGC
GCCCGCCTGC CGGATATGGC CAAGGACGAG CGCCTGAAGC GCGAACGCAT CGAGGCGGAC
CAGCATTTTA CCGAGCCGCC GCCGCGCTTC ACCGAGGCGA CGCTGGTCAA ACGCATGGAA
GAGCTCGGCA TCGGCCGCCC CTCGACCTAC GCCTCGACGC TCGCCGTGCT GCGCGAACGC
GATTACGTGC GGATCGAGAA GAAGCGCCTG CATCCGGAGG ACAAGGGACG TCTCGTCACG
GCCTTCCTCG AGGCGTTCTT CGCCCGTTAT GTCGGCTATG ACTTTACCGC CGACCTTGAG
GAAAAGCTCG ACCGGGTCTC GAACCATGAG ATCGACTGGA AGCAGGTGCT GCGCGATTTC
TGGCTGGATT TTTCGGGCGC CCTCGCCGGC ACCAAGGATC TGCGCACGAC GGAGGTGCTC
GACCGCCTGA ACGAGATTTT GGGGCCGCAT ATTTTTCCGC CGAAGGAAGA CGGCTCCGAT
CCGCGCGGCT GCCCGTCCTG CGGCGAAGGC AAGCTGTCGC TAAAGCTCGG CAAATTCGGC
GCCTTCATCG GCTGCTCCAA CTATCCCGAA TGCAAATTCA CGCGGACCCT CGCTGCGCCC
GACGGCGCCG AGGCGAATGG AGGCGAGCGT CCGGGCGTCA AATCGCTTGG CGTCGATCCC
GACAGCGGCG AAGAGATCAC CCTGCGCGAC GGCCGCTTCG GCGTCTATGT GCAGCAGGGC
GAAGGCGAGA AGCCGAAACG CTCCTCCTTG CCGCCGACGA TCGCGCCCGC CGATCTGACG
CTCGAACAGG CGATCGCACT CTTGTCGCTG CCGCGCGAAG TCGCCCGCCA CCCCGAATCG
AAAGAGCCGA TCGTCGCCGG CATCGGCCGT TACGGGCCCT ATGTCCAGCA CGGCAAGACC
TACGCCAATG TCGCCAAGGA CGAGGACATC ATCTCGATCG GCGCCAATCG CGCGATCGAC
CTCATCGTCG CCAAGGAAAG CGGCCTCACC GGACGCCGGT TTGGCGCCTC CGAGAGCGCT
CCGGCCCGCG TTTTGGGCGA ACATCCCGCC GGCGGCGCGG TCAGCGTCAA GGCCGGGCGC
TATGGGCCTT ATGTTACGCA TGGCAAGATC AACGCGACCT TGCCGAAGGA GGCCGACCCC
ACCACATTGA CGCTTGAGGG CGCCGTGGCG CTGCTCGCCG CCAAAGCGTC GGGCGGCGGG
GCGCCGATGC AGGGCCGTCT TCTCGGCGAT CATCCGTCCG GCGGCCCCAT CACCGTGCGC
GCCGGCCGTT TCGGCGCCTA TGTCAATCAC GGCAAGACCA ACGCCACTTT GAAGCGCGAC
GCCTCCGCCG AAACCATCAC CCTCGAAGAA GCGATCCGGC TGATCGAGGA CAAGGAAGCG
GCGGGCGGCG GGCCAAGAAA AGCGGCCGCG CCGAAGAAGG CGGCCAAATC TTCGGCCGCC
GCCAAGAAGG CGGCGCCGAA TGCAAAAAAC AAAACCGCGC CGGCCAATGA CGAAGATCCG
CCATTCGAGC CGTCGCCCAA CCGAAAACCC GCCGCCAAGG CGGCCGCGGC GGCGCACAAA
AAACCCGCCG CCAAGGCCGC GGCGGCAAAA CTGAAACAGG CAAAACAAGC GTGA
 
Protein sequence
MYVVIVESPA KAKSINKYLG KGYEVYPSYG HVRDLPAKDG SVDPDADFAM LWDVDVKSAK 
RLNEIARAVK DADKIILATD PDREGEAISW HVLEVLKAKK VLKDKPVERV VFNAITKSAV
LEAMQHPRQI DVALVDAYLA RRALDYLVGF NLSPVLWRKL PGARSAGRVQ SVALRLVCDR
ELEIEAFVAR EYWSIVAHLR TQADAPFTAR LVGADGKKIT RLDVGTGAEA AAFKTDLENA
TFRVAEIEAK PAKRHPSAPF TTSTLQQEAS RKLGFAPART MQLAQRLYEG ADIDGETVGL
ITYMRTDGVD LAPEAITSAR SVIARQFGDA YVPKAPRKYT VKAKNAQEAH EAIRPTDLAR
LPRAVARALE PDQAKLYELI WTRTIASQME SAELERTTVD IEALAGARKL DLRATGQVVR
FDGFLKLYQE GRDDEDDDES ARLPDMAKDE RLKRERIEAD QHFTEPPPRF TEATLVKRME
ELGIGRPSTY ASTLAVLRER DYVRIEKKRL HPEDKGRLVT AFLEAFFARY VGYDFTADLE
EKLDRVSNHE IDWKQVLRDF WLDFSGALAG TKDLRTTEVL DRLNEILGPH IFPPKEDGSD
PRGCPSCGEG KLSLKLGKFG AFIGCSNYPE CKFTRTLAAP DGAEANGGER PGVKSLGVDP
DSGEEITLRD GRFGVYVQQG EGEKPKRSSL PPTIAPADLT LEQAIALLSL PREVARHPES
KEPIVAGIGR YGPYVQHGKT YANVAKDEDI ISIGANRAID LIVAKESGLT GRRFGASESA
PARVLGEHPA GGAVSVKAGR YGPYVTHGKI NATLPKEADP TTLTLEGAVA LLAAKASGGG
APMQGRLLGD HPSGGPITVR AGRFGAYVNH GKTNATLKRD ASAETITLEE AIRLIEDKEA
AGGGPRKAAA PKKAAKSSAA AKKAAPNAKN KTAPANDEDP PFEPSPNRKP AAKAAAAAHK
KPAAKAAAAK LKQAKQA