Gene Rleg_1387 details


Gene Information

Locus tag: Rleg_1387
Symbol:
ID: 8012480
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 1374676
End bp: 1377330
Gene Length: 2655 bp
Protein Length: 884 aa
Translation table: 11
GC content: 63%
IMG OID: 644823972
Product: DNA topoisomerase I
Protein accession: YP_002975218
Protein GI: 241204122
COG category: [L] Replication, recombination and repair
COG ID: [COG0550] Topoisomerase IA
TIGRFAM ID: [TIGR01051] DNA topoisomerase I, bacterial
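
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based inclusive coordinates, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 1374676
end_bp = 1377330

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length)     # 2655
print(remainder)       # 0
print(protein_length)  # 884
```

Under those assumptions the numbers agree with the listed Gene Length (2655 bp) and Protein Length (884 aa).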


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.903307
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.17492
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGTTG TAGTGGTGGA ATCGCCTGCC AAGGCCAAGA CGATCAATAA GTATCTGGGT 
CCGGGATACA AAGTGCTCGC CTCCTTCGGC CATGTGCGCG ATCTGCCTGC CAAGGACGGC
TCCGTTCTTC CTGATCAGGA TTTCGAAATG CTTTGGGAGG TCGATAGCGC CTCCGCCAAG
CGGATGAAGG ACATTGCCGA CGCGGTGAAG TCCGCCGATG GCCTGTTTCT CGCGACCGAC
CCGGATCGCG AAGGCGAAGC GATTTCCTGG CACGTTCTCG ACATGCTGAA CAAGAAGCGC
GTGCTGAACG GCAAACCGGT CAAGCGCGTC GTCTTCAATG CGATCACCAA GAAAGCGGTG
CTCGACGCGA TGGCCGATCC GCGCGACATC GACGTGCCGC TGGTCGATGC CTATCTCGCG
CGCCGCGCGC TCGACTATCT CGTCGGCTTC AATCTTTCGC CGGTGCTGTG GCGCAAGCTG
CCCGGCGCAC GTTCGGCCGG CCGCGTCCAG TCTGTGGCGC TGCGTCTCGT CTGCGACCGC
GAATCCGAGA TCGAGCGCTT CATTTCCGAA GAGTACTGGA ACATCTCAGC GCTCCTGAAG
ACGCCGCGCG GCGACGAGTT CGAGGCAAGA CTGGTTTCGG CGAATGGCAA ACGACTGCAG
CCGCGCGCAA TCGGTAACGG CGAGGATGCC GGCCGGCTCA AGGCCTTGCT CGAAGGCGCG
AGCTACGTCG TCGACTCTGT CGAGGCGAAA CCGGTCAAAC GTAATCCCGG ACCGCCCTTC
ACGACCTCGA CGCTGCAACA GGCCGCCTCC TCCAACCTCG GCTTCTCGGC CTCGCGCACC
ATGCAGGTTG CCCAAAAGCT CTATGAGGGC GTCGATATCG GCGGCGAGAC GGTCGGTCTG
ATCACCTATA TGCGAACCGA CGGCGTGCAG ATGGCGCCCG AGGCGATCGA CGCGGCGCGC
AGTGCCATCG TCGACCAGTT CGGCGAACGC TACATGCCGG AGAAGCCGCG CTTCTATTCG
ACCAAGGCGA AGAACGCCCA GGAGGCGCAC GAGGCGATCC GTCCGACCGA TTTCGACCGC
TCGCCCGACC GTGTCCGCAA ATTCCTCGAT GCCGACCAGA TCCGGCTCTA CGAGCTGATC
TGGAAGCGCG GCATCGCCAG CCAGATGGCG TCGGCCGAGA TCGAGCGCAC GACGGCTGAA
ATCACCGCCG ACAACAAGGG CGAGAAGGCC GGGCTTCGTG CCGTCGGTTC GGTCATCCGA
TTCGATGGTT TCATCGCCGC CTATACCGAC CAGAAGGAAG ATGGCGAGCA GAGCGACGAT
GGCGACGAGG ATGGCCGTCT GCCGGAGATC ATTGCGCGCG AGGCGCTCGC CAAGCAGAAG
ATCAATTCGA CGCAGCATTT CACCGAACCG CCGCCGCGCT ATTCGGAAGC GACGCTGATC
AAGAAGATGG AAGAGCTCGG CATCGGCCGC CCCTCCACCT ATGCCGCGAC GCTGGCAACG
CTGCGCGACC GCGACTATGT GACGATCGAC AAGCGCAAGC TGATCCCGCA GGCCAAGGGC
CGGCTGGTGA CGGCTTTCCT CGAGAGCTTC TTCACCAAAT ATGTCGAATA CGACTTCACC
GCCGATCTCG AAGAGAAGCT TGACCGGATT TCCGCCGGCG AGTTGAACTG GAAGCAGGTG
CTGCGCGATT TCTGGAAGGA TTTCTTCGCC CAGATCGAGG ACACCAAGGA ACTGCGCGTC
ACCAACGTGC TGGATTCGCT GAACGAGGCG CTGGCACCGC TCGTCTTCCC GAAACGGGAG
GACGGCAGCG ACCCCAGAAT CTGTCAGGTC TGCGGCACCG GCAACCTGTC GCTGAAGCTC
GGCAAATACG GCGCCTTCGT CGGCTGCTCG AACTATCCGG ACTGCAACTA CACCCGCCAG
CTCTCCTCCG AAAACGGCGG AGATGCGGAT GGTGCTGCGC TCAACGAGCC GAAGAACCTC
GGCACCGATC CGACCACCGG CGAGGAACTG ACGCTGCGTT CCGGCCGCTT CGGCCCCTAT
ATCCAGCGCG GCGACGGCAA GGAAGCCAAA CGGGCCTCGC TGCCGAAGGG CTGGAAACCC
GAGGACATCG ATTATGAAAA GGCGATGGCG CTGATCTCGC TGCCGCGCGA TATCGGCAAA
CATCCCGAAT CGGGCAAGAT GATCTCGTCA GGCATCGGCC GCTATGGGCC GTTCCTCCTG
CATGACGGTT CCTATGCCAA TCTGGAAACC GTCGAGGACG TGTTCTCGGT CGGCCTCAAC
CGCGCCGTGA CTGTTATCGC CGAAAAGGCG AACCAGGCAC CCGGCCGGGG CGCGCGCGGC
ACACCGGCGG CACTAAAGAC GCTCGGCGAT CATCCTGATG GCGGCGCCAT TACCGTTCGC
GACGGCAAAT ACGGCCCCTA TGTCAACTGG GGCAAGGTCA ACGCCACACT GCCGAAGGGC
AAGGATCCCC AGGCGATCAC CGTCGAGGAG GCGCTCGCCT TGATCGCCGA GAAGGCCGGA
AAGACACCTG TGGGCAAAGC GGCCAAGACG AAAGCCAAGC CGAAGGCAGC GGCCGCCGAA
GCCAAGAGCA CCAAGACGGC CGCCAAACCG AAGGCAACGA AGGCGAAGGC GCCCGCGAAA
TCGAAAAAGA GCTGA
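
The gene sequence above is printed in space-separated blocks of ten bases, so it needs light cleaning before any computation. The following is a minimal sketch, not the tool that produced this record, of how the GC content and the translation could be reproduced. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares with the standard code; table 11 differs in which start codons it permits, and that does not matter here because the gene begins with ATG.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAATGTTG TAGTGGTGGA ATCGCCTGCC AAGGCCAAGA CGATCAATAA GTATCTGGGT"
last_line = "TCGAAAAAGA GCTGA"

print(translate(first_line))  # MNVVVVESPAKAKTINKYLG
print(translate(last_line))   # SKKS
```

The two printed fragments match the start and end of the protein sequence listed in this record. Passing the full gene sequence to `gc_content` would give a figure to compare with the listed GC content.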
 
Protein sequence
MNVVVVESPA KAKTINKYLG PGYKVLASFG HVRDLPAKDG SVLPDQDFEM LWEVDSASAK 
RMKDIADAVK SADGLFLATD PDREGEAISW HVLDMLNKKR VLNGKPVKRV VFNAITKKAV
LDAMADPRDI DVPLVDAYLA RRALDYLVGF NLSPVLWRKL PGARSAGRVQ SVALRLVCDR
ESEIERFISE EYWNISALLK TPRGDEFEAR LVSANGKRLQ PRAIGNGEDA GRLKALLEGA
SYVVDSVEAK PVKRNPGPPF TTSTLQQAAS SNLGFSASRT MQVAQKLYEG VDIGGETVGL
ITYMRTDGVQ MAPEAIDAAR SAIVDQFGER YMPEKPRFYS TKAKNAQEAH EAIRPTDFDR
SPDRVRKFLD ADQIRLYELI WKRGIASQMA SAEIERTTAE ITADNKGEKA GLRAVGSVIR
FDGFIAAYTD QKEDGEQSDD GDEDGRLPEI IAREALAKQK INSTQHFTEP PPRYSEATLI
KKMEELGIGR PSTYAATLAT LRDRDYVTID KRKLIPQAKG RLVTAFLESF FTKYVEYDFT
ADLEEKLDRI SAGELNWKQV LRDFWKDFFA QIEDTKELRV TNVLDSLNEA LAPLVFPKRE
DGSDPRICQV CGTGNLSLKL GKYGAFVGCS NYPDCNYTRQ LSSENGGDAD GAALNEPKNL
GTDPTTGEEL TLRSGRFGPY IQRGDGKEAK RASLPKGWKP EDIDYEKAMA LISLPRDIGK
HPESGKMISS GIGRYGPFLL HDGSYANLET VEDVFSVGLN RAVTVIAEKA NQAPGRGARG
TPAALKTLGD HPDGGAITVR DGKYGPYVNW GKVNATLPKG KDPQAITVEE ALALIAEKAG
KTPVGKAAKT KAKPKAAAAE AKSTKTAAKP KATKAKAPAK SKKS