Gene Rleg_6784 details

Gene Information

Locus tag: Rleg_6784
Symbol:
ID: 8022714
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012858
Strand:
Start bp: 220393
End bp: 221400
Gene Length: 1008 bp
Protein Length: 335 aa
Translation table: 11
GC content: 61%
IMG OID: 644833651
Product: putative DNA topoisomerase I protein
Protein accession: YP_002984785
Protein GI: 241666701
COG category: [L] Replication, recombination and repair
COG ID: [COG3569] Topoisomerase IB
TIGRFAM ID:
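
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 220393
END_BP = 221400
GENE_LENGTH_BP = 1008
PROTEIN_LENGTH_AA = 335


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1008
print(coding_codons(GENE_LENGTH_BP))   # 335
```

Under these assumptions both results agree with the reported gene length (1008 bp) and protein length (335 aa).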


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGCCG AAGCCATCAC CGACCTTGGT CTTGTCTATG TCAGCGACAC CGAACCAGGC 
ATCCGCAGGC GAAGGAAGGG TAAGGGCTTC AGCTATGTGA TGCCCGACGG TACGACGCTT
GCCGACGAAT TGCAGCGGGC GCGCATAGGC GCGCTCGGTC TGCCCCCAGC CTATGAGAAT
GTCTGGATCT GCCTCTACGA CAACGGCCAT TTGCAGGCGA CAGGCTTCGA TGCGCGCGGG
CGCAAGCAAT ACCGCTACCA TAAGGAATGG CAATCCTTCC GAAGTGCGGG AAAATTCCAT
CAATTGATCG AGTTCGGCCG GGCGCTGCCT CGAATACGCC GCACCGTGCT GCGCCATCTC
GATACCGGTG CAGAGGATGT CAATGGCGTG CTTGCGGCTT TGACGACGCT GCTCGACGAG
GCGCACCTCC GCGTCGGCAA TCAGGCCTAT GTCAGGGAGA ACGGCACCTA TGGCGCAACG
ACGCTGCTAA AACGCCACCT GAAGATCGTC GACGGGCAGA TCGAGCTGAA ATTCCGTGCG
AAAGGTGGCA AGCGCGTCCA GCGCAGCCTC AAGCATCCGA GGCTGCAGAA GATCCTGGAG
GAGATAGCCG ACCTGCCAGG CCGCCAACTC TTCGTCTGGA AGGACGAAAG CGGGACGCTG
AAGCCAATCG ATTCCGGGCG ATTGAACGCC TATCTGGCCG AGATATCCGG CATTCCGATT
TCGGCGAAGA CCTTTCGCAC CTGGGCCGGA TCGCTGGCGG CTTTCGGAGC GGCGCGCGAG
ACGATCCTCG GTGGCGGCCG GCCGACCGTG AAGCAGATGT CGGAGGCCGC GGCCGAGGCG
CTACACAACA CACCGGCGAT CTCGCGCTCG AGCTATATCC ATCCCGCGAT CATCTCGCTC
GCCGGCAACG ATCATCCGCT GATCGAGACT GGCAACGAGC CGCTGCGGGG CTTGCGGGCC
GAGGAAAACA GGCTACTTGA TTTCCTCACA AGCGAGATCG AAGAATGA
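
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before its length or GC content can be recomputed. The following is a minimal sketch of that step, shown on the first line of the sequence only. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the code assumes the sequence contains only the letters A, C, G and T.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, as displayed.
first_line = "ATGAATGCCG AAGCCATCAC CGACCTTGGT CTTGTCTATG TCAGCGACAC CGAACCAGGC"
seq = clean_sequence(first_line)

print(len(seq))                    # 60
print(f"{gc_fraction(seq):.0%}")   # GC content of this line only
```

Passing the full sequence text to the same two functions should give a length and GC percentage that can be compared with the "Gene Length" and "GC content" fields.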
 
Protein sequence
MNAEAITDLG LVYVSDTEPG IRRRRKGKGF SYVMPDGTTL ADELQRARIG ALGLPPAYEN 
VWICLYDNGH LQATGFDARG RKQYRYHKEW QSFRSAGKFH QLIEFGRALP RIRRTVLRHL
DTGAEDVNGV LAALTTLLDE AHLRVGNQAY VRENGTYGAT TLLKRHLKIV DGQIELKFRA
KGGKRVQRSL KHPRLQKILE EIADLPGRQL FVWKDESGTL KPIDSGRLNA YLAEISGIPI
SAKTFRTWAG SLAAFGAARE TILGGGRPTV KQMSEAAAEA LHNTPAISRS SYIHPAIISL
AGNDHPLIET GNEPLRGLRA EENRLLDFLT SEIEE
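
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand rather than relying on a library. It uses the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons, and this gene begins with ATG). The function name `translate` is an illustrative choice.

```python
from itertools import product

# Codons in TCAG order paired with their amino acids ('*' marks a stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate an in-frame DNA string, ignoring spaces and line breaks."""
    seq = "".join(text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases and the final line of the gene sequence above.
print(translate("ATGAATGCCG AAGCCATCAC CGACCTTGGT"))
# MNAEAITDLG
print(translate("GAGGAAAACA GGCTACTTGA TTTCCTCACA AGCGAGATCG AAGAATGA"))
# EENRLLDFLTSEIEE*
```

Both fragments match the start and end of the listed protein sequence. The trailing `*` is the TGA stop codon, which is why the 1008 bp gene yields 335 amino acids rather than 336.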