Gene Rleg_5331 details


Gene Information

Locus tag: Rleg_5331
Symbol:
ID: 8007399
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 738176
End bp: 739216
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 62%
IMG OID: 644822236
Product: DNA polymerase LigD, ligase domain protein
Protein accession: YP_002973496
Protein GI: 241113661
COG category: [L] Replication, recombination and repair
COG ID: [COG1793] ATP-dependent DNA ligase
TIGRFAM ID: [TIGR02776] DNA ligase D; [TIGR02779] DNA polymerase LigD, ligase domain
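
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are not part of the record, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes one 3 bp stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields; names are illustrative.
start_bp = 738176
end_bp = 739216

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon (3 bp) that does not
# contribute a residue to the protein.
stop_codon_bp = 3
protein_length_aa = (gene_length_bp - stop_codon_bp) // 3

print(gene_length_bp, protein_length_aa)  # 1041 346
```

Under these assumptions the computed values agree with the listed Gene Length (1041 bp) and Protein Length (346 aa).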


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.51032
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCGCA CGCGCCGCCC ACTGCCCTTG CTCGACGAGT CTCACTCAAC GCTGCATTCT 
CGTCCGATCC GCAAACGCGA TCCTGACCAG CCCGGCCTGC CCTTCGATCC AATGCCGTCG
CGCGTCGAGC CCTGCCTCGC GCTGCTGAAG CCGACTGTGC CCATTGGGCC GGATTGGCTC
TATGAGGTGA AGCTGGATGG CTATCGATTG GCAATCCACG TTGAACCGAA GGGCGTGCGG
GTCATCACCC GTGGCGGCCA TGACTGGACC CATCGCTTCC CCACCATCGC CGCGGCAGCG
AAAGAGCTTG GCGTAACGAC CGCCATTCTC GATGGCGAGG CCGTTGTACT CGATGATAAC
GGCCGATCGG ATTTTGGCGC CCTGCAGCGT TCGCTCGGCG GGCGGGGAGG CAAGCGAGTA
TCGACCGAGT CGGTCCTCGT CGCCTTCGAC CTTCTCTATC TCGATGGACA CGATCTGACC
GGCACCGAGC TTGACGTACG CCGACACCTG CTCGAAGACC TGATACCGGG CGGCGACGAT
CAGACGATCC GCCTCTCCGA GCAGATAGAG CTGCCGGCCG AAGAACTCCT CGAGCACGCC
TGCCATCATC ATCTGGAAGG TATCATCGCC AAGCATCGCG ACCGGCCCTA CGGCAGTGGC
CGTACGGGCG ACTGGCTGAA GATCAAATGC GTCCAGAGCG AGAGCTTCAT GATCGTCGGT
TATGAGCAGT CCGCATCCGC CCGCGGCGGC ATCGGCAGGC TATTGCTGGC CGGCAGACGA
GGGCTCGACT GGATTTACGT TGGCTCCGTC GGAACTGGTT TCGGTGCCAG GGATGCTGAA
TACCTGAAAA AGACGCTGGA CCGGTTAAAG ACGAACCGGC CGGTCGTTCC GCTGAATGGC
AAGCGCCTCG TCCTCGTCCA GCCGACGCTG ATCGCTGAGA TCGAGTTTCG CGGCTGGACG
GATGACGGCA ATCTCCGCCA TGCTTCGTAC AAGGGGCTGC GCGAGGTCCA GGATAATGCC
GCAGTCTTCG ATATGACCTA A
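
A GC percentage such as the one listed under GC content can be computed directly from a sequence printed in blocks of ten bases. The following is a minimal sketch, not the database's own method: the function name `gc_percent` is hypothetical, and how the record rounds its figure is not stated, so no expected value is asserted for the full gene.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (the spaces and line breaks used to print the sequence
    in blocks of ten) is removed before counting.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Example input: the first printed row of the gene sequence. Paste all
# rows to evaluate the whole gene.
first_row = "ATGAAGCGCA CGCGCCGCCC ACTGCCCTTG CTCGACGAGT CTCACTCAAC GCTGCATTCT"
print(round(gc_percent(first_row), 1))
```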
 
Protein sequence
MKRTRRPLPL LDESHSTLHS RPIRKRDPDQ PGLPFDPMPS RVEPCLALLK PTVPIGPDWL 
YEVKLDGYRL AIHVEPKGVR VITRGGHDWT HRFPTIAAAA KELGVTTAIL DGEAVVLDDN
GRSDFGALQR SLGGRGGKRV STESVLVAFD LLYLDGHDLT GTELDVRRHL LEDLIPGGDD
QTIRLSEQIE LPAEELLEHA CHHHLEGIIA KHRDRPYGSG RTGDWLKIKC VQSESFMIVG
YEQSASARGG IGRLLLAGRR GLDWIYVGSV GTGFGARDAE YLKKTLDRLK TNRPVVPLNG
KRLVLVQPTL IAEIEFRGWT DDGNLRHASY KGLREVQDNA AVFDMT