Gene Rleg2_5705 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5705 
SymbolligD 
ID6977096 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011366 
Strand
Start bp103682 
End bp106333 
Gene Length2652 bp 
Protein Length883 aa 
Translation table11 
GC content64% 
IMG OID643393162 
ProductATP-dependent DNA ligase 
Protein accessionYP_002277980 
Protein GI209546090 
COG category[L] Replication, recombination and repair 
COG ID[COG1793] ATP-dependent DNA ligase
[COG3285] Predicted eukaryotic-type DNA primase 
TIGRFAM ID[TIGR02776] DNA ligase D
[TIGR02777] DNA ligase D, 3'-phosphoesterase domain
[TIGR02778] DNA polymerase LigD, polymerase domain
[TIGR02779] DNA polymerase LigD, ligase domain 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAGTG ACAACCTCTC GACATACCGA TCGAAGCGCG ACTTTAAAAA AACGGCGGAG 
CCGAGCGGCG AGAAGCAGAT CGCGCGCAGC AATCGCCGTC GTTTCATCAT CCAGAAACAT
GACGCCACCC GGCTGCATTA CGACCTGCGG CTCGAGCTCG ACGGCGTCTT CAAATCCTGG
GCTGTCACCA AAGGCCCTTC GCTCGATCCG CACGACAAGC GGCTGGCCGT CGAGGTCGAA
GATCACCCGC TCGATTATGG TGATTTTGAA GGCACGATCC CGAAAGGGCA ATATGGCGGC
GGCACGGTGA TGTTGTGGGA CCGCGGCTAT TGGGAGCCTG AGGGAAAGAA GAGCCCGGAG
CAGGCGCTTG CCAAGGGCGA CTTCAAGTTC ACCCTCGAAG GCGAAAGGCT GCACGGCAGC
TTCGTCCTGG TGCGCATGCG CAATGACCGC GACGGCGGCA AGCGAACCAA CTGGCTGCTG
ATCAAGCACC ACGACGCGTT CTCGGTTGAG GAGAATGGCG CAGCCGTCCT GGAGGAGAAC
GACACCTCCG TCGCCTCCGG CCGGACGATG GATGCGATCG CCACGGGCAA GGGCCGCAAG
CCGAAGCCGT TCATGATCAA GGGCGGGGAT ATCAAGGCGG ATGCGGTCTG GGACAGCAAC
CAGGGGCTGG CCGCCGAAGA GAGGAAAGAG GAGGGTCGAG CCGGCAGGAC GCCGAAAACG
GCGGCAAAGG TCGACCTCCC GGATTTCATC GCGCCACAGC TTTGCCAGAC CCTGGAGCGG
CCGCCGGCCG GCGAGGGCTG GATTCACGAG ATCAAGTTCG ACGGCTACCG TATCCAGATG
CGCATACTCG ACGGCGAGGT AACGCTGAAG ACCCGCAAAG GGCTTGACTG GACCGGCAAA
TATCCAGAGG TCGCCGAGGC GGCATCGGCG CTGCCCGACG CCATCATCGA CGGCGAGATC
TGCGCTCTCG ACGACCATGG CGTACCGGAT TTCGCAGCCC TGCAGGCCGC CCTTTCGGAA
GGCCGGACCG GCGATCTCGT CTATTTCGCC TTCGATCTTC TTTACGAGGG TGGCGAGGAC
TTGCGATCCT TGCCTGTGAT CGAGCGCAAG GCGCGGCTTC AGAGCCTGCT TGCGGATGCC
GGTGACGACC CTCGGCTCCG TTTCGTCGAG CATTTCGAGA CCGGCGGCGA TGCGGTCCTT
CGCTCCGCCT GCAAGCTCTC CCTGGAAGGC ATCGTCTCCA AGCAGAGCGA TGCGCCCTAT
CAATCCGGCC GCACCGACAG CTGGGCCAAG TCGAAATGCC GCGCCGGCCA CGAAGTGGTG
ATCGGCGCCT ATGCCAAGAC CAACGGCAAA TTCCGATCGC TGCTGGTCGG CGTCTACCGC
GGCGACCACT TCGTCTATGT CGGCCGCGTC GGCACGGGAT ATGGCGCCAA GAAAGTGGAA
ACGTTGCTTC CGAAACTGCA AGCGCTCGAG ACGGCAAAAT CGCCCTTCAC CGGCATCGGT
GCGCCGAAGA AGGAGGCCGA GGTCACTTGG GTGAAGCCCA AGCTCGTGGC GGAAATCGAG
TTCGCCGGCT GGACCGCCGA CGGCATCGTC CGCCAGGCGG CCTTCAAGGG CCTGCGGGAG
GACAAGCCCG CCAGGGAGGT CAAGGCCGAA CGGCCGGCCA AGCCAGCGGA GACCGACGTG
CCGGAGCCGG AGCCGGCGGC GGAGGTGAAG GCGAGGCCAG CCCGCCGAAA GGGCGCCAAA
GCCGAGGTCA TGGGCGTGAT GATCTCCAAT CCGGACAAGC CGCTTTGGCC GGATGCCAAT
GACCGCAAAC CGGTGACCAA GGAGGAGCTG GCCCGCTATT ATGAAGCCGT CGGCGGCTGG
ATGATCGCGC ATATCGAGGG GCGCCCCTGT TCGATCATCC GTGCGCCCGA TGGGCTGGGC
GGCGAGCAGT TCTTCCAGCG CCATGCAATG CCCGGCACCT CGAACCTTCT CGAACTGGTC
AAGGTCTTCG GCGATAAAAA GCCCTATCTG CAGATCGACC GGGTCGAGGG CCTAGCGGCT
GTCGCGCAGA TCGGCGCCGT CGAGTTGCAC CCCTGGAATT GCCAACCGCA CAAGCCGGAG
GTGCCGGGCC GGCTGGTGTT CGACCTCGAC CCTGGCCCCG ACGTGCCGTT CTCGACAGTC
GTTTCCGCCG CCCGCGAGAT GCACGATCGG CTCGACGCGC TCGGTCTCGT CAGTTTCTGC
AAGACCACAG GCGGCAAGGG CCTGCACGTC GTCACGCCGC TTGCGATCAA TAAGCGCAAG
CCGCTTTCCT GGGCAGAGGC GAAGAGTTTT GCGCACGACG TCTGCCAGCA GATGGCGCGC
GACAATCCCG ATCTCTATCT GATCAAGATG ACCAAGAGCC TCAGGAACGG CCGTATCTTC
CTCGACTATC TCCGCAACGA CCGGATGGCG ACGGCCGTGG CGCCATTGTC GCCCCGTGCC
CGGCCGGGCG CCACCGTCTC GATGCCGCTG ACCTGGACCC AGGTCAAATC CGATCTCGAC
CCGAAACGCT TTACCATCCG CACCGTGCCG GCACTGCTGT CGAAATCGTC GGCCTGGGAA
GATTATAGCG ATGGCCAGCG CCCGCTGGAG CAGGCGATCA AACGGCTGAG CAAAGTCTCG
AACGCGGCGT GA
 
Protein sequence
MASDNLSTYR SKRDFKKTAE PSGEKQIARS NRRRFIIQKH DATRLHYDLR LELDGVFKSW 
AVTKGPSLDP HDKRLAVEVE DHPLDYGDFE GTIPKGQYGG GTVMLWDRGY WEPEGKKSPE
QALAKGDFKF TLEGERLHGS FVLVRMRNDR DGGKRTNWLL IKHHDAFSVE ENGAAVLEEN
DTSVASGRTM DAIATGKGRK PKPFMIKGGD IKADAVWDSN QGLAAEERKE EGRAGRTPKT
AAKVDLPDFI APQLCQTLER PPAGEGWIHE IKFDGYRIQM RILDGEVTLK TRKGLDWTGK
YPEVAEAASA LPDAIIDGEI CALDDHGVPD FAALQAALSE GRTGDLVYFA FDLLYEGGED
LRSLPVIERK ARLQSLLADA GDDPRLRFVE HFETGGDAVL RSACKLSLEG IVSKQSDAPY
QSGRTDSWAK SKCRAGHEVV IGAYAKTNGK FRSLLVGVYR GDHFVYVGRV GTGYGAKKVE
TLLPKLQALE TAKSPFTGIG APKKEAEVTW VKPKLVAEIE FAGWTADGIV RQAAFKGLRE
DKPAREVKAE RPAKPAETDV PEPEPAAEVK ARPARRKGAK AEVMGVMISN PDKPLWPDAN
DRKPVTKEEL ARYYEAVGGW MIAHIEGRPC SIIRAPDGLG GEQFFQRHAM PGTSNLLELV
KVFGDKKPYL QIDRVEGLAA VAQIGAVELH PWNCQPHKPE VPGRLVFDLD PGPDVPFSTV
VSAAREMHDR LDALGLVSFC KTTGGKGLHV VTPLAINKRK PLSWAEAKSF AHDVCQQMAR
DNPDLYLIKM TKSLRNGRIF LDYLRNDRMA TAVAPLSPRA RPGATVSMPL TWTQVKSDLD
PKRFTIRTVP ALLSKSSAWE DYSDGQRPLE QAIKRLSKVS NAA