Gene Rleg_1519 details


Gene Information

Locus tag: Rleg_1519
Symbol:
ID: 8015571
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 1505458
End bp: 1507782
Gene Length: 2325 bp
Protein Length: 774 aa
Translation table: 11
GC content: 60%
IMG OID: 644824106
Product: Integrase catalytic region
Protein accession: YP_002975348
Protein GI: 241204252
COG category:
COG ID:
TIGRFAM ID:
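
The coordinate and length fields above are consistent with one another, which a short sketch can confirm. It assumes that the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1505458
end_bp = 1507782

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 2325
print(protein_length)  # 774
```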


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.263526
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGCCG CGGCTAGAAG AGACTACACG CAGTTCGGCT TAAACAGGGA CGACCGTATC 
TATTTCGACA AACGACACTG GTGTTTGAAA ACCCGTAGTG CCGAACGTGA AACGGAGGAA
GAGTCCGCCA TGTGGGAGGC GGGCTACATT CTCGAAACCG TCGACGGCAA GTCGGAGCGC
CGTGTAGAAG CGCTCACGTT CCAGGACGTC GACAAGCTCT TCAGCGCTAG CAAGATCGAC
TGCGAGCGGG GTTACTTCAG TAGCCGTAAC GCCATCGACA GGAGGATGAA GTCGCCCAAG
ATCTTCGATC TTCCGGCGGC GACGATTCTG CGAGCCCGCA TGGTTTCGGA GTTCCTCGAG
CTGGAACTGG ACAGGTACGA CACAGGAGAA TTCTTCACGC GGTCCGACGC CGGCTACGCC
AAGTTCATCA AAGCGTTCAG AGCCGAGAAC GAATACCTCA TTCCACCGAA GACGGGGAAG
CGCGAGGTCG TCCCAGGACC GAGGCAGTTC GGCCGGCTGG TCGAACGGTT CGAAAACAAC
CTCTTCGAAC CGGGATCGCT GCTCCCCCGA CACCGCGGCG GCGTCGGGCA CAAATCTCGC
TTTACGCCCG ACGAAGTCAA ATTTCACGCC GAGCACGCGC AGAAGTATCA GTCCACGGAC
CGCCCGACCA AGCTCGACTG CTACGCCCAG ATGGCAGAGG CGAACGAAAA TCGAAAGGCC
GTTGGCGAGC AGGAGCATCA GGTCCCTTCC CTCAGGACCT TCCAGAGGTT GATCGACGAA
CTCGGAGATT ATCGTAACGA ACTGACCAGG GCTGGCGATC CCAGCCGGGT GACCAGGAAG
TTCGGCATGT CCCGGAACGG TTATCAGCCA ATGCGTCCGC TGGAATACAT CGAGATGGAC
GAGCACAAGC TCGACGTCAT CCGCCTGCTC GTGAAGAACG GCATCTGGGA ATATCTTCAC
CCCGACGTTC AAAGACGGAT CGAGGAGAAG ATCGGGGAGA CTAACGACGG TCGCGTCTGG
CTGTCGGTCG CACTTGACGC CTATTCGCGG TCCGTTTGCG GGGCCAAGCT GCTGTGGAGT
GCTCCTGACG GGGCGTCCGC AGTTGCGACG CTTGCAATGG TCGCGCGTCC CAAGGACTAC
GAGACCAAGC TCTACGGCAC CATGACCAAA TGGCCGCAGG GTGGCACGGC TGAATACATC
CACGTCGATG CAGGCGCGAG CTACACGTCG CTTGAGTTTC AACACGCCGC CTACTCCTAC
ACTGGCAACG TCGCCGTGAC GCCCTCGAAG CATCCGCATC TTCGCGGCCG CGTCGAACGC
TTCTTCCGGA CGATCAACCA ACGCTATATC CATCTGCTGT CGGGTCAGAC CTTCAGCAAC
GTCCTTCTCC GTGACCGATA CGACGCCCAG AAGTACCGGC ACATGACCGA CGATCAACTT
GCCGAATTTT TGGTCATGCT AATCGTCGAC TGCTACCACA ATACCAAGCA CCGTGGGCTC
TACGGCTTGA CACCACTTCA GGCCTGGTAT TTCGGAACGC AGTTTGGCAA CGGGACCGTG
CGCGGCTACC CCGGCGAGAG GGAATACGCC GAGGCCTTCA GCGTTGTCTC CCGATACAAG
GTTGGCAACG ACGGCATCAA GATCCTAGGC CTGCCGTATT CGAACCTCTG GCTCCAGCAG
CAGCGCGAAC GGCGGTACGA TCTCGAGGTG ATCGTCCGAT TCAACGAGAT CGACGTCTCG
CGCATCCAGG TGAAGGATCC CTTCTCCGAC GACTGGATGG AGGTTCCCTG CATCCTCGAC
GGCGTCAAAG GCTTAAGAGT TTTCGACTGG CTGGAACAGC TTAAGTATAT GCGTCGAAAG
TTCGGTCCTT CCGCGAAAAT CAGCGCGAAG GTTGCGCAGG CCGCGCTCGC GGGCGCCCGC
GCCTTCGCAA ACGCAAGCAA GTCAGCGGCA GGCATCGGCT CGCCGATCAC GACCCGCTCC
ATGCTCGACT ACTGGGACCA GCAGATGGTC GGAAGCTTCA CCATCGCACG CGAGGCGGCG
ACGGACTACG GCGACCTTGA AGCCTTCGAG GCATTCGACA CCACTCCGCG GCCCGATCCG
TTCGCGCCAT CGCATCCTCG GGCCGGCCAG TCCACGACGT CTGGAGACAA CGTGGACGCC
GGTCCGGCCG GGGCGGGCGA CCATGGGCCA GAGAGTTCGA CTTCCAAGCA CCGCGCGGCA
CCAGAGCCCA AATCAGCTCC AGCGAGGACG TCGCAAGTCG AAGAGCCCCT TCCGCAGCGC
CCCCGTTCAA TCCGCATCGA GAAAATCCGG AAGGACGACA AGTGA
 
Protein sequence
MSAAARRDYT QFGLNRDDRI YFDKRHWCLK TRSAERETEE ESAMWEAGYI LETVDGKSER 
RVEALTFQDV DKLFSASKID CERGYFSSRN AIDRRMKSPK IFDLPAATIL RARMVSEFLE
LELDRYDTGE FFTRSDAGYA KFIKAFRAEN EYLIPPKTGK REVVPGPRQF GRLVERFENN
LFEPGSLLPR HRGGVGHKSR FTPDEVKFHA EHAQKYQSTD RPTKLDCYAQ MAEANENRKA
VGEQEHQVPS LRTFQRLIDE LGDYRNELTR AGDPSRVTRK FGMSRNGYQP MRPLEYIEMD
EHKLDVIRLL VKNGIWEYLH PDVQRRIEEK IGETNDGRVW LSVALDAYSR SVCGAKLLWS
APDGASAVAT LAMVARPKDY ETKLYGTMTK WPQGGTAEYI HVDAGASYTS LEFQHAAYSY
TGNVAVTPSK HPHLRGRVER FFRTINQRYI HLLSGQTFSN VLLRDRYDAQ KYRHMTDDQL
AEFLVMLIVD CYHNTKHRGL YGLTPLQAWY FGTQFGNGTV RGYPGEREYA EAFSVVSRYK
VGNDGIKILG LPYSNLWLQQ QRERRYDLEV IVRFNEIDVS RIQVKDPFSD DWMEVPCILD
GVKGLRVFDW LEQLKYMRRK FGPSAKISAK VAQAALAGAR AFANASKSAA GIGSPITTRS
MLDYWDQQMV GSFTIAREAA TDYGDLEAFE AFDTTPRPDP FAPSHPRAGQ STTSGDNVDA
GPAGAGDHGP ESSTSKHRAA PEPKSAPART SQVEEPLPQR PRSIRIEKIR KDDK
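
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It relies on the fact that translation table 11 assigns internal codons the same amino acids as the standard genetic code; the alternative start codons that table 11 permits are not handled here. The names `CODON_TABLE`, `clean`, `translate`, and `gc_percent` are hypothetical helpers introduced for illustration, and only two short excerpts of the gene sequence are used.

```python
from itertools import product

# Codon table in the conventional T, C, A, G base order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of bases that are G or C."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 60 bases and last 15 bases of the gene sequence listed above.
first_line = "ATGTCCGCCG CGGCTAGAAG AGACTACACG CAGTTCGGCT TAAACAGGGA CGACCGTATC"
last_codons = "AAGGACGACA AGTGA"

print(translate(first_line))   # MSAAARRDYTQFGLNRDDRI
print(translate(last_codons))  # KDDK
```

The two outputs match the first twenty and last four residues of the protein sequence above. Passing the full gene sequence to `gc_percent` would give the value that the GC content field rounds to a whole percentage.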