Gene Rleg2_1789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_1789 
Symbol 
ID6980526 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp1836121 
End bp1837548 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content59% 
IMG OID643396510 
ProductIntegrase catalytic region 
Protein accessionYP_002281300 
Protein GI209549383 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.239535 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCTGTT TGATCACCAT GTCGCAGAAA GAATTGCATC GCCTCGAAGT TGTTCAGAAG 
ATCCGTGATC GACGCCTGAG CGTCGTCCAG GCGACTGAGA TGCTCGATCT CAGCCGAAGT
CAGGTTCATA GGCTGCTGCA GGCTTATGAT CGTTCCGGTG AAGCCGGTCT TGTTTCGAAG
AAGCGATCAC GGCCGAGCAA TCGGCGCCAC AGCGAGGATT TTCGCAATGC GGCGCTGGAT
CTGATCCGCG AACGCTATCC GGATTTCGGT CCGACGTTAG CACGCGAGAA ACTGATCGAA
CTGCATCAGA TTTCGGTCGC CAAGGAGACG TTACGCCAAT GGATGACCGA GGCCGGCATC
TGGATCTCGC GACGCGAACG CAAGAAGCGG ATTTTTCAGC CGCGCGGCCG GCGGGATTGT
CTCGGCGAGC TCGTCCAGAT CGACGGTTCG CATCACTGGT GGTTTGAGAA CCGCGGTCCC
AAATGCGCCC TGCTCGTCTA CATCGATGAT GCTACCGGCA AGCTCTTGCA TCTGCGGTTC
GCCGGATCGG AGAACACCTT CGACTACCTG CATGCGACGA AGGCCTATCT GCAGCAATGG
GGCAAACCTC TGGCATTCTA TAGCGACAAG CACGGTGTCT TCCGCCCGGT CCATGCATCG
GAGAAAGACC GGACGAGCGG CCTGACGCAG TTCGGTCGTG CCCTTTATGA GCTCAACATC
GATATCATCT GCGCCAACAC CCCGCAGGCC AAAGGTCGCG TGGAACGCGC CAATCAGACG
CTGCAGGATC GATTGGTCAA GGAACTGCGG CTGCGCGGCA TCGACACGAT CGAGGCTGCA
AACCTCTATG CACCGGCGTT CATTGCGGAT TTCAATGCTC GTTTCGGCAA GCCGCCGCGC
AATCCGAAGG ACATGCATCG GCCGCTGGCC GCACATGAGA ACCTCGATGC CGCCATGTGC
CGGAAGGAGG TCCGCACGCT GTCGCAGTCC CTGACGCTAC GCTACGACAA GGTCCTGTTC
ATCCTCGATC CGACGGAACA GGCCAAGGCG CTGGCGGGTA AGAAGGTCGT CGTCTGCGAC
TATCCGGATG GTCGCCTCGA GATCATGCAC GAGAGTTTTG CCCTGCCCTA CAGAACCTTC
GATAAGTTGC GATCGATACA TAGACCCGAA GTCGTCGACA ACAAGCGTTT GGACGACATG
CTTTCGATCG TCGCTGAGCT GCAGGCTGGA CGTGAACTGC AGCGCAGTAA GAGCGGCCCG
CGCCGCACCG GTCAGACGGA TCATATGTTT GGGATACCGG ATGGTAGCCA GGGCAACGGT
TATCAAAAGC GTGGTCGCAA GCCCGGCCCG CGGACGGACT TCATGAACGA TCCGGAAGTC
ATTGCCAAAC GGCAGAAGGC GCTGGCGCGG ATGGAGGCTG CCGAATGA
 
Protein sequence
MSCLITMSQK ELHRLEVVQK IRDRRLSVVQ ATEMLDLSRS QVHRLLQAYD RSGEAGLVSK 
KRSRPSNRRH SEDFRNAALD LIRERYPDFG PTLAREKLIE LHQISVAKET LRQWMTEAGI
WISRRERKKR IFQPRGRRDC LGELVQIDGS HHWWFENRGP KCALLVYIDD ATGKLLHLRF
AGSENTFDYL HATKAYLQQW GKPLAFYSDK HGVFRPVHAS EKDRTSGLTQ FGRALYELNI
DIICANTPQA KGRVERANQT LQDRLVKELR LRGIDTIEAA NLYAPAFIAD FNARFGKPPR
NPKDMHRPLA AHENLDAAMC RKEVRTLSQS LTLRYDKVLF ILDPTEQAKA LAGKKVVVCD
YPDGRLEIMH ESFALPYRTF DKLRSIHRPE VVDNKRLDDM LSIVAELQAG RELQRSKSGP
RRTGQTDHMF GIPDGSQGNG YQKRGRKPGP RTDFMNDPEV IAKRQKALAR MEAAE