Gene Rleg_1969 details


Gene Information

Locus tag: Rleg_1969
Symbol:
ID: 8013008
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 1962883
End bp: 1964307
Gene Length: 1425 bp
Protein Length: 474 aa
Translation table: 11
GC content: 59%
IMG OID: 644824557
Product: Integrase catalytic region
Protein accession: YP_002975789
Protein GI: 241204693
COG category:
COG ID:
TIGRFAM ID:
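
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. Neither convention is stated in the record, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1962883
END_BP = 1964307
GENE_LENGTH_BP = 1425
PROTEIN_LENGTH_AA = 474


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold for the values listed: 1964307 - 1962883 + 1 = 1425 bp, and 1425 / 3 - 1 = 474 aa.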


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.233913
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.387035
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCTTTA TGATCACCAT GTCGCAGAAA GAGTTGCATC GCCTTGAAGT CATCCAGAAG 
ATCCGTGACC TACGCCTGAG CGTTGTCCAG GCTGCCGAAC TGCTCGGGCT CAGTCGAAGT
CAGGTCCATC GGTTGCTGCA GGCCTATGAC CGGGATGGTC CAGCCGGCCT CGTTTCCAAG
AAACGATCGC AGCCGAGCAA CCGGCGCCAC AGCGAGGAGT TTCGCAATGC AGCGCTGGAT
CTGATCCGCG AGCACTATCT GGATTTCGGC CCGACGCTGG CTCGCGAGAA GCTGATCGAG
CTGCACCGGA TCTCTGTTGC TAAGGAGACG CTGCGGCAAT GGATGACCGA GGCCGGCATC
TGGATCTCGC GCCGGGAACG CAAGAAACGG GTTTTCCAGC CACGCGGCCG GCGCGACTGC
TTTGGCGAAC TGGTGCAGAT CGATGGTTCG CATCACTGGT GGTTCGAGAA CCGCGGCCCC
AAATGCGCCC TGCTCGTCTA TATCGACGAT GCGACCGGCA AGCTGCTGCA CCTGCGGTTC
GCCGGATCAG AGAACACGTT CGACTATCTG CATGCGACTA AGGCGTATTT GCAGCAATGG
GGCAAACCTC TCGCGTTCTA CAGCGACAAA CATGGGGTTT TCCGTAGCAC GCATGCGTCA
GAGAAAGACC GGACGAGCGG CCTGACGCAG TTTGGTCGTG CGCTTTATGA GCTGAACATC
GACATTATCT GTGCCAACAC TCCGCAAGCC AAGGGCCGTG TGGAACGCGC CAACCAGACA
TTGCAGGATC GGCTGGTGAA GGAGATGCGG CTTCGCGGCA TCGATACGAT CGAAGCTGCC
AATGCCTATG CGCCTGAGTT CATTGCGGAT TTCAATTCGC GGTTCGGCAA GCAACCGCGC
AATCCGAAGG ACATGCACCG ACCGCTGGCC GACCATGAGA ATCTCGATGG CGCCATGTGC
CGGAAGGAAG TCCGCACACT GTCGCAGTCG CTGACGCTCC GTTACGACAA GGTCTTGTTC
ATTCTCGATC CGACCGAGAT TTCCAGGCCA TTGGCGGGTA AGAAGGTCGT TGTCTGCGAC
TACCCGGACG GCCGCCTCGA GATCATGCAC GAGAGCTTCA CCCTGCCCTA CAGAACCTTC
GATACATTGC GGTCGGTCCA CCGGGCAGAG GTTGTCGAGA ACAAGCGTCT CGACGACATG
CTTTCCATCG TCGCCGAGCT GCAGGCTGGG CGGGAACAGC AGCGCAGCAA GAGCGGCCCT
CGCCGCACCG GCCAGACGGA TCATATGTTC GGTATTCCCG ATGGCAGCCA GGGCAATGGT
TATCAGAAGC GTGGCCGCAA GCCTGGCCGG CGGACGGACT TCATGAATGA TCCGGAGGTG
ATTGCCAAAA GGCAGAAGGC GTTGATGAGG CAGCCAGCGG AATGA
 
Protein sequence
MSFMITMSQK ELHRLEVIQK IRDLRLSVVQ AAELLGLSRS QVHRLLQAYD RDGPAGLVSK 
KRSQPSNRRH SEEFRNAALD LIREHYLDFG PTLAREKLIE LHRISVAKET LRQWMTEAGI
WISRRERKKR VFQPRGRRDC FGELVQIDGS HHWWFENRGP KCALLVYIDD ATGKLLHLRF
AGSENTFDYL HATKAYLQQW GKPLAFYSDK HGVFRSTHAS EKDRTSGLTQ FGRALYELNI
DIICANTPQA KGRVERANQT LQDRLVKEMR LRGIDTIEAA NAYAPEFIAD FNSRFGKQPR
NPKDMHRPLA DHENLDGAMC RKEVRTLSQS LTLRYDKVLF ILDPTEISRP LAGKKVVVCD
YPDGRLEIMH ESFTLPYRTF DTLRSVHRAE VVENKRLDDM LSIVAELQAG REQQRSKSGP
RRTGQTDHMF GIPDGSQGNG YQKRGRKPGR RTDFMNDPEV IAKRQKALMR QPAE
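
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the method used to produce the record: the helper names are hypothetical, the codon table holds the standard elongation assignments (which translation table 11 shares), and alternative start codons are not handled because this gene begins with ATG.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing above."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate("ATGTCCTTTA TGATCACCAT GTCGCAGAAA"))  # MSFMITMSQK
    # Last 15 bases, ending in the TGA stop codon.
    print(translate("CAGCCAGCGG AATGA"))  # QPAE
```

Passing the full gene sequence to `translate` and `gc_content` allows the result to be compared against the protein sequence and the GC content field of this record.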