Gene Rleg2_4247 details


Gene Information

Locus tag: Rleg2_4247
Symbol:
ID: 6983020
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 4422564
End bp: 4424267
Gene Length: 1704 bp
Protein Length: 567 aa
Translation table: 11
GC content: 59%
IMG OID: 643398975
Product: integrase family protein
Protein accession: YP_002283732
Protein GI: 209551815
COG category:
COG ID:
TIGRFAM ID:
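
The length fields above follow from the coordinate fields by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon; the record does not state either convention, and the function names are hypothetical.

```python
# Sketch only: relates the coordinate fields to the reported lengths.
# Assumption (not stated in the record): coordinates are 1-based and inclusive.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(4422564, 4424267)
print(length)                     # 1704
print(protein_length_aa(length))  # 567
```

Both printed values agree with the Gene Length and Protein Length fields of this record.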


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTAATGC GGATCTCCCG ACCCATGAAG CGCGCTGGCA CGAAGAACGA GCAATTTAAG 
AAGCGCGTGC CCACTGCTCT CCTGCCGGTT TTACGCGGCA AGAAGTTCGC AGTTGACCTC
CCTCAGACAG TCGCTCCTGA CAGCCCGACA TATACAGTCA CGGTCACGCT TTCCGACATG
ATCACCTTTT CGCTGGGCGC TCCAGCCTCG CGTCTGGCTG CGATCCGTTA CGCCGCAGCC
TTAGGCCATG TTGAAGCCTT CCTAGAGGCG GCGCAGCGCG GGCCTGAAGA CCTTTCGCAC
TTGCAGATCA CCGCACTGTT AGGAGACGTC CACAAGGCAC TCCTTGCCCA GTATGAGGCC
AATCCCCCAG CGCGGCACAA GGTCGTCGTT GATGGTGGCG TCGAAGAGTG GAGCGAGCTT
TCAGAGTGGC AGGACATGAT CGCTGACGCC GAGGTGGCCG TCCTAACGAT GACACCGCAG
GGCAGAGCCG CCGCTATCTC ACGGGTTAGG CGCATCATCG ACATGGACGC CTTTCTGTCC
GCACGCGCAT TGCTGCTGTC GGATAAGTCC TATCGCGATC TGGTCGAAGG ACTTCCGGCA
ATCTTAAGGC GGGTCGTGGA AACCCTCTCA AATCGCAGCC AGGGCAACTA CTCGGCCGAC
CCTTACGCAC CGCAATACCC CCAGTGGAAA GCCAAGACCG CTGCTAAACA GCGTGCGACT
GATGGCGATG TGAAGACCTT TGACGATCTC TTCGACCGCT GGAAGGCGGC AGATAAGCGC
GCGGCTTCTA CGCTCTCGAC ATGGCGCGGC TACTTGGCGC GGTTTAAGCA GTTCGTGGGC
CATGATGATC CGCACCGGGT CGAGCGTGTC GACGCCCTGC GATGGAAAGA CGCGCTGGTA
GCCGAAGGTC TCAAGAAGAT CTCAACAACC TACCTCGCAG CGCTCAACAC GCTTTACCGC
TTCGGCCTCA ATAACAGCGA GACGACTGGC ATCACCCGCA ATCCCTTCGA TGGCGTGAAG
GCACCGCAGA AGGCTACTGC AGGGACAAAG CGTTTGCCTT TCACCCGCGT TGAGGTCGCG
GTGATACTTA ATGCTGGGCG CAAGGAAAAG CTTGCGCACC TCCGTTGGAT ACCGTGGCTC
CAAGCCCAGA CAGGATCGCG TGTCGCAGAG ATAGCACAGC TGTGGGCAAC CATGGTTATC
ACCGATGATG CCGGTCACCC GTGCCTGCAC ATTACAACGG CACCGGATGG CGGGTCTCTC
AAGAACGAAG GATCTGAGAG GGTCGTACCG ATCCACCCTG ATCTCATTGA GGATGGCTTC
CTTGAGTTCG TCAAGACACG CGGCAAGGGG CCGTTGTTCT ACGGCGGCAG CAAGGGTAAG
GCCGCAGTTC GGTTGCGAGA TGACCAGAAG CACCCGTCGA AGGGTGTCAG CAATCGGGTT
GGAACTTGGG TGCGGGGACT GGGCATTACG GACCCGCGCA AGGGGCCAAC GCATTCATTC
CGGCATTGGT TTAAGTCAGA GCTTCCGCGC CGGTCGGGGT GCAACATCCG CTTGGTTGAT
GCGATCCAAG GCCACTCTGC TGAAAGTGAT GCGGCTGGAT ATCATCATTC GGAAACATCG
GAGATGCTTG AGGCTATCTC AAAGCTCGAC CTCAAAGGGC TGGCCGACAG CGTGCCTATG
GCCGAGCAAC CGGCAGACGC ATAG
 
Protein sequence
MVMRISRPMK RAGTKNEQFK KRVPTALLPV LRGKKFAVDL PQTVAPDSPT YTVTVTLSDM 
ITFSLGAPAS RLAAIRYAAA LGHVEAFLEA AQRGPEDLSH LQITALLGDV HKALLAQYEA
NPPARHKVVV DGGVEEWSEL SEWQDMIADA EVAVLTMTPQ GRAAAISRVR RIIDMDAFLS
ARALLLSDKS YRDLVEGLPA ILRRVVETLS NRSQGNYSAD PYAPQYPQWK AKTAAKQRAT
DGDVKTFDDL FDRWKAADKR AASTLSTWRG YLARFKQFVG HDDPHRVERV DALRWKDALV
AEGLKKISTT YLAALNTLYR FGLNNSETTG ITRNPFDGVK APQKATAGTK RLPFTRVEVA
VILNAGRKEK LAHLRWIPWL QAQTGSRVAE IAQLWATMVI TDDAGHPCLH ITTAPDGGSL
KNEGSERVVP IHPDLIEDGF LEFVKTRGKG PLFYGGSKGK AAVRLRDDQK HPSKGVSNRV
GTWVRGLGIT DPRKGPTHSF RHWFKSELPR RSGCNIRLVD AIQGHSAESD AAGYHHSETS
EMLEAISKLD LKGLADSVPM AEQPADA