Gene Rleg2_3916 details


Gene Information

Locus tag: Rleg2_3916
Symbol:
ID: 6982680
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 4063041
End bp: 4064687
Gene Length: 1647 bp
Protein Length: 548 aa
Translation table: 11
GC content: 62%
IMG OID: 643398639
Product: Heparinase II/III family protein
Protein accession: YP_002283404
Protein GI: 209551487
COG category: [S] Function unknown
COG ID: [COG5360] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
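
The length fields in this record are consistent with inclusive, 1-based coordinates: End bp minus Start bp plus one gives the gene length, and one third of that minus the stop codon gives the protein length. A minimal Python sketch of that check follows. The function names are illustrative and are not part of the source database, and the one-stop-codon rule is an assumption that holds only for an unspliced CDS like this one.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming a single terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


# Coordinates taken from the Start bp / End bp fields of this record.
length = gene_length(4063041, 4064687)
print(length)                  # 1647
print(protein_length(length))  # 548
```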


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.729618
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.655355
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTATGTTC GGGAGGCTTG GCGGCGCGCC TCGCGCCGCG TCGCGTTGCT GCGCCTAAAG 
CTCTTCCGCC ATTCAATCAA CGTGCCCGAG CGTCTGATCG TCGCGCCGAC CGATCTCCGC
AGCATCGATT CGCATGTGGC CGACGAGATT CTCAACGGAC GTTTTCTGCT GGCCGGGCGG
ATGCTGGAAA CGAGTGGAAA GTCACCCTTC ACCTTTACCT TGCCCTCGCG CCCCTTCGCG
ATCCGCCTTC ATAGCTTCGG CTGGCTGCGG CATATCCGGG CGAACAAGAC GGAGCGCAGT
TCGGCGGCGG CCCGCGCGAT CGTCGACAGC TGGCTTTCCA TCCATGCCGG GCGCATGGAA
GGGATTGCAT GGGAGATCGA CGTCACCGCG CAGCGCGTCA TTGCCTGGCT CTCGCATTCG
CCGGTGGTGC TGCAGAATGC CGACCGCGGC TTTTATCGCC GCTTCATGAA GTCGCTGGCC
TTCCAAGTGA GGTTCCTGCA CCGCATGGCG CCGTATACGC TTGGCGGCCT GGAGCTGTTT
CGGCTGCGTA TCGCGCTCGC CATGGCCTCC GTCGCCATGC CTGCCCGCGC ATCGACGCTC
AGACGGGCGG CGCAGGCGCT CGACCGCGAA TTCGATAGCC AGATTCTACC GGATGGCGGC
CATATCTCGC GCAATCCGCG CGTCGGTCTG GAATTGCTGC TCGATCTGCT GCCGCTGAGA
CAGACCTATG TCAATCTCGG CCATGACCTG CCGCAGAAGC TGATCTCCGG CATCGACCGC
ATCTATCCGG CTCTGCGGTT CTTTCGCCAT CAGGACGGGG ATCTGGCGCT GTTTAACGGG
GCGACCTCGA CGCTGGCAAA CGAGCTGATT TCCGTGCTGC GCTATGACGA GACCGCCGGT
CAGCCGTTCA AGGCTTTGCC GCAGTCGCGC TATCAGCGGC TTTCCGGCGG AAAAACAGTC
ATCATTGCCG ATGTCGGCAC GCCGCCTTCG GGCGGCGCGT TGCGGACCGT TCATGCCGGC
AGCCTCTCCT TCGAAATGTC GTCGGGCCGC CACCGCTTCA TCGTCAATTC CGGCTCGCCT
AAATTTGCCG GGCACCGCTA TGTCCAGATG GCCCGCACGA CGGCCGCGCA TTCGACGGTT
ATTTTGAACG ACACTTCGTC CAGCCGCTTT TCGCCTTCGC CCTTCCTCAA TCACGCGATT
ACCGAACCGG TGAGAACAGT GACCGTCGAG CGTGCCGAAA CGGAGGATGG ACGTGACGGC
ATCAAGCTCA GCCATGACGG TTATCTCAGG GCGTTCGGGG TGCTGCACGA ACGTGAGCTG
ACACTCAATG CCGCAGGCTC GATCGTGACC GGGCGCGACC GGCTCGTCGT CCGGGAAGGG
TATGAACACG ACGAGCCCTT GAAGGCCGTC GCTCGTTTCC ACATTCATCC CTCCATCGTC
CTGCAGCAGA GCGACGGGGA GTCCGTGCTG CTGACAGCGC CGGACGGCGA AAGCTGGCTG
TTTTCGGCGC CCGGCAACGA AGTGCTGATC ACCGAGGACA TCTTCTTTGC CGACAGCTCC
GGCATTTGCG GTTCGGATCA GATCGAAATC GACTTCGATC TTGCCGAGAA GACGGAAATC
CGCTGGTTTT TGTCCCGCAA GGGATAG
 
Protein sequence
MYVREAWRRA SRRVALLRLK LFRHSINVPE RLIVAPTDLR SIDSHVADEI LNGRFLLAGR 
MLETSGKSPF TFTLPSRPFA IRLHSFGWLR HIRANKTERS SAAARAIVDS WLSIHAGRME
GIAWEIDVTA QRVIAWLSHS PVVLQNADRG FYRRFMKSLA FQVRFLHRMA PYTLGGLELF
RLRIALAMAS VAMPARASTL RRAAQALDRE FDSQILPDGG HISRNPRVGL ELLLDLLPLR
QTYVNLGHDL PQKLISGIDR IYPALRFFRH QDGDLALFNG ATSTLANELI SVLRYDETAG
QPFKALPQSR YQRLSGGKTV IIADVGTPPS GGALRTVHAG SLSFEMSSGR HRFIVNSGSP
KFAGHRYVQM ARTTAAHSTV ILNDTSSSRF SPSPFLNHAI TEPVRTVTVE RAETEDGRDG
IKLSHDGYLR AFGVLHEREL TLNAAGSIVT GRDRLVVREG YEHDEPLKAV ARFHIHPSIV
LQQSDGESVL LTAPDGESWL FSAPGNEVLI TEDIFFADSS GICGSDQIEI DFDLAEKTEI
RWFLSRKG
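
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a dependency-free illustration, not the database's own pipeline. It uses the standard sense-codon assignments, which translation table 11 shares. Table 11 differs in its permitted start codons, which this sketch does not model (the gene here starts with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... ("*" = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 21 and last 27 bases of the gene sequence listed above.
print(translate("ATGTATGTTC GGGAGGCTTG G"))        # MYVREAW
print(translate("CGCTGGTTTT TGTCCCGCAA GGGATAG"))  # RWFLSRKG
```

Passing the full gene sequence to `translate` should reproduce the protein sequence, and `gc_percent` can be compared with the GC content field after rounding.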