Gene Rleg2_2090 details

Gene Information

Locus tag: Rleg2_2090
Symbol:
ID: 6980829
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 2148541
End bp: 2150154
Gene Length: 1614 bp
Protein Length: 537 aa
Translation table: 11
GC content: 63%
IMG OID: 643396812
Product: Alpha-N-arabinofuranosidase
Protein accession: YP_002281600
Protein GI: 209549683
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3507] Beta-xylosidase
TIGRFAM ID:
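
The start, end, gene length and protein length fields can be cross-checked with simple arithmetic. The sketch below assumes inclusive 1-based coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Codons in the CDS minus one stop codon (assumes an unspliced CDS)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
start_bp, end_bp = 2148541, 2150154
length = gene_length(start_bp, end_bp)
print(length)                           # 1614
print(expected_protein_length(length))  # 537
```

Both results agree with the Gene Length and Protein Length fields.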


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.00759162
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.572245
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCGCA ATCCCATCCT GCCCGGGTTC AACCCCGATC CGTCGATCTG CCGCGTGGGC 
GCGGACTATT ATATCGCGAC CTCGACCTTC GAATGGTATC CCGGCGTGCA GATCCACCAT
TCGCGCGACC TGGTGAACTG GACGCTGGTG CGCCGGCCGC TGGAACGCCG GTCGCAGCTC
GACATGCGCG GCAATCCCGA CAGCTGCGGC ATCTGGGCGC CGTGTCTTTC CTATGCCGAC
GGGCAGTTCT GGCTTGTTTA TACCGACGTC AAGCGCTTCG ATGGCAGTTT CAAGGACGCG
CCGAACTATA TCGTCACCGC GCCTGCCATC GAGGCCGAAT GGTCCGAGCC GGTGTACGTC
AATTCCTCCG GCTTCGATCC CTCGCTGTTC CACGACGATG ACGGCCGCAA GTGGTTCCTC
AACATGCAGT GGAACCACCG CACCGAAAGC TATGGCGGCT CGCCGAAATC GCCGGCCTTC
GACGGTATCC TGCTGCAGGA ATGGGACCCG GTGACGAAGG CCCTGAAAGG CCCGCTCCGC
AATATTTTCG CCGGCAGTCC GCTCGGCCTG GTCGAGGGCC CGCACCTCTT CAAGCGCAAT
GGCTGGTACT ATCTGACGAC CGCGGAAGGC GGTACCGGCT ATGACCACGC CGTCACCATG
GCGCGCTCGC GCCGCATCGA AGGCCCTTAC GAGATGCATC CTAACATGCA TCTCATCACC
TCCAAGGATC ATCCGGGCGC GGTGCTGCAG CGGGCAGGGC ACGGCCAATA TGTCGAGACG
CCGGACGGTG AGGCCTATCA CACCCATCTC TGCGGCCGGC CTCTACCGCC GAAGCGGCGC
TGCACGCTGG GGCGAGAGAC GAGCCTGCAG AAATGCGTCT GGCGCGACGA TGACTGGCTC
TATCTCGAAA ATGGCACCTC GGTGCCCGAT GTCGATGTGC CCGGCCTCTT CGGCGCCGTG
CCTGCGGAAA AGCCGATGCG CAGCGAATAC AGCTTCGATG GCGGCACCCT GCTGGCCGAT
TTCCAATGGC TGCGCACGCC CGAGCCCGAG CGCATCTTCA ACCTGACGGA CCGCCCCGGC
CATCTCAGGC TGATTGCGCG CGAAAGCATC GGCTCCTGGT TCGAGCAGGC TTTGGTTGCC
CGCCGGCAGG AGCATCACAG CTTCCGCGCC GAGACCGTGG TCGAGTTCTC GCCCGACACT
TATCAGCAGG TCGCGGGGCT GACGCATTAT TACAACCGGC ATAAATTCCA TGCCGTTGCC
GTGACGCTGC ACGAAACACT CGGCCGCTGC GTGACGATCC TCTCCTGCAA TGGCGATTAT
CCGAACGGAC GCCTGAGCTT CCCCGCCGAA AGCGATGTGG CGATCGCTGC TGAGGGCCGT
GTCCAGCTCG CCATGGAAAT TCGCGAGAAC GATCTGCAAT TCTTCTGGCA GACCGAAGGC
AAGGGCGCCT GGCAGCCGAT CGGCCCGATC CTCGACGCCG GCATGATTTC CGACGAGGGC
GGGCGCGGCG AACACGGTTC CTTCACCGGC GCCTTCGCTG GCGTGTTTGC CTTCGATACG
TCGGGACGCG GGAAGATCGC GGATTTCGAC TGGTTCAACT ATGACGAATT GTGA
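
The gene sequence is printed in blocks of 10 bases, so the grouping whitespace has to be removed before computing anything from it. As a minimal sketch, the helpers below (hypothetical names) strip that whitespace and compute the G+C fraction. It is an assumption that this corresponds to how the GC content field was derived.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, used only as a short demonstration.
first_line = "ATGATCCGCA ATCCCATCCT GCCCGGGTTC AACCCCGATC CGTCGATCTG CCGCGTGGGC"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(f"{gc_fraction(seq):.0%}")   # GC percentage of this 60-base excerpt
```

Passing the full 1614 bp sequence to the same functions gives a value that can be compared with the GC content field.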
 
Protein sequence
MIRNPILPGF NPDPSICRVG ADYYIATSTF EWYPGVQIHH SRDLVNWTLV RRPLERRSQL 
DMRGNPDSCG IWAPCLSYAD GQFWLVYTDV KRFDGSFKDA PNYIVTAPAI EAEWSEPVYV
NSSGFDPSLF HDDDGRKWFL NMQWNHRTES YGGSPKSPAF DGILLQEWDP VTKALKGPLR
NIFAGSPLGL VEGPHLFKRN GWYYLTTAEG GTGYDHAVTM ARSRRIEGPY EMHPNMHLIT
SKDHPGAVLQ RAGHGQYVET PDGEAYHTHL CGRPLPPKRR CTLGRETSLQ KCVWRDDDWL
YLENGTSVPD VDVPGLFGAV PAEKPMRSEY SFDGGTLLAD FQWLRTPEPE RIFNLTDRPG
HLRLIARESI GSWFEQALVA RRQEHHSFRA ETVVEFSPDT YQQVAGLTHY YNRHKFHAVA
VTLHETLGRC VTILSCNGDY PNGRLSFPAE SDVAIAAEGR VQLAMEIREN DLQFFWQTEG
KGAWQPIGPI LDAGMISDEG GRGEHGSFTG AFAGVFAFDT SGRGKIADFD WFNYDEL
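
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below uses a plain-Python codon table rather than a library. Translation table 11 assigns the same amino acids to codons as the standard code and differs in the start codons it permits. Because this gene starts with ATG, the sketch does not handle alternative start codons. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in TCAG order (third base fastest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS from its first codon, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # remove grouping whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGATCCGCA ATCCCATCCT GCCCGGGTTC"))  # MIRNPILPGF
```

The output matches the first ten residues of the protein sequence, and the final codons of the gene (GAC GAA TTG TGA) translate to the C-terminal residues DEL followed by a stop.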