Gene Rleg2_6250 details

Gene Information

Locus tag: Rleg2_6250
Symbol:
ID: 6983323
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011370
Strand:
Start bp: 191326
End bp: 192807
Gene Length: 1482 bp
Protein Length: 493 aa
Translation table: 11
GC content: 54%
IMG OID: 643399259
Product: Alpha-N-arabinofuranosidase
Protein accession: YP_002284015
Protein GI: 209552099
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3534] Alpha-L-arabinofuranosidase
TIGRFAM ID:
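
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The sketch below is one way to check that they agree. It assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(191326, 192807)
print(length_bp)                  # 1482
print(protein_length(length_bp))  # 493
```

Both values match the Gene Length and Protein Length fields of this record.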


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.522312
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGGAAA ACTTGACAGG TGGTTTTGCG CAAGTGAGAG CGTCGAAGCA GCAGGAGTCT 
CGGAGATGGA ATCCGATGAT ACTCGGGCAT TTTGTCGAAC ATTTTCACAA TCAAATCTAT
GGCGGCGTCT TCGATCCGGG TTCGCATCTA GCGGACGACC GAGGTTTCCG TCTCGACGTC
ATCGAAGCGT TGAAAGAATT ACGGCCCCCG ATTGTTAGGT GGCCCGGCGG CAATTTCGTT
TCGGATTACC ATTGGTATGA GGCCGTGGGT GCAAACCGGC TGCCAAGCTA CAATAAGGCT
TGGCGTGTGG CCGAGCCCAA CACTTTTGGG ACCGACGAAT TTATTGAGTG GTGCCGGAGA
CTAAATTGCG AGCCCTACAT CTGCACCAAT GCGGGTAGTG GCACGCCCGA AGAAATGAGC
AATTGGCTCG AATACTGCAA CGGGCATCTC GAAACCCGAT ACGCAAATTT GCGTCGAAAG
AGCGGATATG AACGTCCACA CGCAGTAAAG TATTGGGGAA TCGGAAACGA GAGTTATGCA
GATTTCCAGA TCGGCGCCAA AACTATAGGG GAGTGGGGTC CTTATGTCGC CGAAGCGGCA
AAAATGATGC GTTCGGTGGA CGACACTATC GTCCTTTCAG CGGCTGCGGT ACCCGATACG
GAATGGACCC TAAACCTCCT GAAACACGCA GGTCGCTATC TCGACCTGGT TTCGATACAC
GGCTACTGGG ATGATCTGGA ACACCACGAC GAGCCGTCCG ACTATCTGAC GGCGGTCCTT
CGCTCTCACG AGCCGGAGAA GATGATCGAC GGCGCACGTG AGATCATCGC ACTGGCGGGG
CTGGAAGGAC AAATTCAAAT AGCATTTGAT GAGTGGAACC TTCGCGGGTG GCATCACCCT
CGTGGGACGC ATGAAGAAAA GATAAGGGCT CGTGACAGGA ACGACCGAGC TGAAACCTAC
ACGATGGCGG ATGCTCTGTT CACAGCCTCG TTCCTGAATT CATGCCTTCG TAACAGCGAT
ATCGTGTCGA TGGCGAACGT TTCGCCGAGC ATCAATGCAA GAGGACCGCT GTACGTCCAT
GGCGGCGGCG TTGTACGCCG CTCGACATTC TACGTTTTAA AAGCCTATAA CGATCACTTG
AAACCGTGGA TCGGATCGAC AAGCGTAAAT GGCCCGACAC TGCGTCATGC AGGGGCCGAA
ATAGCAACGA TCGAGGCACT GACCTCGTCC GACGGGGCTT CTCGCAACTT ATTCATTGTC
AATCGCGACC CTCACGACGC GATCCTTTGC GAACTATATT TCGACAATCA CCGGTTGGAT
GGCGACCGAG TAGTCACTGT TATCTCGGGC CTAACGGCCG ACTCCTTTAA CACGGTAGAA
GCCCCTGACA TGGTGTCACC GAGGGCTCAA CCTCTGGTGA GGCAAGGCGG AGGTTACTAC
ATTCCTCCTC ATTCTCTCTG CGTGCTGGAA GTTCCCGGCT GA
 
Protein sequence
MEENLTGGFA QVRASKQQES RRWNPMILGH FVEHFHNQIY GGVFDPGSHL ADDRGFRLDV 
IEALKELRPP IVRWPGGNFV SDYHWYEAVG ANRLPSYNKA WRVAEPNTFG TDEFIEWCRR
LNCEPYICTN AGSGTPEEMS NWLEYCNGHL ETRYANLRRK SGYERPHAVK YWGIGNESYA
DFQIGAKTIG EWGPYVAEAA KMMRSVDDTI VLSAAAVPDT EWTLNLLKHA GRYLDLVSIH
GYWDDLEHHD EPSDYLTAVL RSHEPEKMID GAREIIALAG LEGQIQIAFD EWNLRGWHHP
RGTHEEKIRA RDRNDRAETY TMADALFTAS FLNSCLRNSD IVSMANVSPS INARGPLYVH
GGGVVRRSTF YVLKAYNDHL KPWIGSTSVN GPTLRHAGAE IATIEALTSS DGASRNLFIV
NRDPHDAILC ELYFDNHRLD GDRVVTVISG LTADSFNTVE APDMVSPRAQ PLVRQGGGYY
IPPHSLCVLE VPG
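
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own method: it builds the codon table by hand (translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which start codons are allowed), and `translate` is a hypothetical helper. Only the first and last rows of the gene sequence are used here, to keep the example short.

```python
from itertools import product

# Codon -> amino acid map in the usual TCAG ordering.
# Table 11 shares these assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace is ignored, '*' marks a stop."""
    cleaned = "".join(dna.split()).upper()
    if len(cleaned) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(
        CODON_TABLE[cleaned[i:i + 3]] for i in range(0, len(cleaned), 3)
    )


# First and last rows of the gene sequence in this record.
FIRST_ROW = "ATGGAGGAAA ACTTGACAGG TGGTTTTGCG CAAGTGAGAG CGTCGAAGCA GCAGGAGTCT"
LAST_ROW = "ATTCCTCCTC ATTCTCTCTG CGTGCTGGAA GTTCCCGGCT GA"

print(translate(FIRST_ROW))  # MEENLTGGFAQVRASKQQES
print(translate(LAST_ROW))   # IPPHSLCVLEVPG*
```

The two outputs match the start and end of the protein sequence, with the trailing `*` corresponding to the TGA stop codon that is not counted in the 493 aa protein length.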