Gene Rleg2_3454 details

Gene Information

Locus tag: Rleg2_3454
Symbol:
ID: 6982208
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 3569716
End bp: 3571626
Gene Length: 1911 bp
Protein Length: 636 aa
Translation table: 11
GC content: 64%
IMG OID: 643398172
Product: Beta-N-acetylhexosaminidase
Protein accession: YP_002282947
Protein GI: 209551030
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3525] N-acetyl-beta-hexosaminidase
TIGRFAM ID:
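
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions, though they agree with the listed values. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 3569716
END_BP = 3571626
GENE_LENGTH_BP = 1911
PROTEIN_LENGTH_AA = 636


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive ends."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1911
print(coding_codons(GENE_LENGTH_BP))   # 636
```

Under these assumptions, 3571626 - 3569716 + 1 gives the listed 1911 bp, and 1911 / 3 - 1 gives the listed 636 aa.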


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGATT ATAGTCTTGA GACAAGCTGG CGCCCGGTCG AAGGCAGCTT CGGGCGCTTG 
ACCTTCACGC TCCACAATCT TTCGGCCGAG CCGCTGTCCG GCTTCTCGCT TGCCTATACA
TCGGAGACGC GGGTTGCCGA CAAGCATGTC TGCGACGGCG CCAGCCTCAA GCGGCAGGTC
GCGCATTTCC ATGAGTTCCT GCCGTCCGAG CAATTGACGG TGCCGCCGGG TGGGCGCTGG
CGGTTCACTG TGGAGGGGCT GAGCCGAGAG CCGAAACATG TCACCGCTGG CGTTAAATCC
GCCTATCTGA CACTTGCCGA CGGTCGTCAC CTTCCCGTCG ATTTCGGCGA TCTCATGCTC
GAAGGCCGGG ATGGCGGTGT GTCGCCGCCG CTCTTGCCGC CGGGGAAAGC CGATGAACCC
TATGCGCTGC TCCCCTGGCC GCTGGCGCTC GGGCTGAAGG CGGGTGACTT GCCGGCCACC
CTCTATCCGG CCGAGAGGAC GCGGCCTGAT GCGATCGAGG CCCTCTCCCA GGTTCTTTCT
CTTTACCAGC GCCTCTACCC AGTCGAAGAC GCGCCGTTTT CGCTGAGGGC CGTCGAAGGC
GGACGGCGCA TTCGTTTCGT CGCCGAATCG TCGATCGCTG CCTTCGCCTA CGAGCTGCGT
TTCACCGCGC ACGAGATCGT GCTTTCGAGC GCCGATGAAG CGGGGCGGCG CTACGGTCTC
ATCAGCCTGG CGCAGCTGCT GCACGGGGCC CGCGCCGATG GCGAACGATT TAAATTCCCC
AATTTCGGCA CGATCGCCGA CCAGCCGCGT TATGACTGGC GCGGCTGCCA TCTCGATGTG
TCCCGGCAGT TCTATCCGGT CGCCGACATC CTGCGGCTGA TCGATATTCT TGCCTGGAAC
AAGCTCAATA TCTTTCATTG GCACCTGACC GACGACGAAG CCTGGCGGCT GGAGATCAAG
GCCTATCCAG CGCTGACGGA GATCGGCGCG CGGCGCGGGC CGGACGAGGT CCTCGTGCCA
CAGCTCGGCG ACGGAGCGGA AACACGCGCC GGCCATTACA CGCAGGATGA TATCAGGCGG
ATCGTTGCGC ATGCCGCCTC GCTCGGTGTC GAGGTCGTGC CGGAAATCGA CATTCCCGGC
CACAGCACCG CAACCTTGCT CTCGCTGCCC GAGCTCGCCG ACGGGCAGGA AGCGCCGGAC
AGCTATCGCG CGGTCCAAGG TTATCCCAAC AATGCGCTCA ACCCTGCGGT CGAATTCACC
TATGAATTTC TCGGCAAGGT GTTCGACGAG ATCGTGGCGC TGTTTCCGAG CGAGTATCTC
CACATTGGCG GCGACGAGGT GGCAGAGGGC GCTTGGCTAT CCTCCCCTCT CTGCCAGGCG
CTGATGAAGC GGGAGAAGCT TGCCGGCACC GCCGAGCTGC AATCCTATTT CCTGAAACGC
ATCAAGGCGA TGCTCTCGGA GCGCGGCAGG AAGCTCGCCG GCTGGAACGA GGTCTCGCAT
GGCGGCGGCA TCGACCCCGA CGGCACGCTG CTGATGGCCT GGGAAAAGCC CGCCGTCGGC
ATCGAGCTGG CGCAGCAAGG CTACGACGTG GTGATGACGC CGGGACAGGC TTATTATCTC
GACATGGCGC AGGCCGAAGC CTGGGCGGAG CCCGGTGCAG CATGGGCGGG CTATTCGCCG
CCGGAACATA GCTACACTTA CGAGGCCGAG GGTGAGTTGC CGGAGGCGCT GCAGGAGAAG
ATGCGCGGTA TCCAGGCCTG CATCTGGACT GAAAACTTCA TCTCACGCGC CTATTTCAAC
CGGCTGGTCT TTCCGCGTCT TCCGGCCGTC GCCGAGGCCG CATGGACACC GTCGGGCCGG
AAGGATTGGG ACCGGTTCGC CGCGATCGTG CGGATGTGGC CTATGCTTTA G
 
Protein sequence
MADYSLETSW RPVEGSFGRL TFTLHNLSAE PLSGFSLAYT SETRVADKHV CDGASLKRQV 
AHFHEFLPSE QLTVPPGGRW RFTVEGLSRE PKHVTAGVKS AYLTLADGRH LPVDFGDLML
EGRDGGVSPP LLPPGKADEP YALLPWPLAL GLKAGDLPAT LYPAERTRPD AIEALSQVLS
LYQRLYPVED APFSLRAVEG GRRIRFVAES SIAAFAYELR FTAHEIVLSS ADEAGRRYGL
ISLAQLLHGA RADGERFKFP NFGTIADQPR YDWRGCHLDV SRQFYPVADI LRLIDILAWN
KLNIFHWHLT DDEAWRLEIK AYPALTEIGA RRGPDEVLVP QLGDGAETRA GHYTQDDIRR
IVAHAASLGV EVVPEIDIPG HSTATLLSLP ELADGQEAPD SYRAVQGYPN NALNPAVEFT
YEFLGKVFDE IVALFPSEYL HIGGDEVAEG AWLSSPLCQA LMKREKLAGT AELQSYFLKR
IKAMLSERGR KLAGWNEVSH GGGIDPDGTL LMAWEKPAVG IELAQQGYDV VMTPGQAYYL
DMAQAEAWAE PGAAWAGYSP PEHSYTYEAE GELPEALQEK MRGIQACIWT ENFISRAYFN
RLVFPRLPAV AEAAWTPSGR KDWDRFAAIV RMWPML
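
The sequences above are printed in space-separated blocks of ten residues, so they need to be joined before any analysis. The following is a minimal sketch of how that could be done, together with a GC-fraction calculation and a stop-codon check for translation table 11. The helper names are hypothetical, and only short excerpts of the gene sequence are used here rather than the full record.

```python
# Stop codons for translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA sequence (0.0 to 1.0)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(dna: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return clean_sequence(dna)[-3:] in STOP_CODONS


# Excerpts copied from the gene sequence: the first two and last two blocks.
first_blocks = "ATGGCCGATT ATAGTCTTGA"
last_blocks = "CTATGCTTTA G"

print(clean_sequence(first_blocks))  # ATGGCCGATTATAGTCTTGA
print(gc_fraction(first_blocks))     # 0.4
print(ends_with_stop(last_blocks))   # True
```

Passing the full gene sequence to `gc_fraction` should give a value comparable to the GC content listed in the Gene Information table, although the exact rounding used by the source database is not stated.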