Gene Rleg2_5334 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5334 
Symbol 
ID6978428 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp959225 
End bp960397 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content63% 
IMG OID643394436 
Productglycosyl hydrolase family 88 
Protein accessionYP_002279254 
Protein GI209547336 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCCG TTTCAACAGT CGCCCCGCAG CCGATTACCG ATGAGGAAGT AAACGCCGCG 
CTCGATATCG CCGTCGAGCA GGTCAGGCGC AACCTCCCCG ACTTCACCCA CGCCGCGCAG
AACCATTCCA GTATCAAGAA TTTTTATCCC CCAGTGGCGA ACGACCAGTG GACCGCCGGC
TTCTGGCCCG GCGAACTGTG GCTCGCCTTC GAACACAGCG GCGACGCGGC CTTCCGGGAT
GCTGCGCAGA TCCAGGTTCA ATCGTTCCTG CATCGGATCG TCAATCGCAT CGAGACCGAT
CATCACGATA TGGGCTTTCT CTATTCGCCC TCCTGCATCG CCGCCTGGAA GCTCGTTGGA
GACGCGGATG GCCGCAGGGC CGCGATCCTG GCCGCCGACC AGCTGATAGA GCGCTTCCAG
CCGATCGGCC AGTTCATCCA GGCTTGGGGC CGCAAGGGAA AGGCGGAGGA ATATCGCTAT
ATCATCGACT GCCTTTTAAA CCTGCCGTTG CTCTACTGGG CAAGCCGCGA GACCGGCGAT
CCGAAATACC GCGAGATCGC GCTCACCCAC GCCCGCACCA CGCTCGCCAA TTCGGTGCGG
CCGGATGATT CCACCTATCA CACCTTCTAC ATGAACCCGG TGACCGGCGC GCCGGTGCGC
GGCGCCACCA AACAGGGCTA CCGGGACGAC AGCGCCTGGG CGCGCGGACA GGCCTGGGCA
ATCGCGGGCA TGGCGCTCTC CTACCGCTAC GAGCGGATCG AGGAATATCG CAGCACCTTC
GACCGGCTGC TCGCCTTCTA TCTCAACCGG CTGCCGGCCG ACATGGTCCC CTATTGGGAC
CTCGTCTTTT CCGACGGCGA CGGCGAGCCG CGCGACAGTT CGTCGGCCTC GATCACCGCC
TGCGGCCTGC TTGAAATGGC CGAGCTAGTC GAAGCCGAAC ACGCCGAGCG CTACCGCACG
CTGGCGCGCC GCATGATCAA GAGCCTGGCC GACCACTATG CGGTGAAGGA TCCCACCGTT
TCCAACGGCC TGGTGCTGCA CGCCACCTAT TCGAAGAAAT CGCCCTTCAA CACCTGCCGC
GGCGAGGGCG TCGATGAGTG CGTCTCCTGG GGAGACTATT ATTACATGGA AGCTTTGACG
CGCCTTTCGC GCCGCTGGTC TTCCTATTGG TGA
 
Protein sequence
MNAVSTVAPQ PITDEEVNAA LDIAVEQVRR NLPDFTHAAQ NHSSIKNFYP PVANDQWTAG 
FWPGELWLAF EHSGDAAFRD AAQIQVQSFL HRIVNRIETD HHDMGFLYSP SCIAAWKLVG
DADGRRAAIL AADQLIERFQ PIGQFIQAWG RKGKAEEYRY IIDCLLNLPL LYWASRETGD
PKYREIALTH ARTTLANSVR PDDSTYHTFY MNPVTGAPVR GATKQGYRDD SAWARGQAWA
IAGMALSYRY ERIEEYRSTF DRLLAFYLNR LPADMVPYWD LVFSDGDGEP RDSSSASITA
CGLLEMAELV EAEHAERYRT LARRMIKSLA DHYAVKDPTV SNGLVLHATY SKKSPFNTCR
GEGVDECVSW GDYYYMEALT RLSRRWSSYW