Gene Rleg_4874 details

Gene Information

Locus tag: Rleg_4874
Symbol:
ID: 8007261
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 255087
End bp: 256787
Gene Length: 1701 bp
Protein Length: 566 aa
Translation table: 11
GC content: 65%
IMG OID: 644821803
Product: Glycosyl hydrolase family 32 domain protein
Protein accession: YP_002973063
Protein GI: 241113228
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1621] Beta-fructosidases (levanase/invertase)
TIGRFAM ID:
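
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS is unspliced and ends in a single stop codon; the function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(255087, 256787)
print(length)                     # 1701
print(protein_length_aa(length))  # 566
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of this record.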


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.963389
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.66875
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTCTGC AGGCAAAATC CGAAACGTCA GCACCTGAAA CCATTGAAGC GGAACTGCCC 
GAAGGCACGG TGCTGCATCT CTGGCTGAAG GCGCGCCATG CCGGCGGCGA GGCCAAGCTC
TTCGTCGCCG TTGAGGGTAA CGACATCGGC GAGCCCTCGA CCCACCGCGC AGGGGAATTT
GAGTTCTTCG CTGTGACGCT CGCGAAAGGC GGTCGTGCCA CGCTGTCTTA TGATGCGGCA
GCCACAGCGC TTTCCGTCGC TTACGCCTTC CGGCCGGAAA CCGTGATGAA GGAGGGCATC
CGCGTCCTGC ACAGCGATGC CCGCACCGCC GCCCCCGACG TGCCCGACAG CTACCACTTC
CGCCCACCCT TCGGCTGGAT GAACGATCCG AACGGTTTTG GACGGTTCGG CGGAAACGCT
CACCTCTTCT ACCAGCATTA TCCTCATGAG CCGCGCTGGA ACACCATGCA CTGGGGCCAT
GCCGTCTCGA GGGATTTCGT CCGCTGGACG CATCTGCCCA TGTTCCTCTT CCCGGCGGCG
CATCTATCGG AAAAGGACGA TGGCCGCGGC GGCGCCTTCT CCGGCTCGGC GATCCCCGGC
TCCGGCCCGG AAGGCGAGGA AATCCGCGTC TTCTACACCG AGCATGTGCG TGACCGTCTA
CCGGAAGAGC AAATCCAGCT TTCCGCCGTC AGCCGCGACG GCATCGTCGC CGGCCCGTCG
GAAATCGTGA TGCCGCTGCG CCCGGAAGGC TTGAACGTCA CCACCGATTT CCGCGACCCC
TATGTCTTCA AAGGCCCGGA CGGGCGCTGG AAGATGCTGC TCGGCAGCCG CGACAGGCAG
GGCGGCGTCG TACTGCTCTA TGAAACCGCA GACGCGCAAG GCGTCGATGG CTGGACCTTC
CTCGGCATCC TCCATCGCGA GGACGGTTTC GGCATGACGG CGGCGGAATG CCCCTGCATG
GTGCCGCTTT CCGGCAAAAA CGCAGAAACC CGCTGGGCGC TGATCTTCGG TCTGCTCACC
AGCCGCGACC CGGCCACCGG CCGCCGCAAC CTCACTTCCG TCACCGTCGG TGGTTTCGAT
GGCCGCACCT TCGTCGCGGA ATTCGTGCAG GAGTTGGATT TCGGTTCGGA TGCCTATGCC
TTCCAGGCCT TCGTTGATGG CGACGAGCCG GTCGGCATCG CCTGGCTCGC CAACTGGACG
GATTTTTCCA AGAAGGACGA TTTCCCGACG GCCATGACCC TGCCGCGCCG CATGCTTCTC
GACGGCGACA CCGTGCTGAC CCCGCCGGTC GCAGCCGTCG AAAGCCTACG CCATCGGTTG
CTGGACGGCA CCGCGCTTGC CGCCGGCAAG ACCGTGCCGC TCGGCACCGG CGCCGTCGAG
ATCGTGCTTG ATCTCACCGC GCCGGGCGCC GCCTTCGATC TCACCTTCGA TCATCCGGAT
GTCGATCTAG GCGTTAAACT CGACGCCGAT GGTCTGGCGA TTGTCTTCGA CGCCCGCACC
GGTATGAGGC CGCCGCGTTA CGTCGCCGCC GGCGCGAATC CGTCGAGCCT GCGCATCTTC
CTCGATGCCG GCTCCATCGA GGTCTTCGCT GACAACGGCC GCTGGACGGG GTCCAAACGC
ATTCCGAGCT TTGCCGCCGC ACGTTCGGCG ACGCTCGCCG GCGTCGTCGC CGGGGCCGGC
GTCTGGCAAT TGAAACTGTG A
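
The GC content field can in principle be recomputed from the gene sequence above. This is a minimal sketch, assuming GC content means the share of G and C among all bases; how the database rounds the value is not stated in the record. The helper name `gc_percent` is hypothetical.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Small self-check on a toy fragment written in the same 10-base block layout.
print(gc_percent("ATGC GGCC"))  # 75.0
```

Passing the full gene sequence, spaces and line breaks included, returns a value that can be compared with the GC content field of the record.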
 
Protein sequence
MSLQAKSETS APETIEAELP EGTVLHLWLK ARHAGGEAKL FVAVEGNDIG EPSTHRAGEF 
EFFAVTLAKG GRATLSYDAA ATALSVAYAF RPETVMKEGI RVLHSDARTA APDVPDSYHF
RPPFGWMNDP NGFGRFGGNA HLFYQHYPHE PRWNTMHWGH AVSRDFVRWT HLPMFLFPAA
HLSEKDDGRG GAFSGSAIPG SGPEGEEIRV FYTEHVRDRL PEEQIQLSAV SRDGIVAGPS
EIVMPLRPEG LNVTTDFRDP YVFKGPDGRW KMLLGSRDRQ GGVVLLYETA DAQGVDGWTF
LGILHREDGF GMTAAECPCM VPLSGKNAET RWALIFGLLT SRDPATGRRN LTSVTVGGFD
GRTFVAEFVQ ELDFGSDAYA FQAFVDGDEP VGIAWLANWT DFSKKDDFPT AMTLPRRMLL
DGDTVLTPPV AAVESLRHRL LDGTALAAGK TVPLGTGAVE IVLDLTAPGA AFDLTFDHPD
VDLGVKLDAD GLAIVFDART GMRPPRYVAA GANPSSLRIF LDAGSIEVFA DNGRWTGSKR
IPSFAAARSA TLAGVVAGAG VWQLKL
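
The relationship between the gene sequence and the protein sequence can be sketched as a plain codon-by-codon translation. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons. The sketch below does not model alternative starts, which is enough here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence up to, and excluding, the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence in this record.
print(translate("ATGTCTCTGC AGGCAAAATC CGAAACGTCA"))  # MSLQAKSETS
print(translate("GTCTGGCAAT TGAAACTGTG A"))           # VWQLKL
```

Both fragments reproduce the corresponding start and end of the protein sequence listed above.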