Gene Rleg_5368 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5368 
Symbol 
ID8007326 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp777760 
End bp779076 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content58% 
IMG OID644822272 
Productglycoside hydrolase family 4 
Protein accessionYP_002973532 
Protein GI241113697 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.265798 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAAGA TTTGCCTGGT AGGCGCTGGT AGCACCGTTT TTGCACAGAA CATTCTGGGA 
GACGTCCTGT CCTCGCAGAG AGGCGGCGAC TACGTCATCA GCCTCTTCGA TATCGATCCC
GAGCGACTGA AGACGTCAGA GATCGTCGCT CGCCGGATCT GCGAGTCGCT TAAGCTTTCA
AGTGTCAGGA TCGACGCCAC GCTCGACCGG CGGGAAGCGC TGCGGGGGTC GGATTTCGTC
ATCCTCATGA TGCAGGTCGG TGGCTACAAG CCGGCTACCG TCACGGACTT CGACATTCCG
AAAAAATATG GCTTGCGTCA GACGATCGCC GACACCCTTG GCATCGGCGG CATTTTCCGC
GGTTTGCGGA CGATACCAGT CCTTGAGGCG ATCTGCCGAG ACATGCAGGA GGTCTGCCCG
CAGGCTCTTC TGATGCAGTA CGTCAACCCG ATGGCCATCA ACTGCTGGGC GATCAAGGAG
CTGGCCCCGG AAATCCGCAC CGTCGGCTTG TGTCATAGCG TGCAGCACAC AGCCGGGCAT
CTCGCGCAAT GCCTCGGCGA GGACATTGCC GACGTGAACT ATATTTCGGC CGGGATCAAT
CACGTCGCCT TCTTCCTGAA ATATGAGAAA GTTCACAGCG ACGGACGGCG GGAAGATCTT
TATCCGAGAC TGAACGCTCT CGCCACCGAT GGCCGCGTCC CGTCGGACGA TCGGGTGCGG
TTCGATGTGC TCAAACGCCT CGGCCACTTC GTCACCGAAT CCAGCGAGCA TTTCTCCGAG
TATACCTCCT GGTACATCAA GGAGGGACGA GGGGATCTGA TCGATCAGCT CAATATCCCG
TTGGACGAAT ACATCCGGCG CTGCGAGGTG CAGATCAAAG AATGGCATGC CCTGCGCAAG
GAACTGGAAG GGGACAAGCC GATCGAGGTG TGCCGCAGCA ATGAATATGC GGCCGGCATC
ATTCATGCCG CGGTCACTGG CAGCCCGGCG CTGATATACG GTAATGTCCC AAACAACGGC
CTGATCGAGA ATCTGCCCGA TGAATGCATC GTGGAAGTGC CTTGCCATGT CGACAGAAAC
GGAATTCAGC CGGTCCGGGT CGGTCGGATC CCCTCTCAGC TTGCCGCCGT CATGAACTTG
AGCGTTTCCG TTCAGCAGTT GACGGTCGAG GCGGCTCTTA CAAAAAACCG CGAGCGCATC
TACCAGGCCG CTCTGCTCGA TCCGCATACG TCTGCGGAAT TGTCGCCAGA CCAAATCTGG
AACCTTGTCG ACGACCTGAT CGTCGCACAC GGCGATTTGT TGCCGAGATA TCAGTGA
 
Protein sequence
MPKICLVGAG STVFAQNILG DVLSSQRGGD YVISLFDIDP ERLKTSEIVA RRICESLKLS 
SVRIDATLDR REALRGSDFV ILMMQVGGYK PATVTDFDIP KKYGLRQTIA DTLGIGGIFR
GLRTIPVLEA ICRDMQEVCP QALLMQYVNP MAINCWAIKE LAPEIRTVGL CHSVQHTAGH
LAQCLGEDIA DVNYISAGIN HVAFFLKYEK VHSDGRREDL YPRLNALATD GRVPSDDRVR
FDVLKRLGHF VTESSEHFSE YTSWYIKEGR GDLIDQLNIP LDEYIRRCEV QIKEWHALRK
ELEGDKPIEV CRSNEYAAGI IHAAVTGSPA LIYGNVPNNG LIENLPDECI VEVPCHVDRN
GIQPVRVGRI PSQLAAVMNL SVSVQQLTVE AALTKNRERI YQAALLDPHT SAELSPDQIW
NLVDDLIVAH GDLLPRYQ