Gene Rleg2_5908 details


Gene Information

Locus tag: Rleg2_5908
Symbol:
ID: 6977295
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011366
Strand:
Start bp: 326462
End bp: 327928
Gene Length: 1467 bp
Protein Length: 488 aa
Translation table: 11
GC content: 62%
IMG OID: 643393361
Product: glycoside hydrolase family 4
Protein accession: YP_002278179
Protein GI: 209546289
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases
TIGRFAM ID:
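
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
gene_length = span_length(326462, 327928)
print(gene_length)                           # 1467
print(expected_protein_length(gene_length))  # 488
```

Both results agree with the reported Gene Length (1467 bp) and Protein Length (488 aa).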


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTTTCA AAATCGCTAT CATCGGTGCG GGCAGCGTTG GTTTCACCAA GAAGCTGTTT 
ACCGACATTT TGTGCGTTCC GGAGTTTCGC GACGTCGAAT TCGCGCTGAC CGATCTCAGC
GAACATAATC TCGAGATGAT CAAGGCGATC CTCGACCGGA TCGTCGAGGC GAACGGATTG
CCGACCAAGG TGACGGCGAC GACCGACCGG CGCCAGGCGC TGACCGGCGC GCGCTACATC
ATCAGCTGCG TCCGGGTCGG CGGCCTCGAA GCCTATGCCG ACGATATCAG GATCCCGCTG
AAATACGGCA TCGACCAATG CGTCGGCGAT ACGATCTGCG CCGGCGGCAT CCTCTATGGC
CAGCGCAACA TTCCGGTCAT CCTCGACTTC TGCAAGGATA TCCGCGAGGT CGCCGAGCCC
GGCGCGAAAT TCCTGAACTA TGCCAATCCG ATGGCGATGA ACACCTGGGC GGCGATCGAA
TACGGCAAGG TCGATACCGT CGGCCTCTGC CACGGCGTCC AGCACGGCGC CGAACAGATC
GCCGAGGTGC TCGGCGCCAA GTCGCTGGGT GAACTCGACT ATGTCTGCTC CGGCATCAAC
CACCAGACCT GGTTCATCGA CCTGCGCCTC AACGGCCGCA GGATCGGCAA GGACGAACTC
ATCGCCGCCT TCGAGGCGCA TCCGGTCTAT TCGCAGCAGG AGAAGCTGCG CATCGATGTG
CTGAAGCGGT TCGGCGTCTA TTCCACCGAG AGCAACGGCC ATCTTTCGGA ATACCTGCCC
TGGTATCGCA AGCGGCCGGA GGAAATCACC CGCTGGATCG ACATGTCCGA CTGGATCCAC
GGCGAGACCG GCGGTTACCT CCGCCATTCC ACCGAAACCC GCAACTGGTT CGAGACCGAA
TTCCCGCAAT TCCTGGAATC CGCCGCAAAG CCGATCGACC CCGCCAAGCG CTCGAACGAA
CATGCGAGCC ACATCCTCGA GGCGCTGGAG ACGAACCGGG TCTACCGCGG CCATTTCAAC
GTCAAGAACA ATGGCGTCAT CACCAACCTG CCGTCCGATG CGATCATCGA ATCGCCCGGC
TTCGTCGACC GCTTCGGCAT CAACATGGTC TCCGGCGTCA CCCTGCCGGA AGCCTGTGCC
GCCACCTGCA TCGCCTCGAT CAATGTCCAG CGCATGTCGG TGCATGCCGC CATCTCCGGC
GACATCGACC TTCTGAAGCT TGCCGTGCTG CACGACCCGC TGGTCGGCGC CGTGGCGACA
CCCGAGGAGG TCTGGCAGAT GGTCGACGAG ATGGTCGTTG CCCAGGCGCG CTGGCTGCCG
CAATATGCCG ATGCCGTGCC GGCCGCCAAG GAGCGGCTGT CGAAATCCAA GGTGCAGACC
CGCGACTGGG CGGGTGCCGC ACGCCGCAAC GTTCGCTCGA TCGAAGAGCT GCGCGCGGAA
AAGGCGGCGC TGAAACAGGC CGTCTGA
 
Protein sequence
MSFKIAIIGA GSVGFTKKLF TDILCVPEFR DVEFALTDLS EHNLEMIKAI LDRIVEANGL 
PTKVTATTDR RQALTGARYI ISCVRVGGLE AYADDIRIPL KYGIDQCVGD TICAGGILYG
QRNIPVILDF CKDIREVAEP GAKFLNYANP MAMNTWAAIE YGKVDTVGLC HGVQHGAEQI
AEVLGAKSLG ELDYVCSGIN HQTWFIDLRL NGRRIGKDEL IAAFEAHPVY SQQEKLRIDV
LKRFGVYSTE SNGHLSEYLP WYRKRPEEIT RWIDMSDWIH GETGGYLRHS TETRNWFETE
FPQFLESAAK PIDPAKRSNE HASHILEALE TNRVYRGHFN VKNNGVITNL PSDAIIESPG
FVDRFGINMV SGVTLPEACA ATCIASINVQ RMSVHAAISG DIDLLKLAVL HDPLVGAVAT
PEEVWQMVDE MVVAQARWLP QYADAVPAAK ERLSKSKVQT RDWAGAARRN VRSIEELRAE
KAALKQAV
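
The gene and protein sequences above can be cross-checked against each other and against the reported GC content. The following is a minimal sketch in plain Python, with no external libraries. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code, and it assumes the sequence is pasted in the 10-base blocks shown in this record.

```python
from itertools import product

# Codon assignments shared by the standard code and table 11,
# listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to format the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon.

    Note: this does not handle alternative start codons (for example GTG or
    TTG read as M); the gene in this record starts with ATG.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGAGTTTCA AAATCGCTAT CATCGGTGCG"))  # MSFKIAIIGA
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence gives an end-to-end check, and `gc_fraction` on the full sequence can be compared with the reported GC content.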