Gene Rleg_6878 details

Gene Information

Locus tag: Rleg_6878
Symbol:
ID: 8022461
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012858
Strand:
Start bp: 331728
End bp: 333194
Gene Length: 1467 bp
Protein Length: 488 aa
Translation table: 11
GC content: 61%
IMG OID: 644833740
Product: glycoside hydrolase family 4
Protein accession: YP_002984874
Protein GI: 241666790
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases
TIGRFAM ID:
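
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 331728
end_bp = 333194
gene_length_bp = 1467
protein_length_aa = 488


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid codon count, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both comparisons hold under the stated assumptions.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(coding_codons(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 333194 - 331728 + 1 gives 1467 bp, and 1467 / 3 - 1 gives 488 codons, matching the listed gene and protein lengths.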


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.036749
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.397906
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAGTTTCA AAATCGCTAT CATCGGTGCG GGCAGCGTCG GTTTCACCAA GAAGCTGTTT 
ACCGATATAT TGTGCGTTCC CGAGTTTCGC GACGTCGAAT TCGCGCTGAC GGATCTCAGC
GAACACAATC TCGAAATGAT CAAGGCGATC CTCGACCGCG TCGTCGAGGC GAACGGGCTG
CCGACGAAGG TGACGGCGAC CACCAACCGG CGCCAGGCGT TGGAAGGCGC GCGGTATATT
ATCAGCTGCG TTCGGGTCGG TGGGCTCGAG GCCTATGCCG ATGATATCGG AATTCCCCTG
AAATACGGCA TCGACCAGTG CGTCGGCGAT ACGATCTGTG CCGGCGGCAT CCTCTATGGC
CAGCGCAACA TCCCGGTCAT CCTCGACTTC TGCAAGGATA TCCGCGAGGT CGCCGAGCCC
GGCGCGAAAT TCCTGAACTA TGCCAACCCG ATGGCGATGA ACACCTGGGC GGCGATCGAA
TATGGCAAGG TCGATACCGT CGGCCTCTGC CACGGCGTCC AGCATGGCGC CGAACAGATC
GCCGAGGTGC TCGGCGCGAA GTCGCTGAGC GAGCTCGACT ATATTTGCTC CGGCATCAAC
CATCAAACCT GGTTCATCGA CCTGCGCCTC AACGGCCGCA GGATCGGCAA GGACGAGCTC
ATCGCCGCCT TCGAGGCGCA TCCGGTCTAT TCGCAGCAGG AGAAATTGCG CATCGACGTG
CTGAAGCGTT TCGGCGTCTA TTCCACGGAA AGCAACGGCC ATCTCTCGGA ATACCTGCCC
TGGTATCGCA AGCGGCCGGA CGAAATCACC CGCTGGATCG ACATGTCCGA CTGGATCCAC
GGCGAGACCG GCGGCTATCT CCGCCATTCC ACTGAAACGC GCAACTGGTT CGAGACGGAA
TATCCGCAAT TTTTGGAATC CGCCGCAAAG CCGATCGATC CTGCCAAGCG CTCGAACGAA
CATGCCAGCC ATATCCTCGA GGCGCTGGAG ACGAACCGGG TCTATCGCGG CCATTTCAAC
CTCAAGAACA ATGGCGTCAT CACCAACCTG CCGTCCGATG CGATCATCGA ATCGCCGGGC
TTCGTCGATC GCTTCGGCAT CAACATGGTC TCCGGCGTCA CCCTGCCGGA AGCCTGCGCG
GCCACCTGCA TCGCCTCGAT CAACGTCCAG CGCATGTCGG TGCACGCGGC GATATCAGGC
GACATCGACC TCTTGAAGCT TGCCGTGCTG CACGACCCGC TGGTCGGCGC CGTCTCGACG
CCGGAAGAGG TCTGGCAGAT GGTCGACGAG ATGGTCGTTG CCCAGGCGCG CTGGCTGCCG
CAATATGCGC ATGCCGTGCC GGCCGCCAAG GAGCGGCTGT CGAAATCGAA GGTGCAGACC
CGCGACTGGG CGGGGGCCGC ACGCCGCAAC GTTCGCTCGA TCGAGGAGCT GCGCGCGGAA
AAGGCGGCAC TGAAACAGGC CGTCTGA
 
Protein sequence
MSFKIAIIGA GSVGFTKKLF TDILCVPEFR DVEFALTDLS EHNLEMIKAI LDRVVEANGL 
PTKVTATTNR RQALEGARYI ISCVRVGGLE AYADDIGIPL KYGIDQCVGD TICAGGILYG
QRNIPVILDF CKDIREVAEP GAKFLNYANP MAMNTWAAIE YGKVDTVGLC HGVQHGAEQI
AEVLGAKSLS ELDYICSGIN HQTWFIDLRL NGRRIGKDEL IAAFEAHPVY SQQEKLRIDV
LKRFGVYSTE SNGHLSEYLP WYRKRPDEIT RWIDMSDWIH GETGGYLRHS TETRNWFETE
YPQFLESAAK PIDPAKRSNE HASHILEALE TNRVYRGHFN LKNNGVITNL PSDAIIESPG
FVDRFGINMV SGVTLPEACA ATCIASINVQ RMSVHAAISG DIDLLKLAVL HDPLVGAVST
PEEVWQMVDE MVVAQARWLP QYAHAVPAAK ERLSKSKVQT RDWAGAARRN VRSIEELRAE
KAALKQAV