Gene Smed_2786 details

Gene Information

Locus tag: Smed_2786
Symbol:
ID: 5323656
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 2903962
End bp: 2905368
Gene Length: 1407 bp
Protein Length: 468 aa
Translation table: 11
GC content: 61%
IMG OID: 640791731
Product: Beta-glucosidase
Protein accession: YP_001328451
Protein GI: 150397984
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase
TIGRFAM ID: [TIGR03356] beta-galactosidase
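
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; neither assumption is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for an unspliced CDS.

    Assumes the CDS length is a multiple of three and that the
    final codon is a stop codon, which is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp, End bp.
length = gene_length(2903962, 2905368)
print(length, expected_protein_length(length))
```

Under these assumptions the result agrees with the Gene Length and Protein Length fields listed in the record.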


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGAAG CCAAGAAACT GGCAGAGCGC TTTCCCGGCG ATTTTGTCTT CGGCGTTGCC 
ACCGCATCCT TCCAGATCGA GGGAGCGAGC AAGGCGGATG GGCGCAAAGC CTCCATCTGG
GATGCCTTCT CCAATATGCC GGGCCGTGTT TACGGACGCC ATAATGGCGA TGTCGCCTGC
GACCATTACA ACAGGCTGGA ACAGGACCTC GACCTGATAA AGAGCCTTGG CGTCGGGGCC
TATCGCTTCT CGATCGCCTG GCCGAGGATC GTTCCGGAGG GCACCGGCCC GATCAACGAG
AAGGGGCTCG ACTTCTACGA TCGCCTCGTC GATGGGCTGA AGGCGCGCGG CATCAAGGCC
TTCGCCACGC TCTATCACTG GGACCTGCCG CTGGCGCTGA TGGGCGACGG CGGCTGGACG
GCGCGCACGA CGGCTTATGC CTATCAGCGC TACGCGAAAA CGGTGGTTGC GCGTGTCGGC
GACCGTCTCG ACGCGGTGGC GACTTTCAAC GAACCCTGGT GTTCAGTCTG GCTAGGCCAT
CTTTACGGCG TGCATGCGCC GGGTGAACGC AACATGGATG CGGCACTTGC CGCGCTGCAC
GTCACCAATC TCGCCCATGG GTTAGGCGTG TCCGCGATCC GTTCGGTAAG GGCGGACCTG
CCGGTGGGCA TCGTCATCAA TGCCCATTCA ATCTATGCCG GCAGCGGCAG CGCCGCGGAC
AAGGCCGCGG CCGAACGCGC CTTCGATTTC CACAACGGCG TTTTTTTCGG CCCTGTCTTC
AAAGGCGAAT ATCCGGAGGG TTTCCTCTCG GCGCTTGGCG ACCGCATGCC AGCAATCGAG
GACGGCGACA TGGAGACGAT CGCCCAGCCG CTCGACTGGT GGGGGCTCAA CTACTATACG
CCGATGCGCG TTTCGGCAGA CTCCGCGAAG GATTCAGAAT ACCCGGCGAC CGTCAATGCG
AAGCCCATGA GCGACGTGAA GACGGATATC GGCTGGGAAG TCTACGCTCC GGCGCTTGGC
GCGTTGGTGG AGACGCTCAA TGCCCGCTAT GCGCTTCCCG ACTGCTACAT CACCGAGAAC
GGCGCCTGCT ACAATATGGA CGACGAGAAT GGCGTCGTCG ACGATCAGCC TCGGCTCGAC
TATATCTCGG ACCATCTCGC AGTTGCCGCC GATCTCATCT CCAGGGGGTA TCCTATGAAG
GGCTATTTCG CCTGGAGCCT GATGGACAAT TTCGAGTGGG CGGAGGGCTA CAGGATGCGC
TTCGGCATCG TTCACGTTGA TTATGAAACT CAGGTCCGCA CGATCAAGAA AAGCGGCCGC
TGGTACGAAG CCCTGGCGAA GCAGTTCCCC AGGACAACCG TAAACAGGGA AGATGCCGCG
CCACGGCGTT CTGTACAGAA AGAGTAG
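
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any calculation. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the record does not say how its own GC figure was calculated or rounded.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, as printed in the record.
first_line = "ATGATCGAAG CCAAGAAACT GGCAGAGCGC TTTCCCGGCG ATTTTGTCTT CGGCGTTGCC"
print(len(clean_sequence(first_line)), round(gc_percent(first_line), 1))
```

Passing the full gene sequence text to `gc_percent` gives a value that can be compared with the reported GC content.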
 
Protein sequence
MIEAKKLAER FPGDFVFGVA TASFQIEGAS KADGRKASIW DAFSNMPGRV YGRHNGDVAC 
DHYNRLEQDL DLIKSLGVGA YRFSIAWPRI VPEGTGPINE KGLDFYDRLV DGLKARGIKA
FATLYHWDLP LALMGDGGWT ARTTAYAYQR YAKTVVARVG DRLDAVATFN EPWCSVWLGH
LYGVHAPGER NMDAALAALH VTNLAHGLGV SAIRSVRADL PVGIVINAHS IYAGSGSAAD
KAAAERAFDF HNGVFFGPVF KGEYPEGFLS ALGDRMPAIE DGDMETIAQP LDWWGLNYYT
PMRVSADSAK DSEYPATVNA KPMSDVKTDI GWEVYAPALG ALVETLNARY ALPDCYITEN
GACYNMDDEN GVVDDQPRLD YISDHLAVAA DLISRGYPMK GYFAWSLMDN FEWAEGYRMR
FGIVHVDYET QVRTIKKSGR WYEALAKQFP RTTVNREDAA PRRSVQKE
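
The protein sequence can be checked against the gene sequence by translating it codon by codon. This is a minimal sketch, assuming the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (the two tables differ in permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, stopping at the first stop codon.

    Whitespace is ignored so block-formatted sequences can be passed in.
    Any trailing bases that do not fill a codon are dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
print(translate("ATGATCGAAG CCAAGAAACT GGCAGAGCGC"))  # MIEAKKLAER
```

The ten residues produced from the first three blocks match the first block of the protein sequence listed above; the same function can be applied to the full gene sequence.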