Gene Smed_3237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3237 
Symbol 
ID5324116 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp3411810 
End bp3413030 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content62% 
IMG OID640792185 
Producttryptophan synthase subunit beta 
Protein accessionYP_001328896 
Protein GI150398429 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0133] Tryptophan synthase beta chain 
TIGRFAM ID[TIGR00263] tryptophan synthase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATCAGC CGCCTAAACC GAATTCCTTC AGATCCGGAC CCGATGAAGA GGGCCGTTTC 
GGCATATTCG GTGGCCGCTT CGTCGCCGAG ACGCTGATGC CGCTGATCCT CGACCTCCAG
GACGAATGGG CGAGGGCAAA GAATGATCCG GCTTTCAAGG CGGAGCTGGA AAATCTCGGC
AGGCATTATA TCGGCCGGCC GAGCCCGCTC TATTTCGCCG AGCGCCTGAC GGCCGAACTC
GGCGGCGCGA AGATCTACTT CAAGCGCGAG GAGCTCAATC ACACGGGCTC CCATAAGATC
AATAACTGCA TCGGCCAGAT CCTGCTTGCC AAGCGCATGG GCAAGACCCG CATCATCGCC
GAGACCGGCG CCGGCCAGCA TGGTGTGGCA TCGGCCACCG TGGCGGCGCG TTTCGGGCTG
CCTTGCGTCG TCTATATGGG GGCGACAGAC GTGGAGCGGC AGGCACCGAA CGTCTTCCGC
ATGAAGCTTC TCGGCGCCGA GGTGAAGCCG GTGACTGCGG GTCACGGCAC CCTCAAGGAC
GCCATGAACG AGGCGCTGCG GGACTGGGTG ACCAATGTCG ACAGCACCTA TTACCTGATC
GGCACGGCCG CCGGCCCGCA TCCCTATCCG GAGATGGTAC GCGACTTCCA GGCGGTCATC
GGCGAGGAAG CCAAGCAGCA GATGCTCGAA GCCGAAGGCC GGCTTCCGGA CCTCGTGGTT
GCAGCGGTCG GCGGTGGGTC AAATGCGATA GGCATCTTCC ATCCATTCCT GGATGACGGG
GGCGTCAGGA TCGTCGGCGT TGAAGCCGGT GGCAAGGGCC TGGACGGCGA TGAGCATTGC
GCCTCTCTCA CAGCCGGCTC GCCGGGCGTG CTGCATGGCA ACCGCACTTA TCTGCTCCAG
GACGGTGACG GCCAGATCAA GGAAGGCCAC TCGATTTCGG CCGGGCTCGA TTACCCGGGG
ATCGGACCGG AGCATGCCTG GCTGAACGAT ATCGGCCGCG TCGAATATGT GCCGATCATG
GATCATGAGG CGCTGGAGGC GTTTCAGATC CTGACGCGGC TCGAAGGCAT CATTCCGGCG
CTCGAGCCGT CCCACGCGCT TGCCGAAGTC ATCAAGCGTG CGCCGAAAAT GGGCAAGGAC
GAGATCATCC TGATGAATCT CTCCGGTCGC GGCGACAAGG ACATCTTCAC CGTCGGCAAA
ATTCTCGGTA TGGGGCAATA A
 
Protein sequence
MNQPPKPNSF RSGPDEEGRF GIFGGRFVAE TLMPLILDLQ DEWARAKNDP AFKAELENLG 
RHYIGRPSPL YFAERLTAEL GGAKIYFKRE ELNHTGSHKI NNCIGQILLA KRMGKTRIIA
ETGAGQHGVA SATVAARFGL PCVVYMGATD VERQAPNVFR MKLLGAEVKP VTAGHGTLKD
AMNEALRDWV TNVDSTYYLI GTAAGPHPYP EMVRDFQAVI GEEAKQQMLE AEGRLPDLVV
AAVGGGSNAI GIFHPFLDDG GVRIVGVEAG GKGLDGDEHC ASLTAGSPGV LHGNRTYLLQ
DGDGQIKEGH SISAGLDYPG IGPEHAWLND IGRVEYVPIM DHEALEAFQI LTRLEGIIPA
LEPSHALAEV IKRAPKMGKD EIILMNLSGR GDKDIFTVGK ILGMGQ