Gene Smed_0133 details


Gene Information

Locus tag: Smed_0133
Symbol:
ID: 5320962
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 146933
End bp: 148117
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 63%
IMG OID: 640789066
Product: O-succinylhomoserine sulfhydrylase
Protein accession: YP_001325828
Protein GI: 150395361
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases
TIGRFAM ID: [TIGR01325] O-succinylhomoserine sulfhydrylase
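
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the last codon of the gene is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 146933
end_bp = 148117

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the final codon is assumed to be a stop codon.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1185 bp
print(protein_length, "aa")  # 394 aa
```

Both results agree with the Gene Length and Protein Length fields.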


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.765641
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.668156
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAAGA ATTGGCGCCC GGCAACCCAG CTCGTCCACG GCGGAACGCT GCGTTCCCAG 
TACGGCGAGA CCTCCGAAGC GATCTTCCTG ACGCAAGGCT TCGTCTACGA CACCTCCGAA
GCGGCGGAGG CCCGTTTCAA GGGCGAGACT GACGGTTACA TCTATGCGCG TTACGGCAGC
CCGACCAACG ACATGTTCGA AAAGCGCATG TGCATGCTCG AAGGAGCGGA AGACGCGCGT
GCAACCGCCT CCGGCATGGC CGCGGTTTCT GCGGCAATCC TTTGCCAGGT GAAGGCTGGA
GACCATATCG TCGCCGCCCG CGCACTCTTC GGCTCGTGCC GCTGGGTTGT GGAGACGCTG
GCGCCGAAAT ACGGGGTCGA GTGCACGCTG GTGGACGGCC GCGATCTCAA GAACTGGGAA
GACGCAGTGC GTCCGAACAC GAAGGTCTTC TTCCTGGAAA GCCCGACGAA CCCGACGCTG
GAAGTGATCG ACATTGCCGG TGTCGCCAGG CTCGCCGATC AGATCGGCGC CAAGGTGGTG
GTCGACAACG TCTTCGCAAC GCCGCTCTTC CAGAAGCCGC TGGAGCTCGG CGCCCATATC
GTCGTCTATT CCGCGACGAA ACATATCGAT GGCCAGGGTC GCTGCCTCGG CGGCGTGGTT
CTCTCCGACA AGCAGTGGAT CGACGAGAAT CTGCATGATT ACTTCCGTCA CACCGGCCCG
GCCATGTCGC CCTTCAATGC CTGGACGCTA CTGAAGGGGA TCGAGACCCT GCCGCTACGC
GTTAGGCAGC AGACCGAGAG CGCCCGCCGC ATCGCCGACT TCCTCACAGA GCAGCCGCAG
GTCGCACGCG TCATTTATCC GGGCCGCAAG GATCACCCGC AGGCCGACAT TATTGCCAAG
CAGATGAGCG GCGGCTCGAC GCTGGTCGCC TTCGAACTCA AGGGCGGCAA GGAAGCAGCC
TTCGCCCTGC AGAACGCGCT GGAAATCGTT CGGATCTCCA ACAATCTGGG CGATTCCAAG
AGCCTGATCA CCCATCCGGC GACGACGACC CATAAGAACC TTACCGACGA GGCCCGCGCG
GAACTCGGCA TCTCCGCGGG GACCGTGCGC TTCTCGGCTG GAATCGAGGA TAGTGAAGAC
CTCGTCGAGG ACTTCGCGAA GGCACTGAGG AGCGTCACGG CCTAA
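
The GC content field can be recomputed from a sequence laid out as above. This is a minimal sketch, assuming the sequence is pasted as text with the 10-base groups separated by spaces and line breaks. The function name `gc_percent` is hypothetical, and the short input in the usage line is a toy string, not the gene itself.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence.

    Whitespace (the spaces and line breaks used to group bases)
    is removed before counting.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Toy input: 4 of the 8 bases are G or C.
print(gc_percent("GGCC AATT"))  # 50.0
```

Passing the full gene sequence as one multi-line string gives a value to compare against the GC content field.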
 
Protein sequence
MSKNWRPATQ LVHGGTLRSQ YGETSEAIFL TQGFVYDTSE AAEARFKGET DGYIYARYGS 
PTNDMFEKRM CMLEGAEDAR ATASGMAAVS AAILCQVKAG DHIVAARALF GSCRWVVETL
APKYGVECTL VDGRDLKNWE DAVRPNTKVF FLESPTNPTL EVIDIAGVAR LADQIGAKVV
VDNVFATPLF QKPLELGAHI VVYSATKHID GQGRCLGGVV LSDKQWIDEN LHDYFRHTGP
AMSPFNAWTL LKGIETLPLR VRQQTESARR IADFLTEQPQ VARVIYPGRK DHPQADIIAK
QMSGGSTLVA FELKGGKEAA FALQNALEIV RISNNLGDSK SLITHPATTT HKNLTDEARA
ELGISAGTVR FSAGIEDSED LVEDFAKALR SVTA
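
The protein sequence is the translation of the gene sequence under translation table 11. The following is a minimal sketch of that translation, assuming the codon-to-amino-acid assignments of table 11 match the standard genetic code (table 11 differs mainly in which start codons it permits, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table in the compact NCBI layout: the first base varies slowest
# and the third base fastest, with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [b1 + b2 + b3 for b1 in BASES for b2 in BASES for b3 in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace used to group the bases is ignored, and an incomplete
    trailing codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence (60 bases, 20 codons).
first_row = (
    "ATGAGCAAGA ATTGGCGCCC GGCAACCCAG "
    "CTCGTCCACG GCGGAACGCT GCGTTCCCAG"
)
print(translate(first_row))  # MSKNWRPATQLVHGGTLRSQ
```

The output matches the first 20 residues of the protein sequence, MSKNWRPATQ LVHGGTLRSQ.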