Gene Smed_5532 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5532 
Symbol 
ID5319834 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp497144 
End bp498175 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content60% 
IMG OID640777283 
ProductD-cysteine desulfhydrase 
Protein accessionYP_001314215 
Protein GI150377620 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2515] 1-aminocyclopropane-1-carboxylate deaminase 
TIGRFAM ID[TIGR01275] pyridoxal phosphate-dependent enzymes, D-cysteine desulfhydrase family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.1518 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGACCG ACCGAGATCC TTTGCAGAGC TTTCCCCGTG AGCGACTGAT GAAAGGTCCA 
ACGCCGATCC AGCGCTTGGC GCGTCTCGAA GAAGTTCTGG GCGAACGGAG TAGGGGCGTA
TCGATCTGGG CCAAAAGAGA TGATCTCATG GAACTCGGCG GCGGCGGCAA CAAGCTCCGC
AAGCTTGAAT TCCTCCTCGG GCAGGCGAAA GCGGAGGGAT GTGACACCCT CGTCGTAACG
GGAGGAGTTC AATCCAACTT CGCACGATTG GCCGCCGCAG CGTGCGCCAG GTCGGGGCTC
GCCTGCGAGC TCGTTCTTGC TCAGATGGTA CCTCGGACGA CCGAAATTTA TCAGGACAAC
GGCAATGTGC TTCTCGACCG TCTGTTTGGC GCCAGCGTTC ATATACTGGA CCCGGATGAA
GATGCTGGCG CGTATGCGAG GCGTCGGGTC GATGAGATCG CCGAAACTCG CAGGAGAGCT
CTTCTGGCGC CTCTCGGCGG CTCAACGACA ATCGGTTGCC TCGGTTACGT GGATTGCGCT
TTCGAACTCG CCCGGCAATC GGCTGAAACG GGTGTTGCGT TCGAGCAGAT CATCATCCCC
AACGGCAGCG GCGGCATGCA TGCCGGGTTG GCTGCTGGCG TGGTCGTTGC GGGGTCTCAC
CCTTCTCGGA TCGCCGCATA CACCGTGCTC TCGCCTGCAG ACAAGTGTCT CCTCGCAACT
GCGGACAAGG TCAACGCGGT TCTTGAGCGA CTGGCCAGCG ACGCTCGCGT GACCGCGGAC
GATCTCCGGA TAAGCAGTGC TCAACTGGGC GAAGGATACG GCATGCCGAC TTCCGGCATG
ATCGACGCGG TCGAACTTCT CGCGAGATCA GAAGGGCTTC TCGTCGATCC GGTTTACGGC
GGCAAGGCCT TGGCAGGGTT GCTGTCCGAC GTTGAAAGTG GGGCAATCGC ACCGCAGTCT
AACGTGCTCT TCATCATGAC CGGAGGTTCG CCCGGACTTT ATGCATACGC CGACGTTCTC
ACTTCCAAGT AG
 
Protein sequence
MMTDRDPLQS FPRERLMKGP TPIQRLARLE EVLGERSRGV SIWAKRDDLM ELGGGGNKLR 
KLEFLLGQAK AEGCDTLVVT GGVQSNFARL AAAACARSGL ACELVLAQMV PRTTEIYQDN
GNVLLDRLFG ASVHILDPDE DAGAYARRRV DEIAETRRRA LLAPLGGSTT IGCLGYVDCA
FELARQSAET GVAFEQIIIP NGSGGMHAGL AAGVVVAGSH PSRIAAYTVL SPADKCLLAT
ADKVNAVLER LASDARVTAD DLRISSAQLG EGYGMPTSGM IDAVELLARS EGLLVDPVYG
GKALAGLLSD VESGAIAPQS NVLFIMTGGS PGLYAYADVL TSK