Gene Smed_3289 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3289 
Symbol 
ID5324173 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp3481318 
End bp3482568 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content62% 
IMG OID640792241 
Productsarcosine oxidase beta subunit family protein 
Protein accessionYP_001328946 
Protein GI150398479 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0665] Glycine/D-amino acid oxidases (deaminating) 
TIGRFAM ID[TIGR01373] sarcosine oxidase, beta subunit family, heterotetrameric form 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.96044 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCTACT CTGCCCTTTC CATCCTCCTC AATGGCCTGC GCGGCAACCG GAACTGGACG 
CCGGCATGGC GCCAGCCAGA CCCGAAGCCA CATTACGACG TGATCATCGT CGGCGGCGGT
GGCCATGGCC TCGCGACTGC CTATTACCTT GCCAAGGAAT TCGGCGTCAC CAATGTCGCG
GTGCTGGAAA AGAATTATGT CGGTTCGGGC AATGTCGGCC GCAACACGAC GATCATTCGC
TCGAACTACC TGCTCCCCGG GAACAATCCA TTTTACGAGC TCTCCATGCA GCTATGGGAG
GGGCTGGAGC AGGATTTTAA TTTCAATGCG ATGGTCTCGC AGCGCGGCGT TCTCAATCTC
TATCATTCGG ACGCTCAGCG CGATGCCTAC ACGCGCCGCG GCAATGCGAT GCGGCTTCAC
GGGGTAGACG CAGAACTCCT CGATCGGGCG GCCGTACGCC GGATGCTGCC CTTTCTCGAT
TTCGACAATG CCCGCTTCCC CATCCAGGGC GGGCTCCTGC AGCGCCGCGG CGGTACCGTG
CGCCACGACG CCGTCGCCTG GGGATATGCC CGCGGCGCCG ACAGCCGCGG GGTCGATATC
ATCCAGAATT GCGAAGTGAC CGGGATCAGG CGAGAAGACG GGCGAGTCAC CGGCGTCGAG
ACCAGCCGCG GCTTCATCGG CTGCGGAAAG CTCGCCCTGG CGGCGGCCGG AAATTCCTCG
AAGGTCGCCG AACTGGCGGG ATTGCGCCTG CCGATCGAGA GTCACGTGCT TCAGGCCTTC
GTGTCGGAGG GGCTGAAACC GTTCATTGAC GGGGTGGTCA CTTTCGGAGC CGGACATTTC
TACGTTTCAC AATCGGACAA GGGCGGCCTC GTCTTTGGCG GCGATCTCGA CGGCTATAAT
TCCTACGCTC AGCGCGGCAA CCTGGCGACC GTCGAGCATG TGGCGGAGGC CGGAAAGGCC
ATGATTCCGG CATTGTCGCG GGTGCGGGTG CTGCGCTCCT GGGGCGGTAT CATGGATATG
AGCATGGACG GCTCGCCGAT CATCGACCGC ACGCCGATCG ACAATCTCTA TCTGAATGCC
GGCTGGTGCT ATGGCGGGTT CAAGGCGACC CCTGCCTCAG GATTCTGCTT CGCACATCTC
CTCGCCCGAG GCGCGCCGCA AAAGACAGCC GCAGCGTTTC GTCTCGACCG TTTCGAGCGA
GGCTACCTCC TTGATGAAAA AGGCCAAGGC GCTCAGCCGA ACCTTCACTG A
 
Protein sequence
MRYSALSILL NGLRGNRNWT PAWRQPDPKP HYDVIIVGGG GHGLATAYYL AKEFGVTNVA 
VLEKNYVGSG NVGRNTTIIR SNYLLPGNNP FYELSMQLWE GLEQDFNFNA MVSQRGVLNL
YHSDAQRDAY TRRGNAMRLH GVDAELLDRA AVRRMLPFLD FDNARFPIQG GLLQRRGGTV
RHDAVAWGYA RGADSRGVDI IQNCEVTGIR REDGRVTGVE TSRGFIGCGK LALAAAGNSS
KVAELAGLRL PIESHVLQAF VSEGLKPFID GVVTFGAGHF YVSQSDKGGL VFGGDLDGYN
SYAQRGNLAT VEHVAEAGKA MIPALSRVRV LRSWGGIMDM SMDGSPIIDR TPIDNLYLNA
GWCYGGFKAT PASGFCFAHL LARGAPQKTA AAFRLDRFER GYLLDEKGQG AQPNLH