Gene Smed_2236 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2236 
Symbol 
ID5323097 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2315788 
End bp2316978 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content63% 
IMG OID640791174 
Productglucose sorbosone dehydrogenase 
Protein accessionYP_001327903 
Protein GI150397436 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.060795 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCATA GGCGCTGGAC GGCTCTCGGA TCAAGCAATA TCCCTGCGAT CCTAATATCC 
AGTATGGCGG CTGCGGCCCC GTTCCTTGTG AACTTCACCG CATTCGCTGC GCTTGCGCAG
GAGGCGCGGG AATTTTCGAC GCAGACAGGA ACCGTTCTCG TCGAGACGCT CGCCTCCGGG
CTCGAACATC CCTGGGCGGT TGAGGCCATG CCCGATGGGG CGCTTATCGT TACCGAGCGA
CCGGGCCGGC TACGCATCCT GCGCGACGGC AAGCTTTCAG ACGCGATCAA GGGCGTACCC
ACAGTGGCCG CCCACGGCCA GGGCGGCCTT CTCGATGTCG CTCTCGATCG GCAATTCGCG
ACGAACAGGA CTATTTATCT CACCTTATCC GCGCGCGGCG AAGGCGGCTA CGGCACGGCC
CTTGTCCGCG CGGCCCTCTC GCAGGATGGT CGGAGCCTGA CGGATGCGAA GGAGATCTTC
CGGATGAACC GGTTCACCCG GAAGGGGCAG CATTTCGGCT CACGCATTGC CATCGACAAG
GACGGCAGCC TGTTTTTCGG CATTGGCGAT CGCGGTGAAG GTGAACGCGC CCAGGACCCG
CACGACCATG CCGGCTCGGT CCTCCACATC AATGCCGACG GCAGCATCCC CGCCTCCAAC
CCGTTTCGTG GCGGTACTGG CGGCCTGGCC GAAATCTGGT CCACCGGACA TCGGAACCCC
CAGGGAATTA CCTTTGATCC GGAAGATGGC AAGCTCCTCA CAGTCGAGCA CGGCGCGCGC
GGCGGCGACG AAGTGAACAA TCCGCAGCCT GGCAGAAATT ACGGCTGGCC GGTGATCACC
TTCGGCAAGG ACTATTCCGG TGTGGAGATC GGCGAAGGCA CGGCGAAGGA AGGCCTGGAG
CAGCCGCTCT TTTACTGGGA CCCCTCGATC GCGCCGGGTG CGATTGCCGT ATACCGCGGC
AGCATGTTTC CAGAGTGGAA CGGCGATCTC TTGATCGCAG CACTGAAATA CCAGTTGCTT
ACCCGCCTCG ACCGCGACGA GACCGGCACG GTCACGGCCG AGGAACGTTT GTTCGACGGC
GAATTCGGCC GAATCCGCGA CGTCATCGTC GCTCCCGACG GGGCACTCAT CATGGTCACC
GATGAGGAAG ATGGCGAAGT GCTCAGGGTC TCCAAAGCCC CGACACAGTA G
 
Protein sequence
MRHRRWTALG SSNIPAILIS SMAAAAPFLV NFTAFAALAQ EAREFSTQTG TVLVETLASG 
LEHPWAVEAM PDGALIVTER PGRLRILRDG KLSDAIKGVP TVAAHGQGGL LDVALDRQFA
TNRTIYLTLS ARGEGGYGTA LVRAALSQDG RSLTDAKEIF RMNRFTRKGQ HFGSRIAIDK
DGSLFFGIGD RGEGERAQDP HDHAGSVLHI NADGSIPASN PFRGGTGGLA EIWSTGHRNP
QGITFDPEDG KLLTVEHGAR GGDEVNNPQP GRNYGWPVIT FGKDYSGVEI GEGTAKEGLE
QPLFYWDPSI APGAIAVYRG SMFPEWNGDL LIAALKYQLL TRLDRDETGT VTAEERLFDG
EFGRIRDVIV APDGALIMVT DEEDGEVLRV SKAPTQ