Gene Smed_3564 details


Gene Information

Locus tag: Smed_3564
Symbol:
ID: 5324452
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 3770980
End bp: 3772110
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 64%
IMG OID: 640792513
Product: oxidoreductase domain-containing protein
Protein accession: YP_001329214
Protein GI: 150398747
COG category: [R] General function prediction only
COG ID: [COG0673] Predicted dehydrogenases and related proteins
TIGRFAM ID:
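
The listed lengths can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is an untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 3770980
end_bp = 3772110

# Assumption: coordinates are inclusive on both ends, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1131
print(protein_length)  # 376
```

Under these assumptions both values agree with the Gene Length and Protein Length fields.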


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.0520703
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGGCA GGCGCAGGCT GGGCATCGGC CTGATCGGCA CGGGCTTCAT GGGCAAGGCC 
CATGCTCTCG GCTTCACGAT TGCAGCGCGG GTCTTCGACC TGCCTTTCGA GCTGGACCTC
GTATCCGTCG CGGACGTGAC TGTGGAGGGC GCGGAGGCGG CCAGGGGACG GCTCGGCTTC
CGCAAGGCGA CCACCGACTG GCGCGACCTT CTGATCGACC CGGAGATCGA CGTCATCGAT
ATCACCACGC CGAACCTCCT GCACAAGGAG ATGGCGCTCG CCGCGATTGC CCATGGTAAG
CACGTCTATT GCGAAAAGCC GCTGGCCCCG ACGGTCGCCG ACTGCGCCGA GATGGTTGCG
GCGGCCGAAA AGGCAGGGGT CGTCACCCAG CTCGGCTTCA ACTATCTCAA GAACCCGCTC
ATTTTTCTCG CCAGGGACAT CATCGAAAGC GGCGAGATCG GCGAGATACG GTCGTTTCGC
GGCGTTCACG CGGAGGACTT CATGGCGGAT CGGACCGTTC CCTGGGGCTG GCGGCTCGAT
CCGCGCAGCG GCGGCGGGGC TCTTGCCGAC ATCGGCAGCC ATATGATCGC CTGCATGCGC
CATCTCGTCG GGCCCGTTAG GTCCGTGCTG GCCGACTCCG TAATCCACGT TGCGGAACGC
CCGCTCGCTC GCGGCGCAAC AGAGACCCGC GCCGTCGAAG TCGACGACGT AACGCGGGCT
TTTGTTCGAT TCGAGAGCGG CGCAAGCGGG AGCTTCGAAG CCAACTGGAT CGCGACCGGC
CGCAAGATGC AGCACGACTT CGAAATTTAC GGGTCAAAAG GCAGCATCGT CTTCACGCAG
GAACGGCTGA ACGAAATCAA GATCTATTAT GCGGGCGACG ATATAAGGAG CCGCGGCTTC
CGCACCATCT GGGCGGGTCC GGAACATCCG CCCTACGGGG CGTTCTGCGT CGCTCCGGGC
CACCAGATCG GCTTCAACGA TCTGAAGGCG ATCGAGGTCC ACGAATTCCT GGAAGCGATC
GCGAATGGCG TCAGGACGTC TACCGATTTC CGCGAGGGTT ATGAGGTCCA GAAGGTCCTC
TCCGCGACCT ACCACTCCGC CCGGACGAAC GCCTGGGTGG AGATCGGGTG A
 
Protein sequence
MSGRRRLGIG LIGTGFMGKA HALGFTIAAR VFDLPFELDL VSVADVTVEG AEAARGRLGF 
RKATTDWRDL LIDPEIDVID ITTPNLLHKE MALAAIAHGK HVYCEKPLAP TVADCAEMVA
AAEKAGVVTQ LGFNYLKNPL IFLARDIIES GEIGEIRSFR GVHAEDFMAD RTVPWGWRLD
PRSGGGALAD IGSHMIACMR HLVGPVRSVL ADSVIHVAER PLARGATETR AVEVDDVTRA
FVRFESGASG SFEANWIATG RKMQHDFEIY GSKGSIVFTQ ERLNEIKIYY AGDDIRSRGF
RTIWAGPEHP PYGAFCVAPG HQIGFNDLKA IEVHEFLEAI ANGVRTSTDF REGYEVQKVL
SATYHSARTN AWVEIG
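
The sequences are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of how the blocks might be cleaned and a GC fraction computed; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the gene sequence is used as a demonstration, so the printed fraction is not the whole-gene GC content.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for display blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, copied from the record.
first_line = (
    "ATGAGCGGCA GGCGCAGGCT GGGCATCGGC "
    "CTGATCGGCA CGGGCTTCAT GGGCAAGGCC"
)

dna = clean_sequence(first_line)
print(len(dna))  # 60: six blocks of 10 bases
print(round(gc_fraction(dna), 2))
```

Passing the full gene sequence through the same two functions would give a value to compare with the GC content field.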