Gene Smed_3922 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3922 
Symbol 
ID5318707 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp373568 
End bp375655 
Gene Length2088 bp 
Protein Length695 aa 
Translation table11 
GC content65% 
IMG OID640775732 
Producttransketolase 
Protein accessionYP_001312665 
Protein GI150376069 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0021] Transketolase 
TIGRFAM ID[TIGR00232] transketolase, bacterial and yeast 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGTTT CGCAGCAGAT CGAACCCCGC GCCGCCGCCT CGGAGCGCAA CATGGCCGAC 
GCCATCCGGT TTCTTTCCAT GGATGCCGTC GAGAAGGCCA ATTCCGGTCA TCCGGGCATG
CCGATGGGCA TGGCGGACGC GGTTACCGTG CTCTTCAACC GCTTCATCAG AATCGATCCG
TCGCTCCCCG ACTGGCCCGA CCGCGACCGT TTCGTGCTTT CGGCCGGCCA TGGCTCGATG
CTGCTCTATT CCCTCCATCA CCTCATCGGC TTTGCGGACA TGCCGATGGC CGAGCTTTCG
TCCTTCCGGC AACTCGGCTC GAAAACGGCC GGCCATCCCG AATACGGCCA TGCCCTCGGC
ATCGAGACCA CCACCGGCCC GCTGGGCCAG GGGATCTCGA CCGCTGTCGG GATGGCGATG
GCCGAACAGA TGATGGCCTC CCGGTTCGGC AGTGCTCTGT GCAACCACTT CACCTATGTC
GTAGCCGGCG ACGGCTGCCT TCAGGAAGGC ATCAGCCACG AGGCTATCGA CCTTGCCGGA
CATTTAAAGC TGCGCAAGCT GGTCGTGCTG TGGGACGACA ACCGAATATC GATCGACGGA
TCGACGGATC TCTCTACCTC GATGAACCAG CTCGCGCGTT TCCGCGCCGC CAGCTGGGAC
GCCCAAGCCG TCGATGGCCA CGACCCGGAA GCGGTTGCGA AAGCCCTGGA AAGAGCACGC
CGGACCCGCA AGCCGTCGCT GATCGCCTGC CGCACCCGGA TCGGCAAGGG TGCAGCCAGC
ATGGAAGGCT CGCACAAAAC CCACGGCGCG GCGCTCGGCG ACAAGGAAAT CGCAGCCACA
CGCGAAAAAC TTGGCTGGCC GCATCCGCCC TTCTTCGTTC CGCCTGAGAT AAGGGCTGCC
TGGGCAAAGG TGGCGGCTCG AGGTCGCACG GCTCGCGAGG CCTGGGATAT CCGCCTCGAC
GCCTCGCGCT CGAAAAAGCG CTACGAGCAG ACCATAAGGC GGCAGTTTGA CGGCGAACTC
GGCGATCTGC TTGCAAAATT CCGGAGCGCG CATCGCACAA GGGCTACGAA AGTTGCGACG
CGTCAGGCCT CGCAGATGGC GCTGGAGGTC ATAAACGGCG CGACCGCTTT GACGATCGGC
GGCTCGGCCG ACCTGACCGG CTCCAACCTG ACGATGACCT CGCAGACCCA GCCCATCTCG
CCGGGCAATT TCAAGGGCCG TTATCTGCAT TACGGCATCC GCGAGCACGG CATGGCGGCC
GCTATGAACG GCATCGCGCT TCATGGCGGC TTCATCCCCT ATGGCGGCAC TTTCCTGGTC
TTCTCCGACT ATGCCCGCGG TGCGATGCGC CTCTCGGCCC TGATGGGCCT GCCCGTCATT
TACGTGCTGA CGCATGATTC CATCGGGCTC GGCGAGGACG GACCGACCCA CCAGCCGGTC
GAGCATCTGG CCATGCTGCG CGCCACGCCC AACCTCAACG TTTTCCGGCC GGCCGACATC
ATCGAGACGG CAGAATGCTG GGAGATCGCG CTTGGCGAGA AGAATACGCC GAGCGTCCTC
GCCCTTTCGC GTCAGGCCCT GCCGATGCTG CGCCGGACGG AAGGCAACGA GAACCAGTCG
GCGCTCGGGG CGTATGTTCT GAGGGAAGCG CGCGGCAACC GGGACATCAC GATCCTTGCC
ACGGGATCCG AAGTCGAGAT CGCCGTCGCT GCCGCCGAGC GCCTGCAGGC CGAGGAAGGC
ATCGCGGCGG CAGTGGTCTC CATGCCCTGC TGGGAGAAGT TCGAGGTTCA GGACCTTGCC
TATCGGAGGA AGGTCCTCGG CGACGCGCCC CGCATCGCCA TCGAGGCGGC GGGCCGGCTC
GGCTGGGACC GATGGATGGG GCCGGACGGT GCCTTCGTCG GCATGACCGG CTTTGGTGCC
TCGGCACCGG CAGGCGACCT CTACCGGCAT TTCGGCATTA CCGCCGACCA TGTCGTCGCA
GAAGCCCTGG AGCTTCTCCG CCGCGCATAC TCGGAAACTC TGCCCATAGG TGCCCGGATC
GGTCCGCACC CATCCGCACA CACCGTCAGA TCATCGCAGG AGGCATGA
 
Protein sequence
MNVSQQIEPR AAASERNMAD AIRFLSMDAV EKANSGHPGM PMGMADAVTV LFNRFIRIDP 
SLPDWPDRDR FVLSAGHGSM LLYSLHHLIG FADMPMAELS SFRQLGSKTA GHPEYGHALG
IETTTGPLGQ GISTAVGMAM AEQMMASRFG SALCNHFTYV VAGDGCLQEG ISHEAIDLAG
HLKLRKLVVL WDDNRISIDG STDLSTSMNQ LARFRAASWD AQAVDGHDPE AVAKALERAR
RTRKPSLIAC RTRIGKGAAS MEGSHKTHGA ALGDKEIAAT REKLGWPHPP FFVPPEIRAA
WAKVAARGRT AREAWDIRLD ASRSKKRYEQ TIRRQFDGEL GDLLAKFRSA HRTRATKVAT
RQASQMALEV INGATALTIG GSADLTGSNL TMTSQTQPIS PGNFKGRYLH YGIREHGMAA
AMNGIALHGG FIPYGGTFLV FSDYARGAMR LSALMGLPVI YVLTHDSIGL GEDGPTHQPV
EHLAMLRATP NLNVFRPADI IETAECWEIA LGEKNTPSVL ALSRQALPML RRTEGNENQS
ALGAYVLREA RGNRDITILA TGSEVEIAVA AAERLQAEEG IAAAVVSMPC WEKFEVQDLA
YRRKVLGDAP RIAIEAAGRL GWDRWMGPDG AFVGMTGFGA SAPAGDLYRH FGITADHVVA
EALELLRRAY SETLPIGARI GPHPSAHTVR SSQEA