Gene Smed_5094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5094 
Symbol 
ID5319396 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp42589 
End bp43791 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content64% 
IMG OID640776873 
Productbeta-ketoadipyl CoA thiolase 
Protein accessionYP_001313805 
Protein GI150377210 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases
[TIGR02430] beta-ketoadipyl CoA thiolase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGATG CTTTTATCTG CGACTACGTT CGCACTCCCA TCGGCCGTTA CGGCGGTGCC 
TTGTCGTCTG TGCGGGCCGA TGACCTCGGC AGCGTTCCGC TCAAGGCGCT TGTCGAGCGC
CATCCCTCGG TCGATTGGGA GGCGGTCGAC GACGTGATCT TCGGATGCGC CAACCAGGCC
GGCGAAGACA ACCGCAACGT AGCGCGTATG TCGCTGCTGC TGGCCGGGCT TCCGCCCCAG
GTTTCGGGCA CGACGATCAA CCGCCTTTGC GGTTCCGGCA TGGACGCGGT CATCGCCGCT
GCCCGGGCCA TCAAGTCCGG TGAAGCGGAA TTGATGATCG CGGGCGGCGT CGAGAGCATG
AGCCGTGCTC CCATGGTCAT CCCCAAGGCT GACGCCGCGT TCTCGCGCAA TGCGGAAATC
TACGACACAA CCATCGGATG GCGCTTCGTC AACCCGTTGA TGAAGAGCCA GTACGGCGTC
GATTCCATGC CGGAGACGGG CGAAAACGTC GCTGAGGACT ACAATGTCAG CCGCGAGGAC
CAGGATGCGT TTGCTGTCCT TAGCCAGGCG AAGGCCGCGC GGGCCCAGGA AGACGGCCGT
CTGGCCAAGG AGGTCGTGCC CGTAACGGTC CCGCAGCGAA AGGGCGAACC GATCGTCGTG
CAAAAGGACG AACATCCCCG CGCGACGACC ATCGAGGCGC TTGCGAAGTT GCCGACCCCT
TTCCGCAAGG GTGGCTCGGT CACGGCCGGG AACGCGTCGG GAGTGAGCGA TGGCGCGGCC
GCCCTGATCA TTGCGTCAGC CGAAGCAGCT CAAAAGTACG GTCTCAAACC GGTCGCGAGA
ATTATTGCCG GCGCGACGGC GGGCGTGTCA CCCCGTGTCA TGGGTATCGG GCCGGCCCCG
GCATCGCAGA AGTTGATGCG CCTGACGGGC CTCAAGCAGG AACAGTTCGA TGTAATCGAG
CTGAACGAGG CATTCGCGTC GCAAGGGCTG GCCGTACTTC GTGCGCTTGG CATTGCTGAC
GACGACGCAC GTGTGAACCG TAATGGTGGA GCGATCGCGC TTGGACACCC GCTTGGCATG
TCCGGGGCTC GTATCACGGG CACGGCAGCC TTGGAGCTTG TCGAGACGGG TGGGCGCTAT
TCACTTTCAA CGATGTGCAT CGGCGTAGGT CAGGGCATCG CTGTAGCGCT TGAGCGGGTG
TGA
 
Protein sequence
MRDAFICDYV RTPIGRYGGA LSSVRADDLG SVPLKALVER HPSVDWEAVD DVIFGCANQA 
GEDNRNVARM SLLLAGLPPQ VSGTTINRLC GSGMDAVIAA ARAIKSGEAE LMIAGGVESM
SRAPMVIPKA DAAFSRNAEI YDTTIGWRFV NPLMKSQYGV DSMPETGENV AEDYNVSRED
QDAFAVLSQA KAARAQEDGR LAKEVVPVTV PQRKGEPIVV QKDEHPRATT IEALAKLPTP
FRKGGSVTAG NASGVSDGAA ALIIASAEAA QKYGLKPVAR IIAGATAGVS PRVMGIGPAP
ASQKLMRLTG LKQEQFDVIE LNEAFASQGL AVLRALGIAD DDARVNRNGG AIALGHPLGM
SGARITGTAA LELVETGGRY SLSTMCIGVG QGIAVALERV