Gene Smed_5096 details


Gene Information

Locus tag: Smed_5096
Symbol:
ID: 5319398
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 44227
End bp: 45414
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 64%
IMG OID: 640776874
Product: acetyl-CoA acetyltransferase
Protein accession: YP_001313806
Protein GI: 150377211
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID: [TIGR01930] acetyl-CoA acetyltransferases
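
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 44227
end_bp = 45414

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both endpoints.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the final codon is assumed to be the stop codon,
# which does not contribute a residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, codon_count, protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length (1188 bp) and Protein Length (395 aa).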


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.392469
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.97471
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAACC AGGATCCGGT CGTCATTGTC GGGCAGGCTA GGACGCCGCT TGGCAGCTTT 
CAGGGCGAGC TTAAAGACTT TTCCGCCCCC GATCTCGGCG GTGCCGCAAT CGCCGATGCT
CTGAAGCGCG CCGGGATTGC GCCGGATGCG GTCGATGAGG TGGTGTTCGG ATGCGTCCTG
ACCGCGGGAC AGGGGCAGGC GCCGGCGCGT CAGGCCGCAC TCAGCGCGGG TCTTCCGCTG
GGCGTCGGCG CGACAACCGT GAACAAGATG TGCGGTTCGG GCATGAAGGC AGCGATGCTG
GCGCACGACC TGATCAAAGC GGAATCCGCC TCGATCGTGG TTGCCGGCGG CATGGAAAGC
ATGACCAATG CGCCCTATCT GCTCGACCGT GCGCGGCAGG GCTATCGCAT CGGGCATCAG
AGGGCTCTCG ACCACATGTT CCTCGATGGC CTGGAGGATG CTTACGACAA GGGGCGCCTG
ATGGGTACCT TTGCCGAGGA TTGCGCCGAA GCCTATCAGT TCACCCGTGG CGCTCAGGAC
GATTATGCAA TCGGGTCCCT CGAGAAAGCC AAGAAGGCAG GTGCCGATGG CAGCTTTGCC
AACGAGATCG TTCCGCTCAG CACCGGCTCG GGCAAGGGGG GCACTGTCAG CCTGGACGAG
CAACCGCAGA AGGCGCGGGC CGAGAAAATC CCCCTGTTGA AGCCCGCTTT TCGCGAGGGC
GGCACGGTCA CGGCTGCAAA CGCGTCTTCA ATTTCGGATG GCGCGGCGGC GCTCGTGCTG
ATGAGGCGAT CGGCAGCGGA AACACAGGGC GTCACACCTC TGGCGATCGT CCGCGGTCAT
GCCACCCATG CTGATGCCCC AAAGCTCTTT CCGACGGCGC CGATCGGAGC GATCACTGCA
CTCTGCCAGC GCATCGGCTG GGACATCGGC GGTGTCGACC TCTTCGAAAT CAACGAGGCC
TTTGCCGTCG TGCCGATGGC GGCGATCCGC GATCTCGGCC TTGCCGCGGA AAAAGTGAAC
GTCAATGGCG GTGCCTGCGC ATTGGGCCAT CCGATCGGAG CCTCGGGCGC CCGAGTGATC
GTCACGCTCG TCAATGCGTT GCGGCGCCGT GGCCTGAGAC GCGGCATCGC TTCTGTTTGT
ATCGGCGGCG GCGAGGCGAC GGCCGTTGCC GTGGAAGTTT CGGACTGA
 
Protein sequence
MNNQDPVVIV GQARTPLGSF QGELKDFSAP DLGGAAIADA LKRAGIAPDA VDEVVFGCVL 
TAGQGQAPAR QAALSAGLPL GVGATTVNKM CGSGMKAAML AHDLIKAESA SIVVAGGMES
MTNAPYLLDR ARQGYRIGHQ RALDHMFLDG LEDAYDKGRL MGTFAEDCAE AYQFTRGAQD
DYAIGSLEKA KKAGADGSFA NEIVPLSTGS GKGGTVSLDE QPQKARAEKI PLLKPAFREG
GTVTAANASS ISDGAAALVL MRRSAAETQG VTPLAIVRGH ATHADAPKLF PTAPIGAITA
LCQRIGWDIG GVDLFEINEA FAVVPMAAIR DLGLAAEKVN VNGGACALGH PIGASGARVI
VTLVNALRRR GLRRGIASVC IGGGEATAVA VEVSD
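
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the tool used to produce the record: the function name `translate` is hypothetical, and the codon table is the standard one, whose codon-to-amino-acid assignments are shared by translation table 11 (table 11 differs in which codons may act as initiators, which does not matter here because the gene starts with ATG).

```python
# Standard codon assignments, in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
first_block = translate("ATGAACAACC AGGATCCGGT CGTCATTGTC")
# Last 18 bases of the gene sequence, ending in the TGA stop codon.
last_block = translate("GTGGAAGTTT CGGACTGA")
print(first_block, last_block)
```

The two fragments translate to the first ten and last five residues of the protein sequence listed above (MNNQDPVVIV and VEVSD).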