Gene Saro_3556 details


Gene Information

Locus tag: Saro_3556
Symbol:
ID: 5077705
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_009427
Strand:
Start bp: 172529
End bp: 173680
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 67%
IMG OID: 640481280
Product: Acetyl-CoA acetyltransferase-like protein
Protein accession: YP_001165942
Protein GI: 146275782
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID:
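
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 172529
end_bp = 173680

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that does not code for a residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1152
print(protein_length)  # 383
```

Under these assumptions the computed values agree with the listed Gene Length (1152 bp) and Protein Length (383 aa).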


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGGCG ACGTCTGCAT CGTCGGCATC GGCATCCACC CGTTCGGGCG CACCGACGGG 
CTATCGGGGC TGGAGCAGGG CGTCTTTGCC GTGCGCCAGG CACTGGGAGA TGCCGGAATC
GAGTGGGGCG ACGTCCAGTT CGCCTATGGC AGCTCGGATT CCGCCGGCAA CCCCGACACG
ATGGTCGACC GGCTGGGCCT TACGGGCATG CAGTTCATCA ACGTGCGCAA CGGGTGCGCT
GCGGGCGGAT CGGCGCTGTT CTCGGCGCAG ATGGCGATCA AGAGCGGCGA GTTCGACATC
GGCCTTGCCG TCGGCTTCGA CAAACATCCG CGCGGCGCGT TCAATGCCAT GCCGAGCGAG
TACAACCTGC CCGACTGGTA CGGCGAGGCG GGCTACATGA TCACCACGCA GTTCTTCGCG
AACAAGATCA TGCGCTACAT GCACGATCAC GGCATCAGCC AGCAGACGCT GGGCCGGGTG
GCGGAAAAGG CTTTCCGCAA CGCGGTGCAT GCCGATCACG CCTGGCGGCG CGAGCCGGTG
GACCTCGAGA CGATCCTCGA GGCGCCGCTG GTTTCCGACC CCTATACCAA GTACATGTTC
TGCTCGCCCG CCGAAGGCGG CGTCGCGCTG ATCCTGGCGA GCGAAAAGAA GGCGCGCGAA
CTGGGCAAGC CGCTGGTCCG CCTGAAGGCC GCGACGATGC GCACCCGGCC GCCCAAGTCG
TTCGAGGTCT TCGCACCCTC GATCGATATC GGCGGCGGCA AGGCGACCGC GACCCAGATC
GCCAGCGCCG ACGCGTTCCG CATGGCCGGC ATCGGGCCCG GCGACATCGC AGTCGCCCAG
CTCCAGGATA CCGAGGCCGG CGCCGAGATC ATGCACATGG CCGAGAACGG CTTCTGCAAG
GACGGCGAGC AGGAGCGCTG GCTGGCCGAA GGGCTGACCG AGGTGGGCGG CAAGCTGCCG
GTCAACACCG ACGGCGGCTG CCTTGCCTGC GGCGAACCCA TCGGCGCTTC GGGCCTGCGA
CAGGTCTACG AGAACGTCGT GCAACTTCGC GGGGACGGCG GCGGGCGCCA GGTGCCCGGC
AATCCCAAGA CCGCATACAG CCACGTCTAT GGCGCCCCGG GCGTCTCTGC CGTGACCATT
CTGGAACGCT GA
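
The GC content field can be recomputed from a sequence printed in this layout. The sketch below assumes the sequence is pasted as text in blocks of ten bases separated by spaces and line breaks, and it uses a hypothetical helper named gc_content. It is demonstrated on the first ten-base block only, not on the full gene.

```python
def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First ten-base block of the gene sequence: 7 of 10 bases are G or C.
first_block = "ATGAGCGGCG"
print(round(gc_content(first_block), 2))  # 0.7
```

Passing the complete gene sequence to the same function gives the value to compare against the listed GC content.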
 
Protein sequence
MSGDVCIVGI GIHPFGRTDG LSGLEQGVFA VRQALGDAGI EWGDVQFAYG SSDSAGNPDT 
MVDRLGLTGM QFINVRNGCA AGGSALFSAQ MAIKSGEFDI GLAVGFDKHP RGAFNAMPSE
YNLPDWYGEA GYMITTQFFA NKIMRYMHDH GISQQTLGRV AEKAFRNAVH ADHAWRREPV
DLETILEAPL VSDPYTKYMF CSPAEGGVAL ILASEKKARE LGKPLVRLKA ATMRTRPPKS
FEVFAPSIDI GGGKATATQI ASADAFRMAG IGPGDIAVAQ LQDTEAGAEI MHMAENGFCK
DGEQERWLAE GLTEVGGKLP VNTDGGCLAC GEPIGASGLR QVYENVVQLR GDGGGRQVPG
NPKTAYSHVY GAPGVSAVTI LER
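
The protein sequence can be checked against the gene sequence by translating it. This is a minimal sketch rather than a full implementation: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons), it reads from the first base in frame, and it stops at the first stop codon. The names CODON_TABLE and translate are illustrative.

```python
from itertools import product

# Codon assignments in TCAG order; identical for the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence.
print(translate("ATGAGCGGCG ACGTCTGCAT CGTCGGCATC"))  # MSGDVCIVGI
print(translate("CTGGAACGCT GA"))                     # LER
```

Both fragments match the corresponding ends of the protein sequence: it begins with MSGDVCIVGI and ends with LER.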