Gene Saro_1222 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1222 
Symbol 
ID3916520 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1274354 
End bp1275568 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content68% 
IMG OID640443959 
Productacetyl-CoA acetyltransferase 
Protein accessionYP_496501 
Protein GI87199244 
COG category[I] Lipid transport and metabolism 
COG ID[COG0183] Acetyl-CoA acetyltransferase 
TIGRFAM ID[TIGR01930] acetyl-CoA acetyltransferases 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0492457 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCAGC TTCGCCACGC CGCCATCGTC GCCCCCATCC GCACCGCCGT GGGCAAGTTC 
GGCGGCTCGC TGTCGCCTCT CACCGCCGGG CAACTGGGCG CAACGATCCT CACGGCGCTG
ATGGACCGCA CAAGGATCGA CCCCGCGCGC GTCGATGACG TGATCTTCGC GCAGGGTTAC
GGCAACGGCG AGGCGCCGTG CATCTCGCAC TGGTCGTGGC TGCTCGCGGG CCTGCCCGAG
GAAGTTCCCG GCTACCAGCT CGATCGCCGC TGCGGCTCGG GCCTCCAGTC GATCGTCAAT
GCGGCGATGA TGGTGCAGAC CGGGGTTTCC GACGTCGTCG TGGCGGGCGG CGTGGAATCG
ATGTCCAACG TCGAGCACTA TACCACTGAC GTCCGCAAGG GCGTGCGCGC GGGCTCGCTG
ACCCTTCACG ACCGCCTTAC CCGTGGCCGC GTGATGAGCC AGCCGATCGA GCGCTATGGC
GTGATCAGCG GCATGATCGA GACGGCGGAA AACCTCGCCA AGGACTTTGC CATCACCCGC
GAAGCCTGCG ACGCCTATGC CGTGCGCAGC CACCAGCGCG CGGCTGCTGC ATGGGCCAAC
GGCCTGTTCG ACGACGAACT CGTTCCGGTC TCCATCCCCC AGAAAAAGGG CGACCCCGTT
CTCTTCGCCC ACGACGAGGG TTACCGTGCC GACGCCAGCA TGGAAACGCT TGGCAAGCTG
CGCCCCCTCG AAGGCGGCGT CGTGACGGCA GGCAACGCCA GCCAGCAGAA CGACGCGGCC
GCCGCCTGCC TCGTCGTCGC GGAAGACAAG CTCGCCGAAC TCGGCCTCGA ACCCATCGCG
TGGTTCCATT CCTGGGCGGC AGCGGGCTGC GATCCGAGCC GCATGGGCTA TGGCCCTGTC
CCCGCTACCG AGCGCCTGTT CGCCCGCAAC GGCCTGACGT GGAACGACAT CGACCTCATC
GAACTGAACG AGGCCTTCGC CCCTCAGGTT CTCGCCTGCC TCAAGGGCTG GGGCTGGTCG
GACGACGACA GCCGCCACGA GATGCTGAAC GTCAATGGCT CGGGCATCAG CCTCGGCCAT
CCCATCGGCG CCACCGGCGG GCGCATCCTC GCCAACCTTA CGCGCGAATT GAAGCGGCGC
GGCGGGCGCT ATGGCCTTGA AACCATGTGC ATTGGTGGCG GTCAGGGAAT CGCGGCGGTG
TTCGAGGCGG CCTGA
 
Protein sequence
MTQLRHAAIV APIRTAVGKF GGSLSPLTAG QLGATILTAL MDRTRIDPAR VDDVIFAQGY 
GNGEAPCISH WSWLLAGLPE EVPGYQLDRR CGSGLQSIVN AAMMVQTGVS DVVVAGGVES
MSNVEHYTTD VRKGVRAGSL TLHDRLTRGR VMSQPIERYG VISGMIETAE NLAKDFAITR
EACDAYAVRS HQRAAAAWAN GLFDDELVPV SIPQKKGDPV LFAHDEGYRA DASMETLGKL
RPLEGGVVTA GNASQQNDAA AACLVVAEDK LAELGLEPIA WFHSWAAAGC DPSRMGYGPV
PATERLFARN GLTWNDIDLI ELNEAFAPQV LACLKGWGWS DDDSRHEMLN VNGSGISLGH
PIGATGGRIL ANLTRELKRR GGRYGLETMC IGGGQGIAAV FEAA