Gene Saro_3825 details

Gene Information

Locus tag: Saro_3825
Symbol:
ID: 5077973
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_009427
Strand:
Start bp: 479561
End bp: 480763
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 67%
IMG OID: 640481548
Product: beta-ketoadipyl CoA thiolase
Protein accession: YP_001166210
Protein GI: 146276050
COG category: [I] Lipid transport and metabolism
COG ID: [COG0183] Acetyl-CoA acetyltransferase
TIGRFAM ID: [TIGR01930] acetyl-CoA acetyltransferases
            [TIGR02430] beta-ketoadipyl CoA thiolase
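
The coordinate and length fields above can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 479561
end_bp = 480763
gene_length_bp = 1203
protein_length_aa = 400

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons. Assumption: the last codon is the
# stop codon, which does not contribute a residue to the protein.
codons, remainder = divmod(span_bp, 3)
residues = codons - 1

print(span_bp == gene_length_bp, remainder == 0, residues == protein_length_aa)
```

Under those assumptions the three fields agree with each other: 1203 bases, 401 codons, 400 residues.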


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCCAGG CTTTCGTTTG CGACGCGGTG CGCACCCCGA TCGGGCGCCT CAACGGCAGT 
CTCTCGGGGA TCCGGGCCGA CGATCTGGCC GCACTGCCCC TGCGCGCGCT CATGGAACGC
AATCCGCAGG TCGATTGGGC TGCGCTCGAC GATGTCGTGC TCGGCTGCGC CAACCAGGCG
GGCGAGGACA ACCGCAATCT CGCACGAATG GCCCTGCTGC TGGCGGGAAT GCCCGAGGCC
GTGCCTGGTG CGACGATCAA CCGCCTCTGC GGATCGGGCA TGAACGCCGT CGGCATTGCC
GCGCAGGCAA TCCGCAGCGG CGATGCCGAC CTCATGATCG CGGGCGGCGC AGAAAGCATG
ACGCGTGCGC CCTATGTCCT GGGCAAGGCC GGCAGCGCAT TCGGACGCGA TCAGAAGATC
GAGGACACCA CGCTTGGCTG GCGCTTCGTC AACCCGGCGA TGAAGCGCGC CTTCGGCGTC
GATACCATGC CGCAGACGGC GGAGAACGTC GCCGCGCAAT GGAACGTCGG TCGCGAAGAG
CAGGACCGCT TTGCCCTTGC CAGCCAGGAC AAGACCGCTG CCGCGCAGGC AAGGGGCCGC
CTCGCGCTTG AGATCGTTGG CGTCAGCGTA CCTTCCGGGA AGGGCCAGAC CCGCGAATTC
ACGCAGGACG AACACCCTCG CTCCACCACG CTGGACGTGC TCTCCGGCCT GCGCCCGGTT
GTCCATAAAG AAGGCACCGT CACTGCCGGC AACGCTTCCG GCCTCAACGA TGGCGCTGCC
GCGATGATCG TCGCGAGCGA ACAGGCCGCC GCCGCCAACG GCCTTGTGCC CCGCGCGCGC
ATTGTCGCCA TGGCTTCGGC CGGCGTCGCA CCGCGCGTGA TGGGTATCGG TCCGGTCGAC
GCCGCGCGTC GCCTTTTCGC CCGCACAGGC CTCTCGATGG AACGGATGGA CGTTATCGAA
CTGAACGAGG CTTTCGCCGC TCAGGCGATC CCGGTCCTGC GAGATCTCGG CGTCGATCCC
CTCGATCCGC GGGTAAACCC CAACGGCGGC GCGATCGCGC TGGGTCATCC TCTTGGCATG
TCGGGGGCAA GGCTGGTGCA GACCGCAGTG CAGGAACTGC AAGAGACCGG AGGGCGCCAC
GCCCTTGCCA TGATGTGCGT TGGCGTTGGC CAGGGCATCG CCATGATCCT GGAGCGCGTG
TGA
 
Protein sequence
MTQAFVCDAV RTPIGRLNGS LSGIRADDLA ALPLRALMER NPQVDWAALD DVVLGCANQA 
GEDNRNLARM ALLLAGMPEA VPGATINRLC GSGMNAVGIA AQAIRSGDAD LMIAGGAESM
TRAPYVLGKA GSAFGRDQKI EDTTLGWRFV NPAMKRAFGV DTMPQTAENV AAQWNVGREE
QDRFALASQD KTAAAQARGR LALEIVGVSV PSGKGQTREF TQDEHPRSTT LDVLSGLRPV
VHKEGTVTAG NASGLNDGAA AMIVASEQAA AANGLVPRAR IVAMASAGVA PRVMGIGPVD
AARRLFARTG LSMERMDVIE LNEAFAAQAI PVLRDLGVDP LDPRVNPNGG AIALGHPLGM
SGARLVQTAV QELQETGGRH ALAMMCVGVG QGIAMILERV
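
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short script. This is a sketch under stated assumptions, not the method the database used: `translate` and `gc_fraction` are hypothetical helper names, the codon table is the standard NCBI amino-acid assignment (which translation table 11 shares with table 1 for all 64 codons; alternative start codons are not handled here), and only the first 30 bases of the gene sequence are used as input.

```python
from itertools import product

# Amino-acid assignments in NCBI order (first base varies slowest, bases
# ordered T, C, A, G). Table 11 uses these same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(dna)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGACCCAGG CTTTCGTTTG CGACGCGGTG"
print(translate(first_blocks))  # MTQAFVCDAV
```

The printed residues match the first block of the protein sequence. Passing the complete gene sequence to `translate` and `gc_fraction` would allow the same comparison against the full protein sequence and the GC content field.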