Gene Information |
Locus tag | Saro_3825 |
Symbol | |
ID | 5077973 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | + |
Start bp | 479561 |
End bp | 480763 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640481548 |
Product | beta-ketoadipyl CoA thiolase |
Protein accession | YP_001166210 |
Protein GI | 146276050 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0183] Acetyl-CoA acetyltransferase |
TIGRFAM ID | [TIGR01930] acetyl-CoA acetyltransferases; [TIGR02430] beta-ketoadipyl CoA thiolase |
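The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes 1-based inclusive coordinates and a gene length that includes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 479561
END_BP = 480763


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if includes_stop else codons


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1203
print(protein_length(length_bp))  # 400
```

Both results match the Gene Length (1203 bp) and Protein Length (400 aa) rows.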
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCAGG CTTTCGTTTG CGACGCGGTG CGCACCCCGA TCGGGCGCCT CAACGGCAGT CTCTCGGGGA TCCGGGCCGA CGATCTGGCC GCACTGCCCC TGCGCGCGCT CATGGAACGC AATCCGCAGG TCGATTGGGC TGCGCTCGAC GATGTCGTGC TCGGCTGCGC CAACCAGGCG GGCGAGGACA ACCGCAATCT CGCACGAATG GCCCTGCTGC TGGCGGGAAT GCCCGAGGCC GTGCCTGGTG CGACGATCAA CCGCCTCTGC GGATCGGGCA TGAACGCCGT CGGCATTGCC GCGCAGGCAA TCCGCAGCGG CGATGCCGAC CTCATGATCG CGGGCGGCGC AGAAAGCATG ACGCGTGCGC CCTATGTCCT GGGCAAGGCC GGCAGCGCAT TCGGACGCGA TCAGAAGATC GAGGACACCA CGCTTGGCTG GCGCTTCGTC AACCCGGCGA TGAAGCGCGC CTTCGGCGTC GATACCATGC CGCAGACGGC GGAGAACGTC GCCGCGCAAT GGAACGTCGG TCGCGAAGAG CAGGACCGCT TTGCCCTTGC CAGCCAGGAC AAGACCGCTG CCGCGCAGGC AAGGGGCCGC CTCGCGCTTG AGATCGTTGG CGTCAGCGTA CCTTCCGGGA AGGGCCAGAC CCGCGAATTC ACGCAGGACG AACACCCTCG CTCCACCACG CTGGACGTGC TCTCCGGCCT GCGCCCGGTT GTCCATAAAG AAGGCACCGT CACTGCCGGC AACGCTTCCG GCCTCAACGA TGGCGCTGCC GCGATGATCG TCGCGAGCGA ACAGGCCGCC GCCGCCAACG GCCTTGTGCC CCGCGCGCGC ATTGTCGCCA TGGCTTCGGC CGGCGTCGCA CCGCGCGTGA TGGGTATCGG TCCGGTCGAC GCCGCGCGTC GCCTTTTCGC CCGCACAGGC CTCTCGATGG AACGGATGGA CGTTATCGAA CTGAACGAGG CTTTCGCCGC TCAGGCGATC CCGGTCCTGC GAGATCTCGG CGTCGATCCC CTCGATCCGC GGGTAAACCC CAACGGCGGC GCGATCGCGC TGGGTCATCC TCTTGGCATG TCGGGGGCAA GGCTGGTGCA GACCGCAGTG CAGGAACTGC AAGAGACCGG AGGGCGCCAC GCCCTTGCCA TGATGTGCGT TGGCGTTGGC CAGGGCATCG CCATGATCCT GGAGCGCGTG TGA
|
Protein sequence | MTQAFVCDAV RTPIGRLNGS LSGIRADDLA ALPLRALMER NPQVDWAALD DVVLGCANQA GEDNRNLARM ALLLAGMPEA VPGATINRLC GSGMNAVGIA AQAIRSGDAD LMIAGGAESM TRAPYVLGKA GSAFGRDQKI EDTTLGWRFV NPAMKRAFGV DTMPQTAENV AAQWNVGREE QDRFALASQD KTAAAQARGR LALEIVGVSV PSGKGQTREF TQDEHPRSTT LDVLSGLRPV VHKEGTVTAG NASGLNDGAA AMIVASEQAA AANGLVPRAR IVAMASAGVA PRVMGIGPVD AARRLFARTG LSMERMDVIE LNEAFAAQAI PVLRDLGVDP LDPRVNPNGG AIALGHPLGM SGARLVQTAV QELQETGGRH ALAMMCVGVG QGIAMILERV
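As a rough illustration of how the gene sequence relates to the protein sequence, the sketch below translates the first 30 bases of the gene sequence and computes the GC fraction of that fragment. It is a minimal example, not the pipeline used to produce this record: the helper names are hypothetical, the codon table uses the standard amino acid assignments (which translation table 11 shares), and alternative start codons are not handled.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G, third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces used to group bases in blocks of ten and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence above.
fragment = "ATGACCCAGG CTTTCGTTTG CGACGCGGTG"
print(translate(fragment))              # MTQAFVCDAV
print(round(gc_fraction(fragment), 2))  # 0.6
```

The translated fragment matches the first ten residues of the protein sequence. The GC fraction printed here applies only to the 30-base fragment; running `gc_fraction` on the full gene sequence is the way to compare against the GC content row.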