Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3559 |
Symbol | |
ID | 5077708 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | + |
Start bp | 175909 |
End bp | 176922 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640481283 |
Product | transketolase, central region |
Protein accession | YP_001165945 |
Protein GI | 146275785 |
COG category | [C] Energy production and conversion |
COG ID | [COG0022] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.00351586 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAAAA TGATGTACCG CGACGCCGTC GTCTCGACGA TCGCAGAGGA AATGGAGCGC GACGAGAACG TCGTCATGCT CGGTGAAGAC ATTGTCGGCG GCATGGGCAC GCCGGGTGGG CCCGAGGCCA TCGGCGGCAT CTGGTCGACC TCCACCGGCC TGTTCGGCAA GTTCGGCGCG GACCGCGTGA TCGACACGCC GATCTCGGAA AGCGCGATCA TGGGCGCGGC GGCAGGCCTT GCGCTTTCGG GCAAGCGCCC GATCGCCGAG CTGATGTTCG CCGACTTCAT CGGCGTTTCC CTCGACCAGA TCTGGAACCA GCTCGCCAAG TTCCGCTACA TGTTCGGCGG CAAGACCAAG TGCCCGGCAG TGATCCGCAT GGCCTATGGC GCGGGCTACA ACGCCGCCGC GCAGCATAGC CAGGCGGTCC ACCAGATCCT GACCGGCATG CCGGGCCTCA AGGTGGTCAT GCCGACCACG CCTGCCGACG TGAAGGGCCT GCTGCGCACC GCGATCCGCG ACGACGATCC GGTGATCTTC CTCGAGCACA AGGCGCTCTA CGGCGTTTCC GGCGAAGTGC CCGACGATCC GGACTTCATG ATCCCGTTCG GCCACGCCCG CCTTTCGCGC GCCGGCCAGG ACGTGACGAT CGTCTCGACC GGCCTGCTGC TGGGATTCTG CGAGGCGGTG GCCGACAAGC TTGCCGCCGA GGGCATCGGC TGCGACGTGA TCGACCTGCG CACCACCAGC CCGATCGACG AGGAAACGAT CCTCGATTCG GTCGAGGTGA CCGGCCGCCT CGTCGTCGTC GACGAAGCGC CGCCCCGGTG CAGCCTTGCG TCCGACATCT GTGCGACAGT TGCCGAAAAG GGCTTCGCCG CGCTCAAGGC TCCGCCGCAG GCGGTCAACC CGCCACACAC CCCGATCCCG TTCGCGCGTG AGCTGGAATC TGCCTACCTT CCTTCGGTCG ACAAGATCGA AGCGGCGGTG CGCAAGGTTC TGGCTTACCG CTGA
|
Protein sequence | MAKMMYRDAV VSTIAEEMER DENVVMLGED IVGGMGTPGG PEAIGGIWST STGLFGKFGA DRVIDTPISE SAIMGAAAGL ALSGKRPIAE LMFADFIGVS LDQIWNQLAK FRYMFGGKTK CPAVIRMAYG AGYNAAAQHS QAVHQILTGM PGLKVVMPTT PADVKGLLRT AIRDDDPVIF LEHKALYGVS GEVPDDPDFM IPFGHARLSR AGQDVTIVST GLLLGFCEAV ADKLAAEGIG CDVIDLRTTS PIDEETILDS VEVTGRLVVV DEAPPRCSLA SDICATVAEK GFAALKAPPQ AVNPPHTPIP FARELESAYL PSVDKIEAAV RKVLAYR
|
| |