Gene Information |
Locus tag | Saro_3687 |
Symbol | |
ID | 5077835 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | - |
Start bp | 321553 |
End bp | 322593 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640481410 |
Product | 4-hydroxy-2-ketovalerate aldolase |
Protein accession | YP_001166072 |
Protein GI | 146275912 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0119] Isopropylmalate/homocitrate/citramalate synthases |
TIGRFAM ID | [TIGR03217] 4-hydroxy-2-oxovalerate aldolase |
|
|
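The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (which agrees with the reported gene length) and that the last codon of the CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(321553, 322593)
print(length, protein_length(length))  # prints: 1041 346
```

Both results match the Gene Length (1041 bp) and Protein Length (346 aa) fields of this record.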
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACATCGA ATTTCAACGT GGAAGCGGGC GACAAGCTCT ACATCCAGGA CGTCACCCTG CGCGACGGCA TGCACGCGGT GCGCCACATG TACGGCATCG ACCATGTCCG CTCGATCGCG TCCGCGCTCG ACAAGGCCGG CGTCGATGCG ATCGAGGTCG CCCACGGTGA CGGCCTTTCG GGAGCCAGCT TCAACTACGG CTTCGGCGCC CACACCGACT GGGAATGGCT GGAGGCCGTG GCCGACGTGC TGGAGAAGAG CGTCCTCACC ACGCTCATCC TTCCCGGCGT CGGCACCGTC GAGGAACTGC GCCGCGCCTA TGACATCGGC GTCCGCTCGG TCCGCGTCGC GACCCACTGC ACCGAGGCCG ACGTCAGCAA GCAGCACATC GGCATCGCCC GCGATCTCGG CATGGACGTG TCGGGCTTCC TGATGATGAG CCACATGATC GAACCCGAAG CGCTGGCGCA GCAGGCATCG CTGATGGAAA GCTACGGCGC GCAATGCGTC TATGTCACCG ACAGCGGCGG CGCGCTCGAC ATGGACGGCG TGAAGGCCCG CCTCGAAGCC TATGACCGCG TACTCAAGCC AGAAACCCAG CGCGGCATCC ACGCCCACCA CAACCTCGCG CTCGGCGTCG CTAACTCGAT CGTCGCGGCG CAATGTGGCG CGGTGCGCAT CGACGCCTCG CTGACCGGCA TGGGTGCGGG TGCGGGCAAT GCGCCGCTCG AAGTTTTCAT CGCCGCCGCC GACCGCAAGG GCTGGAACCA CGGCTGCGAC GTGATGATGC TGATGGACGC GGCCGAAGAT CTCGTCCGGC CGCTGCAGGA CCGCCCGGTC CGCGTCGACC GCGAGACTTT GGCGCTCGGC TATGCGGGGG TCTACTCCAG CTTCCTGCGT CACGCCGAGA AGGCGGCTGA GACCTATGGC CTCGATACGC GCACGATCCT CGTCGAACTG GGTCGCCGCA AGATGGTCGG CGGCCAGGAA GACATGATCG TCGACGTCGC GCTCGACATG CTCAAGGAAC AGCAGGCCTG A
|
Protein sequence | MTSNFNVEAG DKLYIQDVTL RDGMHAVRHM YGIDHVRSIA SALDKAGVDA IEVAHGDGLS GASFNYGFGA HTDWEWLEAV ADVLEKSVLT TLILPGVGTV EELRRAYDIG VRSVRVATHC TEADVSKQHI GIARDLGMDV SGFLMMSHMI EPEALAQQAS LMESYGAQCV YVTDSGGALD MDGVKARLEA YDRVLKPETQ RGIHAHHNLA LGVANSIVAA QCGAVRIDAS LTGMGAGAGN APLEVFIAAA DRKGWNHGCD VMMLMDAAED LVRPLQDRPV RVDRETLALG YAGVYSSFLR HAEKAAETYG LDTRTILVEL GRRKMVGGQE DMIVDVALDM LKEQQA
|
| |
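The sequences above are printed in space-separated blocks of ten characters. The gene sequence starts with ATG and ends with TGA, so although the gene lies on the minus strand, it appears to be listed in coding orientation. The sketch below shows one way to strip the block formatting, compute GC content, and translate with NCBI translation table 11. It is an illustration under stated assumptions, not the method used by the database: the helper names are hypothetical, and alternative start codons (which table 11 permits) are not handled because this sequence starts with ATG.

```python
from itertools import product

# Amino acids of NCBI translation table 11, with codons enumerated in the
# conventional base order T, C, A, G (third position varying fastest).
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), _AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
excerpt = "ATGACATCGA ATTTCAACGT GGAAGCGGGC"
print(translate(excerpt))  # prints: MTSNFNVEAG
```

The translated excerpt matches the first block of the protein sequence listed above. Passing the full gene sequence to gc_content and translate would allow the same comparison against the GC content and Protein sequence fields.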