Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3852 |
Symbol | |
ID | 5077463 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009426 |
Strand | - |
Start bp | 20420 |
End bp | 21451 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640480961 |
Product | 4-hydroxy-2-ketovalerate aldolase |
Protein accession | YP_001165623 |
Protein GI | 146275462 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0119] Isopropylmalate/homocitrate/citramalate synthases |
TIGRFAM ID | [TIGR03217] 4-hydroxy-2-oxovalerate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCTTTG ATCCAACATC TGATCGGCTC TACATCCAGG ACGTGACCTT GCGCGACGGG ATGCACGCCA TCCTGCATAT GTATGGCACC GACAGCGTCC GCACGATCGC TAAGGCGCTT GACGAGGCAG GGGTGGATGC GATCGAGGTC TCGCACGGCG ATGGCCTGAA CGGTTCGACC TTCAATTACG GATTTGGCGC CCATACCGAC TGGGACTGGA TCGAGGCTGC CGCCGACGTA ATCAAGAACG CGGTGCTGAC AACGCTTCTG GTGCCCGGTA TCGGCACCGC CGAAGAACTC AAGCGGGCCT ATTCGATGGG AGTGCGCTCG GTCAGGGTCG CGACCCACTG CACCGAGGCC GACGTCGGCA AGCAGCACAT CGGCATCGCG CGCGATCTAG GCATGGACGT ATCGGGCTTC CTGATGATGA GCCACATGAT CGAGCCCGAG GCGCTCGCCC AGCAGGCGCT GCTGATGGAA AGCTATGGCG CGCATTGCGT CTATGTCACC GACAGCGGCG GGGCGCTCGA CATGGATGGC GTGATTGCAC GTCTCCAAGC CTATGACCGG GTGCTCAAAC CTGAAACCCA ACGCGGCATC CATGCGCACC ACAATCTGTC CCTGGGCGTG GCCAATTCGA TCGTCGCCGC GCAAGCCGGC GCGGTGCGGA TCGACGCGAG CCTTGCCGGG ATGGGTGCGG GAGCCGGTAA CGCACCGCTC GAGGTGTTCA TCGCCGCCGC CAACCGCAAG GGCTGGAAGC ACGGCTGCGA CGTGATGGCG CTGATGGACG CGGCGGACGA CATCATACGC CCGCTTCAGG ACCGTCCGGT GAGGGTCGAT CGCGAGACGC TCAGCCTGGG CTATGCAGGC GTCTATTCAA GCTTCCTGCG CCATGCGGAA AAGGCAGCGG AACAGTACGG AATCGATACC CGCGAGATCC TCGTCGAATT GGGCAACCGC CGGATGGTCG GAGGCCAGGA AGACATGATC ATCGACGTTG CACTTGATCT GATCAAAGCC AAGGCGAACT GA
|
Protein sequence | MTFDPTSDRL YIQDVTLRDG MHAILHMYGT DSVRTIAKAL DEAGVDAIEV SHGDGLNGST FNYGFGAHTD WDWIEAAADV IKNAVLTTLL VPGIGTAEEL KRAYSMGVRS VRVATHCTEA DVGKQHIGIA RDLGMDVSGF LMMSHMIEPE ALAQQALLME SYGAHCVYVT DSGGALDMDG VIARLQAYDR VLKPETQRGI HAHHNLSLGV ANSIVAAQAG AVRIDASLAG MGAGAGNAPL EVFIAAANRK GWKHGCDVMA LMDAADDIIR PLQDRPVRVD RETLSLGYAG VYSSFLRHAE KAAEQYGIDT REILVELGNR RMVGGQEDMI IDVALDLIKA KAN
|
| |