Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_0991 |
Symbol | |
ID | 3915773 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1034325 |
End bp | 1035698 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640443725 |
Product | 3-deoxy-D-arabinoheptulosonate-7-phosphate synthase |
Protein accession | YP_496270 |
Protein GI | 87199013 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCAAAGA ACTGGACTGC GGATTCGTGG CGCGGGCATG AGGGGCGGCA TCTGCCGACC TATCCGGACG CCGAAAAGCT GAACGAGGTC GAGACGACCC TGACGAGCTT TCCGCCGCTG GTTTTTGCCG GCGAGGCGCG CGCGCTCAAG GCCGATCTCG CGCAAGTGGC TGCGGGCAAG GGTTTCCTGC TCCAGGGCGG GGACTGCGCC GAAAGCTTCG CCGAGTTCCA TCCGAACAAC ATTCGCGACA CGTTCCGCGT GCTGCTGCAG ATGGCGGTCG TGCTGACGTT CGCCAGCAAG CAGCCGGTGG TGAAGGTTGG CCGCATGGCC GGCCAGTTCG CCAAGCCGCG TTCGTCGCCG GTGGAAAAGA TCGGTGACGT CGAACTGCCG AGCTACCTTG GCGACAATAT CAACGGCATC GATTTCACCC CCGAGTCGCG GATTCCCGAT CCCGAGCGCA TGCTGCGCGC CTACAGCCAG GCGGCGGCGA CGCTGAACCT GCTGCGCGCT TTCGCGGGTG GCGGCTATGC GAACCTGCGC CAGGTCCACC AGTGGACGCT CGACCACATC GGCAAGAGCC CCTGGGCGGC GAAGTTCAGC GAGATGGCCG ACAAGATCGG CGAGTCTCTC GACTTCATGG AAGCCTGTGG CGTCGATCCC AGCACCGTCC CGCAGCTTCA GGGCACGAGC TTCTACACTA GCCACGAAGC GCTGCTGCTG CAGTACGAAG AAGCGATGAC GCGCCAGGAT TCGCTGACCG GCGAATGGTA CGACACCAGC GCGCACATGC TCTGGATCGG CGACCGTACT CGCTTCGAAG GATCGGCCCA CGTCGAATAC CTGCGCGGCG TCGGCAATCC CATCGGCATG AAGTGCGGTC CTTCGCTGAC GCCTGACGCC CTGCTGCGCA TGCTGGATAC GCTGAACCCC GGTCGCGAGG CCGGTCGCAT CACGCTTATC AGCCGCTTCG GGCACGACAA GGTCGAAGCC GGCCTGCCGC CTCTGGTGCG TGCCGTAACG CGCGAGGGGC ACCCGGTAGT GTGGTCGTGC GATCCGATGC ACGGCAACGT GATCAAGGCC GACAATGGCT ACAAGACGCG TCCGTTCGAC CGCATCCTGA CCGAGGTGAA GGGCTTCTTC GCGGTGCATC GCGCCGAGGG AACCCACGCT GGTGGCATCC ACATCGAGAT GACCGGCCGC GACGTGACCG AGTGCACCGG CGGTGCGGTG GCGATCACCC AGGAAGGCCT GGCGGACCGT TACCACACCC ATTGCGACCC GCGCCTCAAC GCGGCGCAGT CGATCGAGCT CGCTTTCCTG ATGGCCGAAG CGCTCAACCA GGAACGCGCG GAGCGCAAGG CCGAAGCGGC CTGA
|
Protein sequence | MAKNWTADSW RGHEGRHLPT YPDAEKLNEV ETTLTSFPPL VFAGEARALK ADLAQVAAGK GFLLQGGDCA ESFAEFHPNN IRDTFRVLLQ MAVVLTFASK QPVVKVGRMA GQFAKPRSSP VEKIGDVELP SYLGDNINGI DFTPESRIPD PERMLRAYSQ AAATLNLLRA FAGGGYANLR QVHQWTLDHI GKSPWAAKFS EMADKIGESL DFMEACGVDP STVPQLQGTS FYTSHEALLL QYEEAMTRQD SLTGEWYDTS AHMLWIGDRT RFEGSAHVEY LRGVGNPIGM KCGPSLTPDA LLRMLDTLNP GREAGRITLI SRFGHDKVEA GLPPLVRAVT REGHPVVWSC DPMHGNVIKA DNGYKTRPFD RILTEVKGFF AVHRAEGTHA GGIHIEMTGR DVTECTGGAV AITQEGLADR YHTHCDPRLN AAQSIELAFL MAEALNQERA ERKAEAA
|
| |