Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sala_2212 |
Symbol | |
ID | 4080170 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingopyxis alaskensis RB2256 |
Kingdom | Bacteria |
Replicon accession | NC_008048 |
Strand | + |
Start bp | 2322722 |
End bp | 2324095 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 638010590 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_617254 |
Protein GI | 103487693 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0659111 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0554404 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAAAA ACTGGCAACC GCATAGCTGG CGCACGCACG AGGCCCGCCA GCTGCCCACC TATCGCGACG CCGATGCACT CGCGGCCGCC GAACGCGAGC TGGCGAACTA TCCGCCGCTC GTCTTTGCGG GCGAGGCGCG CGAACTGACG AACGAACTCG CGCGCGTCGC GGAGGGCAGG GCGTTCCTGC TCCAGGGCGG CGACTGCGCC GAAAGCTTTG CCGAGTTCCA CCCGAACAAT ATCCGCGACA CATTTCGTGT GCTGCTCCAG ATGGCGGTGG TGCTGACCTT CGCCTCGAAA ATGCCCGTGG TAAAGCTCGG CCGCATGGCG GGGCAGTTCG CCAAGCCGCG TTCGGCGGAC ATGGAAGAGG TGGATGGCGT CGCGCTGCCC AGCTATCGCG GCGACATCAT CAACGACATC GCGTTCGAGG AAGCCGGCCG CGAGCCCGAC CCCGCGCGGA TGGTCAAGGC CTACAACCAG TCGGCGGCGA CGCTCAACCT GCTGCGCGCT TTCGCGGGCG GTGGCTATGC CAATCTGCAC CAGGTCAACG CCTGGACGCA CGACTTCATG GACCGCAGCC CGTGGGCGAA GAAATATCAG GAAACCGCAG GCCGCATTTC CGAAGCGCTC GCCTTCATGG AAGCGTGCGG CGTGACGCCC GAAACGGTTC CGCAGATCAA GGGCACCAGC TTCTACACCA GCCATGAGGC GCTGCTCCTC CCCTATGAGC AGGCGCTGAC CCGGCAGGAC AGCCTGACCG GCGGCTGGTA CGACACATCG GGCCACTTCC TGTGGGTCGG CGACCGCACC CGTTTCGAAG GATCAGCGCA TATCGAATAT CTCCGCGGGA TCGGCAATCC GGTCGGCATG AAATGCGGAC CGAGCCTCGA ACCCGACGTG CTGCTGCGCC TGCTCGACAC GCTGAATCCC AACCATGTGC CCGGCCGCAT GACGCTCATC ACGCGCTATG GCCACGACAA GATCGAGGCG CATCTGCCCA GGCTCGTGCG CGCGGTGAAG GAATCGGGCC ATCCCGTCGT CTGGTCGTGC GACCCGATGC ACGGCAATGT CATCAAGACC TCGACCGGCT ACAAGACGCG CCCGTTCGAG CGCATCCTCG CCGAAGTGCG CGGCTTCTTC GCCGTCCACC GCGCCGAGGG CACGCATGGC GGCGGCATCC ATATCGAGAT GACCGGCCAG AATGTCACCG AATGCACCGG CGGCGCGATG GACGTGACCC AGATGGACCT TGCCGACCGC TATCACACGC ATTGCGACCC GCGTTTGAAT GCGGGGCAGA GCCTCGAACT CGCCTTCCTG CTGGCGGAGA TGCTCAATCA GGAAATGAGC GAGCGGGCGA AGCAGGCGGC GTAA
|
Protein sequence | MTKNWQPHSW RTHEARQLPT YRDADALAAA ERELANYPPL VFAGEARELT NELARVAEGR AFLLQGGDCA ESFAEFHPNN IRDTFRVLLQ MAVVLTFASK MPVVKLGRMA GQFAKPRSAD MEEVDGVALP SYRGDIINDI AFEEAGREPD PARMVKAYNQ SAATLNLLRA FAGGGYANLH QVNAWTHDFM DRSPWAKKYQ ETAGRISEAL AFMEACGVTP ETVPQIKGTS FYTSHEALLL PYEQALTRQD SLTGGWYDTS GHFLWVGDRT RFEGSAHIEY LRGIGNPVGM KCGPSLEPDV LLRLLDTLNP NHVPGRMTLI TRYGHDKIEA HLPRLVRAVK ESGHPVVWSC DPMHGNVIKT STGYKTRPFE RILAEVRGFF AVHRAEGTHG GGIHIEMTGQ NVTECTGGAM DVTQMDLADR YHTHCDPRLN AGQSLELAFL LAEMLNQEMS ERAKQAA
|
| |