Gene Information |
Locus tag | Dtpsy_0020 |
Symbol | |
ID | 7382711 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidovorax ebreus TPSY |
Kingdom | Bacteria |
Replicon accession | NC_011992 |
Strand | + |
Start bp | 19509 |
End bp | 20609 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643653338 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_002551509 |
Protein GI | 222109245 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
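
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


# Values taken from the record: Start bp 19509, End bp 20609.
length = gene_length(19509, 20609)
residues = protein_length(length)
print(length, residues)  # 1101 366
```

Under these assumptions the computed values match the listed Gene Length (1101 bp) and Protein Length (366 aa).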
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0655801 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCTGA CCGAACGCCT CGCATCCCCG CTCACCACCC ACGACACCAC GCGCATCGAC GACCTGCGCA TCAAAGCCGT GCGCCCGCTC ATCACGCCCG CGCTGCTGCA GGAATGGCAG CCCGCGCCCG AGGCGGCGCA GACGCTGGTG GAAAGCAGCC GCGTGGCCAT CTCGCGCGTG CTGCACGGGC AGGACGACCG CCTCATCGTC GTGGTGGGGC CGTGCTCCAT CCACGACCAT GACCAGGCCA TGGAGTACGC GCGCCTGTTG AAGGAGCAGG CGGATGCGTT GGCGCAGGAC CTGCTCATCG TGATGCGCGT GTATTTCGAG AAGCCCCGCA CCACCGTGGG CTGGAAGGGC TACATCAACG ACCCGCACCT GGACGGCAGC TTCGCCATCA ACGAAGGCCT GGAGCGCGCC CGCGCATTGC TGCTGGACGT GCTGGCGCTG GGCCTGCCCG TGGGCACAGA GTTCCTGGAC CTGCTGTCGC CGCAGTTCAT CAGCGACCTG GTGAGCTGGG GCGCCATCGG CGCGCGCACC ACCGAAAGCC AGAGCCACCG GCAACTGGCC AGCGGCCTGT CCTGCCCCGT GGGCTTCAAG AACGGCACGG ATGGCGGCGT GAAGGTGGCG GCCGACGCCA TCCAGGCCGC GCAGGCACCG CATGCCTTCA TGGGTATGAC CAAGATGGGG CAGGCGGCGA TCTTCGAGAC GCGCGGCAAC CACGACTGCC ATGTGATCCT GCGCGGCGGC AAGGCCCCGA ACTACGGCGC GGCCGACGTG CAGGCCGCCT GCGAGATGCT GGGGAAGGCC GGCCAGCGCC CGCAGGTGAT GATCGACCTG TCGCATGCCA ACAGCAGCAA GCAGCACCGC CGCCAGATCG ACGTGGCGCA GGACGTGGCC CAGCAGATCG CCGCGGGCGA TGCACGCATC ACCGGCGTGA TGATCGAGAG CCACCTGCAG GAGGGCCGCC AGGACATCGT GGACGGCCAG CCGCTCACAC CCGGCGTGTC GGTCACCGAC GCTTGCATCA GCTTCGCACA GACCGTGCCG GTACTGCACC AGTTGGCTGC GGCCGTGCGC GAACGCCGCA CGCGCGGCTG A
|
Protein sequence | MTLTERLASP LTTHDTTRID DLRIKAVRPL ITPALLQEWQ PAPEAAQTLV ESSRVAISRV LHGQDDRLIV VVGPCSIHDH DQAMEYARLL KEQADALAQD LLIVMRVYFE KPRTTVGWKG YINDPHLDGS FAINEGLERA RALLLDVLAL GLPVGTEFLD LLSPQFISDL VSWGAIGART TESQSHRQLA SGLSCPVGFK NGTDGGVKVA ADAIQAAQAP HAFMGMTKMG QAAIFETRGN HDCHVILRGG KAPNYGAADV QAACEMLGKA GQRPQVMIDL SHANSSKQHR RQIDVAQDVA QQIAAGDARI TGVMIESHLQ EGRQDIVDGQ PLTPGVSVTD ACISFAQTVP VLHQLAAAVR ERRTRG
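
The sequences above are printed in space-separated blocks of 10 characters. A minimal sketch of how such a string might be joined back together and its GC fraction computed is shown below. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example runs only on the first two blocks of the gene sequence, so its result is not the whole-gene GC content listed in the record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence from this record.
fragment = clean_sequence("ATGACCCTGA CCGAACGCCT")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.6
```

Applying the same two functions to the full gene sequence string gives a value that can be compared with the GC content field.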