Gene Information |
Locus tag | TM1040_0286 |
Symbol | |
ID | 4077421 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 291466 |
End bp | 292560 |
Gene Length | 1095 bp |
Protein Length | 364 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 638005580 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_612281 |
Protein GI | 99080127 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
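The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. Both assumptions are inferred from the numbers in this record, not stated by it, and the function names are illustrative.

```python
# Values copied from the record above.
START_BP = 291466
END_BP = 292560


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int, stop_codons: int = 1) -> int:
    """Residue count for an unspliced CDS.

    Assumes the CDS length is a whole number of codons and that
    `stop_codons` terminal codons are not translated into residues.
    """
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - stop_codons


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1095, matches "Gene Length"
print(protein_length(length_bp))  # 364, matches "Protein Length"
```

Because the gene is on the + strand and is not spliced, no reverse-complement or exon joining is needed for this check.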
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.969107 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACCC TCTCTCAGAC ACCGACCCAT GATCTGCGCA TCACCGACAT GCAGGAATTG ATCTGCCCCG AAGCCCTCGC GGTGAAACAC CCGCTGACAG ATGCCGCGCG CGAAACCGTG CTCTCGGCGC GCGCCAGCAT CCAGAAGATC CTGCATGGCG CCGACGACCG GCTGGTCGTC GTGGTCGGCC CCTGTTCGAT CCACGACCCG GAGGCCGCGC TGGACTATGC GCGCCGTCTT GCGCCGCTGC GCGCCGAGCT GGGCGATGCG CTTGAGATCG TGATGCGGGT CTACTTTGAA AAACCGCGCA CCATCGCGGG CTGGAAGGGG CTGATCAACG ACCCCAACCT TGATGGGTCT TTCCGCATCA ACAAGGGGTT GTCGGTCGCC CGCAAGCTCT GCCTGGATCT GAGCGAAATG GGCCTGCCCG TGGGGACCGA ATTCCTCGAT GCCTCGGTGC CGCAATACAT CAGTGATCTG GTGAGCTGGG CCGCGATTGG CGCCCGCACC ACCGAGAGCC AGATCCACCG CGAAATGGCT TCGGGCCTGA GCTGCCCGGT GGGCTTCAAG AACGGCACCC GCGGCAATGT GCAGATCGCC ATTGACGCGG TGCGCTCGGC GGCCACACCG CATCATTTCA TGGCGCTGGC CCCCTCGGGT CTCGCGGCGA TTGCGGCGAC GGCCGGAAAC CCGGATTGCC ACATCATCCT GCGCGGCGGC GGTGGTACCA ACTTTGATGC CGAGAGCGTG GATTCAGCCT GCAAAAAGGC CGAAGCCGAT GGCATCCGTC CGCAGGTGAT GATCGACGCA AGCCACGCCA ACTCTGCCAA GGATCCCGCC AAACAGCCCG AGGTGCTCTC GGATGTGGCC GGCCAGATGG CACAGGGTGA GACCCGCATC ACCGGCATCA TGATCGAAAG CCACCTCGAA CAGGGTCGTC AGGATCTGCC CAAGGACGGG GACCTGTCGA AACTCACCTA TGGTCAGTCG ATCACCGACG GCTGCATCGG CTGGGAGCAA ACCGAGGCCG AGCTGCGCAA ACTGGCCCAG GCGGTCAAAA CACGCCGCAC GCAGGGCGCC CGCCTGGCGG GTTGA
|
Protein sequence | MTTLSQTPTH DLRITDMQEL ICPEALAVKH PLTDAARETV LSARASIQKI LHGADDRLVV VVGPCSIHDP EAALDYARRL APLRAELGDA LEIVMRVYFE KPRTIAGWKG LINDPNLDGS FRINKGLSVA RKLCLDLSEM GLPVGTEFLD ASVPQYISDL VSWAAIGART TESQIHREMA SGLSCPVGFK NGTRGNVQIA IDAVRSAATP HHFMALAPSG LAAIAATAGN PDCHIILRGG GGTNFDAESV DSACKKAEAD GIRPQVMIDA SHANSAKDPA KQPEVLSDVA GQMAQGETRI TGIMIESHLE QGRQDLPKDG DLSKLTYGQS ITDGCIGWEQ TEAELRKLAQ AVKTRRTQGA RLAG
|
| |
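The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is illustrative only: it uses just the first 30 bases and the last 6 bases from the record, and a hand-built codon table. Translation table 11 assigns the same amino acid to each codon as the standard code (the two differ in permitted start codons), so the standard assignments are used here. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the blanks that group the sequence in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, keeping '*' for stop codons."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last 6 bases of the gene sequence in this record.
head = "ATGACCACCC TCTCTCAGAC ACCGACCCAT"
tail = "GGTTGA"

print(translate(head))  # MTTLSQTPTH, the first ten residues of the protein
print(translate(tail))  # G*, the final glycine followed by the stop codon
```

Passing the full 1,095-base gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the reported GC content.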