Gene Information |
Locus tag | TM1040_1269 |
Symbol | |
ID | 4077663 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 1366861 |
End bp | 1368231 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638006577 |
Product | 3-deoxy-D-arabinoheptulosonate-7-phosphate synthase |
Protein accession | YP_613264 |
Protein GI | 99081110 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II |
|
|
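The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(1366861, 1368231)
print(length_bp, protein_length(length_bp))  # 1371 456
```

Both results agree with the Gene Length and Protein Length fields listed in the record.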
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGACT GGCAAAAAAC GAACTGGCGC AGCAAGCCGC GCGTACAGAT GCCTGACTAT ACCGATCAGG CCGCGCTGCA GGCTGTTGAG GCGCAGCTGG CCAAGTATCC GCCCCTGGTT TTTGCCGGTG AATCCCGTCG CCTGAAGGCG CAACTGGGTG CTGCCGGGCG CGGCGAGGCC TTCTTGCTGC AAGGCGGCGA CTGCGCCGAG AGCTTTGAGC AGTTCAGCGC AGACGGCATC CGCGACACCT TCAAGGTGAT GTTGCAGATG GCCATGGTGC TGACCTATGG CGCCAAGGTG CCGGTGGTCA AAGTGGGCCG CATGGCCGGT CAATTTGCCA AACCCCGCTC GGCGCCGACC GAGACCGTTG ATGGGGTCGA ACTGCCGAGC TATCGCGGTG ACATCATCAA CGAGCTGGCC TTTACACCCG AAGCGCGTAT TCCCGATCCG CGCAAAATGC TGCAGGCCTA TACCCAGGCG GCGGCGACGT TGAACCTGAT CCGCGCCTTC TCGACCGGCG GCTATGCGGA TGTGCATCAG GTCCACGCCT GGACCTTGGG GTTCACCGAA GGCGACAAGG CCGAAGCCTA TCGGGATATG GCCAATCGGA TCACCGATAC GCTCGACTTC ATGAAAGCCG CCGGTGTGAC CGCCGACAAT GCTCATACGC TGCAGACGGT GGAATTCTAC ACCAGCCATG AAGGTCTGTT GCTGGAGTAT GAAGAGGCGC TGACACGTCT CGACTCGACT TCCGGCAAAT GGCTTGCGGG TTCGGGTCAC ATGATCTGGA TCGGGGACCG CACACGCCAG CCCGATGGCG CGCATGTGGA ATTCTGCAGC GGGGTCTTGA ACCCGATCGG TCTCAAATGT GGTCCGACCA CCACAGCTGA CGACCTCAAG GTCCTGATGC AAAAGCTCAA TCCTGAAAAC GAAGAAGGCA AGCTGACGCT GATCGCCCGC TTTGGCGCCG GCAAAGTTGC AGACCATCTG CCACGCCTCA TTCAGGCCGT GAAGGACGAA GGGGCCAATG TCACCTGGGT CTGTGATCCG ATGCATGGCA ACACCATCAA ATCCGCCAGC GGCTACAAGA CCCGTCCGTT TGACTCGGTG CTGCGCGAAG TGCGTGATTT CTTTGGTGTG CATCAGGCCG AAGGAACGAT CCCCGGTGGC GTTCACTTTG AGATGACGGG CCAGGATGTC ACCGAATGCA CCGGCGGCGT GCGCGAAGTG ACCGACGAGG ACCTGAGCGA TCGCTACCAC ACCGCCTGCG ATCCCCGTCT CAACGCGGAT CAATCGCTGG AACTGGCGTT TCTTGTCGCA GAAGAACTGT CGCGCCTGCG CACACCGGAC GACACGCGCG CCGCGATCTG A
|
Protein sequence | MSDWQKTNWR SKPRVQMPDY TDQAALQAVE AQLAKYPPLV FAGESRRLKA QLGAAGRGEA FLLQGGDCAE SFEQFSADGI RDTFKVMLQM AMVLTYGAKV PVVKVGRMAG QFAKPRSAPT ETVDGVELPS YRGDIINELA FTPEARIPDP RKMLQAYTQA AATLNLIRAF STGGYADVHQ VHAWTLGFTE GDKAEAYRDM ANRITDTLDF MKAAGVTADN AHTLQTVEFY TSHEGLLLEY EEALTRLDST SGKWLAGSGH MIWIGDRTRQ PDGAHVEFCS GVLNPIGLKC GPTTTADDLK VLMQKLNPEN EEGKLTLIAR FGAGKVADHL PRLIQAVKDE GANVTWVCDP MHGNTIKSAS GYKTRPFDSV LREVRDFFGV HQAEGTIPGG VHFEMTGQDV TECTGGVREV TDEDLSDRYH TACDPRLNAD QSLELAFLVA EELSRLRTPD DTRAAI
|
| |
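The gene and protein sequences can be checked against each other by translating the nucleotide sequence. The sketch below is an illustration under stated assumptions: it builds the codon-to-amino-acid mapping shared by the standard code and translation table 11 (the two differ only in permitted start codons, which are not modelled here), strips the spaces that separate the 10-base groups, and translates only the first 30 bases of the gene sequence. The helper names are hypothetical.

```python
from itertools import product

# Amino acids for codons enumerated in T, C, A, G order at each position
# (the ordering used in the NCBI genetic code tables); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence listed in this record.
prefix = "ATGAGCGACT GGCAAAAAAC GAACTGGCGC"
print(translate(prefix))  # MSDWQKTNWR
```

The output matches the first ten residues of the protein sequence. Because the gene sequence as listed already begins with ATG, it appears to be given on the coding strand, so no reverse complement is applied here even though the Strand field is "-".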