Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Strop_3571 |
Symbol | |
ID | 5060046 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora tropica CNB-440 |
Kingdom | Bacteria |
Replicon accession | NC_009380 |
Strand | + |
Start bp | 4091314 |
End bp | 4092402 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640475826 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001160380 |
Protein GI | 145596083 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.534041 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCACCT CCGAGACGAA CCGGATCAGC GACCAGCGGA TCGACCGTGT GGTGCCGCTG ACCACCCCGG CACTGCTACA CCACGAGCTG CCCCTGGACA GTTCGCTCAC CTCGGCCGTA CTCACTGGCA GACGGGCCGT CGGCCGAGTC TTGGACCGCG CCGACGACCG CCTCCTGGTG GTAGTCGGCC CCTGTTCGGT ACATGACCCG GTCGCCGCCC TCGCCTACGC TCACCGGCTC CGCGAGCTCG CCGATCGGCT CGCCGACGAC CTACTCGTGG TGATGCGGGT CTACTTCGAG AAGCCGCGCT CAACCGTCGG CTGGAAGGGG CTTATCAACG ACCCGGGACT GGACGGTTCC GGTGATGTGA ACACGGGCCT GCGCCGGGCC CGTGCGCTCC TGATCGACGT GCTGCGCCTG GGCCTCCCGG TCGGTTGCGA GTTCCTGGAC CCGATCACCC CGCAGTACAT CGCCGACACG GTGGCCTGGG GCGCGATCGG CGCCCGGACC GTGGAGAGCC AGGTGCACCG CCAGCTCGCC TCCGGCCTGT CGATGCCGAT CGGAATGAAG AACCGTCCCG ACGGCAGCAT CTCCACCGCG ACGGACGCCA TCCGGGCGGC CGGCGTGCCG CACGTCTTCC CCGGCATCGA CGCCTCCGGC ACCCCAGCGA TCATGCACAC CCGCGGTAAC ACCGACGGCC ACCTGGTGCT GCGCGGTGGC GGCAACCAGC CGAACTACGA CGCCGAATCG GTGACGGACG CGCTGGCGCT GCTGCGCGAC GCCGGGCTTC CCGAACGGCT GGTCATCGAC GCCAGCCACG CCAACAGCGG CAAGGACCAC CGTAACCAGC CGCTCGTCGC CGCCGACGTG GCCGCCCAAC TCGCCGGAGG CCAGCACGGC ATCGTCGGCG TCATGCTGGA GAGCTTCCTG CTCGCAGGTC GGCAGGACCT GGACCCGACC CGCGAGCTGA CCTACGGGCA GTCGATCACC GACGCCTGTA TCGGCTGGGA CACCACCGAG GAGGTCCTGG CCGACCTGGC CGCCGCCGTG CGCACCCGAC GGCGGGCTCC GGCCGTCACC CCTGTCTGA
|
Protein sequence | MTTSETNRIS DQRIDRVVPL TTPALLHHEL PLDSSLTSAV LTGRRAVGRV LDRADDRLLV VVGPCSVHDP VAALAYAHRL RELADRLADD LLVVMRVYFE KPRSTVGWKG LINDPGLDGS GDVNTGLRRA RALLIDVLRL GLPVGCEFLD PITPQYIADT VAWGAIGART VESQVHRQLA SGLSMPIGMK NRPDGSISTA TDAIRAAGVP HVFPGIDASG TPAIMHTRGN TDGHLVLRGG GNQPNYDAES VTDALALLRD AGLPERLVID ASHANSGKDH RNQPLVAADV AAQLAGGQHG IVGVMLESFL LAGRQDLDPT RELTYGQSIT DACIGWDTTE EVLADLAAAV RTRRRAPAVT PV
|
| |