Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3950 |
Symbol | |
ID | 5708221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 4493817 |
End bp | 4494905 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641273375 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001538731 |
Protein GI | 159039478 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.178604 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.35096 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCACCT CGGAGATGGA CCGGATCAGC GATCAGCGGA TCGATCGTGT GGTGCCGCTG ACCACCCCGG CCCTGTTACA CCACGAGCTG CCCCTGAACG ATCGGCTCAC CTCGGCCGTA CTCACTGGCA GACGGGCTGT CGGCCGGGTG CTGGACCGCG CCGACGACCG CCTCCTGGTG GTGGTCGGCC CGTGTTCGGT ACACGATCCG GCCGCCGCCC TCGACTACGC CCACCGGCTC CGCGAGGTCG CCGGTCGGCT CGCCGACGAC CTGCTTGTGG TGATGCGGGT CTACTTCGAG AAGCCGCGCT CGACCGTGGG CTGGAAGGGG CTCATCAACG ATCCCGGGCT GGACGGTTCC GGTGATGTGA ACACCGGCCT GCGTCGGGCC CGCGCGCTGC TGATCGACGT GCTGCGCCTG GGTCTCCCGG TCGGATGCGA GTTCCTGGAC CCGATCACCC CGCAGTACAT CGCCGACACG GTGGCCTGGG GTGCGATCGG CGCCCGGACC GTGGAGAGCC AGGTGCACCG CCAGCTCGCC TCCGGCTTGT CGATGCCGAT CGGGATGAAG AACCGCCCCG ACGGCAGCAT CTCCACCGCG GTGGACGCGA TCCGGGCGGC CGGCGTGCCA CACGTGTTCC CCGGCATCGA CATCTCCGGC ACCCCAGCGA TCATGCACAC CCGAGGCAAC GCGGACGGTC ACCTGGTGCT GCGCGGTGGT GGCAACCGCC CGAACTACGA CGCGAAGTCG GTGGCGGACG CGCTCGCGCT GCTGCGCGCC GACGGGCTGC CCGAGCGGCT GGTGATCGAC GCCAGCCATG CCAACAGCGG CAAGGACCAC CGGAACCAGC CGCTCGTCGC CGCCGACGTG GCCGCCCAAC TCGCCGGGGG CCAGCACGGC ATCGTCGGCA TCATGCTGGA GAGCTTCCTG CTGTCGGGTC GGCAGGGCCT GGACCCGACC CGCGAGCTGA CGTACGGGCA GTCGATCACC GATGCCTGCA TCGGCTGGGA CACCACGGAA GAGGTGCTGG CCGACCTGGC AGCCGCCGTG CGCACCCGCC GTCGGGCTCC GGCCGTCACC CCCGCCTGA
|
Protein sequence | MTTSEMDRIS DQRIDRVVPL TTPALLHHEL PLNDRLTSAV LTGRRAVGRV LDRADDRLLV VVGPCSVHDP AAALDYAHRL REVAGRLADD LLVVMRVYFE KPRSTVGWKG LINDPGLDGS GDVNTGLRRA RALLIDVLRL GLPVGCEFLD PITPQYIADT VAWGAIGART VESQVHRQLA SGLSMPIGMK NRPDGSISTA VDAIRAAGVP HVFPGIDISG TPAIMHTRGN ADGHLVLRGG GNRPNYDAKS VADALALLRA DGLPERLVID ASHANSGKDH RNQPLVAADV AAQLAGGQHG IVGIMLESFL LSGRQGLDPT RELTYGQSIT DACIGWDTTE EVLADLAAAV RTRRRAPAVT PA
|
| |