Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1254 |
Symbol | |
ID | 5703482 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 1450544 |
End bp | 1451884 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641270769 |
Product | 3-deoxy-7-phosphoheptulonate synthase |
Protein accession | YP_001536150 |
Protein GI | 159036897 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000167974 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACACCGT TTCCGGGCCG GTCGGCCGCG TCAGTGGGCC CGGCCCACAC CCACTCGGTG GACCGGCTGC TGACCCGCCG GCTGGACGAG GCACTGGCTC GCCCGGCCGC CCAACAGCCG TGCTGGCCGG ACCCCGAGCG GGCCAGCGCC GTCATCGAGC GGCTGCGACA CGCCGAGCCG ATCGTGGCCC CGGACGAGAC GGCGCGGTTG TCCGAGCAGC TCGCGTCGGT CGCGCGCGGT GAGGCGTTCC TGCTGCAGGG CGGCGACTGC GCCGAGACCT TCGTCGACAA CACCGAGACA CACCTGCGAG CCAACCTACG GATCCTGTCG CAGATGGCAA CCGTGCTGAC CTCCGCCACC GGCGTGCCGG TGGTCGAGAT CGCGCGGATG GCCGGACAGT ACGCCAAGCC CCGCTCCGCT GTGGTGGACG CGTCGGGTCT TCCGGTCTAC CGCGGCGACA TCATCAATTC GACGGAGCCG ACACCGACCG CCAGAACCCC CGACCCGCAA CGGATGCTGC GTGCCCGCGC GCATGCGGCC GACGCGATGG ACATGGTCCG CAGAATCCGC GCGGGTGCGG CCCACATCAG CCACGAGATG CTTCTGCTCG ACTACGAACG GGCCGGGCTC TGGGCCGATG CCTGCGGCCC CGGCACGCGG CTGACCAGCG GGCTCGCCCA CTTCCTGTGG ATCGGCGAGC GTACCCGTCA GCTCGACGGC GCCCATGTCG CGGTCGCCGA GCTGATCGCC AACCCGATCG GTCTCAAGTT GGGGCCGAGC GTGACACCGG AACTGGCCGT CGAGTATGTG GAGCGCCTCG ACCCGCACTG CACACCGGGA CGGCTGACAT TGGTGAGCCG GATGGGACAT CACCTGGTCC GCGATGTCCT GCCACCGATC GTGGAGAAGG TCACCGCTTC CGGACATCAG GTGGTCTGGC AGTGCGACCC GATGCACGGT AATACCCGGC AGTCGGCAAA CGGCTTCAAG ACTCGGCACG TCGACCATGT GGTCGACGAG CTCGCCGGAT TCTTCGAGGT ACACCGTGGC CTCGGCACCC ACCCTGGTGG TATTCACGTC GAGGTGACCG GAGAGGACGT GACGGAATGC CTCGGTGGTG CCTCGGGGAC CGCGGAGCGC GACCTGCCCG CCCGCTACCG GACCGCGTGC GACCCACGGC TCAACGCTGA TCAGTCGCTC GAGTTGGCCC GCTTCGTCGC GGAGCTGCTG ACGACCGTCG TCCGGGCGCC CACGCGAGGG CGTGCTGGCC GAAACCTCGA CCTTGAGCGC ACCACACCAC CCGCCGGTAG GCCGTCCCAG GTGCCCGTCG CCAGACGGTG A
|
Protein sequence | MTPFPGRSAA SVGPAHTHSV DRLLTRRLDE ALARPAAQQP CWPDPERASA VIERLRHAEP IVAPDETARL SEQLASVARG EAFLLQGGDC AETFVDNTET HLRANLRILS QMATVLTSAT GVPVVEIARM AGQYAKPRSA VVDASGLPVY RGDIINSTEP TPTARTPDPQ RMLRARAHAA DAMDMVRRIR AGAAHISHEM LLLDYERAGL WADACGPGTR LTSGLAHFLW IGERTRQLDG AHVAVAELIA NPIGLKLGPS VTPELAVEYV ERLDPHCTPG RLTLVSRMGH HLVRDVLPPI VEKVTASGHQ VVWQCDPMHG NTRQSANGFK TRHVDHVVDE LAGFFEVHRG LGTHPGGIHV EVTGEDVTEC LGGASGTAER DLPARYRTAC DPRLNADQSL ELARFVAELL TTVVRAPTRG RAGRNLDLER TTPPAGRPSQ VPVARR
|
| |