Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Strop_1036 |
Symbol | |
ID | 5057482 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora tropica CNB-440 |
Kingdom | Bacteria |
Replicon accession | NC_009380 |
Strand | - |
Start bp | 1175301 |
End bp | 1176692 |
Gene Length | 1392 bp |
Protein Length | 463 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640473305 |
Product | 3-deoxy-7-phosphoheptulonate synthase |
Protein accession | YP_001157888 |
Protein GI | 145593591 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.479807 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.750222 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTATGGAA CGCGCATGCG ACCCGGCGTG GAGCGGTCGA TCCGCCATTC GCCACCAGCG ACGAACGATG GCGACAGAGA ACTCTGCCTA GACCGGTGGC GCGAGCTACC CCGGCGGCAG GTGCCACCCT GGCCCGATCC CGCCGAGGTG GCAGCAGTGT GCGCGACACT CGGCAAAATG CCGCCGATCG TCACACCCTA CGAGGTCGAT GAACTGCGTC ACCGCCTGGC CGAGGTATGC GAAGGACGCG CCTTCCTGTT ACAGGGCGGC GACTGCGCCG AGACGTTCAC CGGAAACACT GAGAGCCACC TGCTGGGCAC CACGCGGACG CTGCTGCAGA TGGCGATGGC GATAACGTAC GGCGGTTCCG TTCCGGTGGT GAAGGTGGCG CGCCTCGCCG GCCAGTACGG GAAGCCCCGA TCCTCGGCGA CCGATTCCCT GGGGTTGCCC GCGTACCGTG GCGATATTAT CAATGCGCGG CACCCGGCCG AGTCGGCGCG GGCCGCCGAT CCGCAGCGCA TGATCGATGC GTACGCGAAT TCCGCGGTCG CGATGAACCT CATCCGCGCC TATCCGCCGG ACGATCTGAC CGACCTCGAA GAGCTGTACG ACGATACCTA CGACCTCATC CGCGCTTCCC CGGCCGGAGC CCGGTACCAG GTCATCTCCG GCGAGATCGA CCGGGCGCGC GGCTTCGTCC GCGCGTGGGG GCCGAGCGAG CGCCACGCGT TGCGGGAATC AAAGGTGTAC TGCTCGCACG AGGCGCTGGT GCTCGAGTAC GACCGGGCGC TGACCCGAAT CAACGACGGC CGGGCATACG CGTTGTCGGG TCACTTTCTG TGGGTCGGCG AACGCACCCG CCAGCTTGAC CACGCGCACG TCGACTTTGT CGCGCGTATC GCCAACCCAA TCGGCGTGAA GCTCGGCCCG GCCGCCAGCC CGCATGCTGC GATCGAGCTG TGCGAGCGGC TCAACCCCGA GAACCTGCCC GGGCGGCTCA CCCTGATCAG CCGCATGGGC AACCGCCAGG TACGTGACGT GTTTCCGGCG ATCGTCGACA AGGTCACGGC CGCGGGCGCC AAGGTCGTTT GGCAGTGCGA TCCGATGCAC GGCAACACCG AGCAGTCCTC ACACGGCTTC AAGACCCGGC GCCTCGACCG GGTCGTTGAC GAGTTGCTGG GCTACTTCGA CGTGCACCGC AGTCTGGGCA CCCATCCGGG AGGCGTCCAC GTCGAGCTCA CCGGTGAGAA CGTCACCGAG TGCCTCGACG GGATACGTGG CGTCGAGGAC CAACACCTAC CGGATCGCTA CGAGACTGCC TGCGACCCGC GGCTGAACAT GCGGCAGAGC TTAGAACTCG CGTTGCTGGT CGCCGAGATT CTACGCGGCT GA
|
Protein sequence | MYGTRMRPGV ERSIRHSPPA TNDGDRELCL DRWRELPRRQ VPPWPDPAEV AAVCATLGKM PPIVTPYEVD ELRHRLAEVC EGRAFLLQGG DCAETFTGNT ESHLLGTTRT LLQMAMAITY GGSVPVVKVA RLAGQYGKPR SSATDSLGLP AYRGDIINAR HPAESARAAD PQRMIDAYAN SAVAMNLIRA YPPDDLTDLE ELYDDTYDLI RASPAGARYQ VISGEIDRAR GFVRAWGPSE RHALRESKVY CSHEALVLEY DRALTRINDG RAYALSGHFL WVGERTRQLD HAHVDFVARI ANPIGVKLGP AASPHAAIEL CERLNPENLP GRLTLISRMG NRQVRDVFPA IVDKVTAAGA KVVWQCDPMH GNTEQSSHGF KTRRLDRVVD ELLGYFDVHR SLGTHPGGVH VELTGENVTE CLDGIRGVED QHLPDRYETA CDPRLNMRQS LELALLVAEI LRG
|
| |