Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4099 |
Symbol | |
ID | 5706526 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 4658061 |
End bp | 4659362 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641273525 |
Product | 3-phosphoshikimate 1-carboxyvinyltransferase |
Protein accession | YP_001538880 |
Protein GI | 159039627 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase |
TIGRFAM ID | [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0307354 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGACATC TGACCGCTAC CCGCCCGCTG CGCCCGTGGA CCGCGCCGAC CGCCTCGGAT CCGGTCTCGA CGACGCTGCG CCTACCCGGC TCCAAGTCAC TCACCGCACG TGCCCTGGTG CTCAGCGGAC TGGCCACCGG CCCATCGACG CTCGCCAGGC CACTGCGCGC CCGAGACACC GAGCTGATGG CCGACGGGCT GCGGGCCATG GGTGTGCACA TGTCGATCAG CGACGACGAG CGCTGGTTGG TCCGACCGCA CCCACTGGCC GGACCGGCAC ACGTCGACGT CGGCCTGGCA GGCACGGTGA TGCGGTTCCT TCCCCCGGTG GCAGGCCTTG CGGACGGCCA GATCACCTTC GACGGTGACC CGCAGGCTCG ACTCCGTCCG CTTGGGCCGC TCCTGGACGC CCTGAGCCGC CTCGGCGTCC GGATCACCAC ACCACCCACC GGAAGCCTGC CGCTGACCGT GCTCGGTGGC GGACAGATCC GCGGGGGCGA GGTCGTGATC GACGCCAGCG CCTCCAGCCA ACTCGTCTCG GGGCTGCTGC TGGCCGCCCC GCACTTCGAC CGGGGCGTGG TCGTCCGACA CGTCGGCCCG CCGGTGCCCT CCGCGCCCCA CCTGCGGATG ACGGTGCACA TGTTGCGGTC CGCCGGCGCC GCCGTCGACG ACACCGCCCC CGACGTCTGG ACCGTTGAGC CGGGCCCGCT TGCTGGCCGC GGCTGGGAGA TCGAACCGGA CCTCTCCGGT GCGGTCCCCT TCTTCGCCGC CGCACTGGTC ACCGGCGGGG AGGTGACCGT CACCGGCTGG CCGGGGGGCA GCGTCCAGCC GGTCGAGCGG CTCCGCGGGC TGTTGCAGGC GATGGGCGGC GAAGTGTCCC TCTCCACCGC CGGATTGACC GTCCGGGGCA CCGGCGCCCT ACATGGCCTG ACCGCCGACC TGTCCGACGT GAGTGAGCTG ACCCCGGCGC TGACCGCACT GGCGATGCTC GCCGACTCTC CCTCCCGGTT CACCGGAATC GCCCACATCC GGGGGCACGA GACCGATCGG ATCACGTCGC TCGCCCGCGA GTTCACCGCC CTTGGCGCCG ACCTCACCGA GTTCCACGAC GGGCTGGCGA TCCGCCCCCG GCCGCTGAGA AGCGGGGTGT TCGAGACCTA TCACGACCAT CGGATGGCGC ACGCCGCGGC GATCACCGGC TTGGCCGTAC CCGGCATCGA GCTGAGCGAC GTGGCCTGCA CCTCGAAGAC GATGCCGGAG TTCCCGGCAC TATGGTCGGC GATGGTGACC GGCAAGAGCT GA
|
Protein sequence | MGHLTATRPL RPWTAPTASD PVSTTLRLPG SKSLTARALV LSGLATGPST LARPLRARDT ELMADGLRAM GVHMSISDDE RWLVRPHPLA GPAHVDVGLA GTVMRFLPPV AGLADGQITF DGDPQARLRP LGPLLDALSR LGVRITTPPT GSLPLTVLGG GQIRGGEVVI DASASSQLVS GLLLAAPHFD RGVVVRHVGP PVPSAPHLRM TVHMLRSAGA AVDDTAPDVW TVEPGPLAGR GWEIEPDLSG AVPFFAAALV TGGEVTVTGW PGGSVQPVER LRGLLQAMGG EVSLSTAGLT VRGTGALHGL TADLSDVSEL TPALTALAML ADSPSRFTGI AHIRGHETDR ITSLAREFTA LGADLTEFHD GLAIRPRPLR SGVFETYHDH RMAHAAAITG LAVPGIELSD VACTSKTMPE FPALWSAMVT GKS
|
| |