Gene Sare_4099 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4099 
Symbol 
ID5706526 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4658061 
End bp4659362 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content72% 
IMG OID641273525 
Product3-phosphoshikimate 1-carboxyvinyltransferase 
Protein accessionYP_001538880 
Protein GI159039627 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0128] 5-enolpyruvylshikimate-3-phosphate synthase 
TIGRFAM ID[TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0307354 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGACATC TGACCGCTAC CCGCCCGCTG CGCCCGTGGA CCGCGCCGAC CGCCTCGGAT 
CCGGTCTCGA CGACGCTGCG CCTACCCGGC TCCAAGTCAC TCACCGCACG TGCCCTGGTG
CTCAGCGGAC TGGCCACCGG CCCATCGACG CTCGCCAGGC CACTGCGCGC CCGAGACACC
GAGCTGATGG CCGACGGGCT GCGGGCCATG GGTGTGCACA TGTCGATCAG CGACGACGAG
CGCTGGTTGG TCCGACCGCA CCCACTGGCC GGACCGGCAC ACGTCGACGT CGGCCTGGCA
GGCACGGTGA TGCGGTTCCT TCCCCCGGTG GCAGGCCTTG CGGACGGCCA GATCACCTTC
GACGGTGACC CGCAGGCTCG ACTCCGTCCG CTTGGGCCGC TCCTGGACGC CCTGAGCCGC
CTCGGCGTCC GGATCACCAC ACCACCCACC GGAAGCCTGC CGCTGACCGT GCTCGGTGGC
GGACAGATCC GCGGGGGCGA GGTCGTGATC GACGCCAGCG CCTCCAGCCA ACTCGTCTCG
GGGCTGCTGC TGGCCGCCCC GCACTTCGAC CGGGGCGTGG TCGTCCGACA CGTCGGCCCG
CCGGTGCCCT CCGCGCCCCA CCTGCGGATG ACGGTGCACA TGTTGCGGTC CGCCGGCGCC
GCCGTCGACG ACACCGCCCC CGACGTCTGG ACCGTTGAGC CGGGCCCGCT TGCTGGCCGC
GGCTGGGAGA TCGAACCGGA CCTCTCCGGT GCGGTCCCCT TCTTCGCCGC CGCACTGGTC
ACCGGCGGGG AGGTGACCGT CACCGGCTGG CCGGGGGGCA GCGTCCAGCC GGTCGAGCGG
CTCCGCGGGC TGTTGCAGGC GATGGGCGGC GAAGTGTCCC TCTCCACCGC CGGATTGACC
GTCCGGGGCA CCGGCGCCCT ACATGGCCTG ACCGCCGACC TGTCCGACGT GAGTGAGCTG
ACCCCGGCGC TGACCGCACT GGCGATGCTC GCCGACTCTC CCTCCCGGTT CACCGGAATC
GCCCACATCC GGGGGCACGA GACCGATCGG ATCACGTCGC TCGCCCGCGA GTTCACCGCC
CTTGGCGCCG ACCTCACCGA GTTCCACGAC GGGCTGGCGA TCCGCCCCCG GCCGCTGAGA
AGCGGGGTGT TCGAGACCTA TCACGACCAT CGGATGGCGC ACGCCGCGGC GATCACCGGC
TTGGCCGTAC CCGGCATCGA GCTGAGCGAC GTGGCCTGCA CCTCGAAGAC GATGCCGGAG
TTCCCGGCAC TATGGTCGGC GATGGTGACC GGCAAGAGCT GA
 
Protein sequence
MGHLTATRPL RPWTAPTASD PVSTTLRLPG SKSLTARALV LSGLATGPST LARPLRARDT 
ELMADGLRAM GVHMSISDDE RWLVRPHPLA GPAHVDVGLA GTVMRFLPPV AGLADGQITF
DGDPQARLRP LGPLLDALSR LGVRITTPPT GSLPLTVLGG GQIRGGEVVI DASASSQLVS
GLLLAAPHFD RGVVVRHVGP PVPSAPHLRM TVHMLRSAGA AVDDTAPDVW TVEPGPLAGR
GWEIEPDLSG AVPFFAAALV TGGEVTVTGW PGGSVQPVER LRGLLQAMGG EVSLSTAGLT
VRGTGALHGL TADLSDVSEL TPALTALAML ADSPSRFTGI AHIRGHETDR ITSLAREFTA
LGADLTEFHD GLAIRPRPLR SGVFETYHDH RMAHAAAITG LAVPGIELSD VACTSKTMPE
FPALWSAMVT GKS