Gene Strop_2986 details


Gene Information

Locus tag: Strop_2986
Symbol:
ID: 5059450
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 3415749
End bp: 3416993
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 68%
IMG OID: 640475237
Product: 3-phosphoshikimate 1-carboxyvinyltransferase
Protein accession: YP_001159802
Protein GI: 145595505
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase
TIGRFAM ID: [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase
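
The coordinate and length fields in this record are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length; neither assumption is stated on this page, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(3415749, 3416993)
print(length)                  # 1245
print(protein_length(length))  # 414
```

Both values agree with the Gene Length and Protein Length fields listed for Strop_2986.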


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.710581
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.414192
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGCAA TACGGGCCCG CGTGCCCGGA TCGAAGTCCT TGACCAACCG TGCCCTCGCC 
ATCGCGGCTA TGGCGGACGG AGTGACCGAA CTCGACAATC CGCTGGTGAG CGATGGCACC
ACGGCGTTCG CCGACGCGCT CGTCGCCCTC GGGGCGTCGG TCGAACGGCA CGCCCAGCGG
TGGACGGTGA CTGGCAGCGG CGCCGGGACG CGGTTGAGGT CCGGTCGAGT CTGGTGCGAG
GACGCCGGTA CGGCAGCCCG GTTCCTACCG CCCATGGCCG CGGCAGCCGG TGGTGTCTTC
GACTTCGATG GCACGGACCA ACTACGTGCC CGGCCACTTC ATCCGCTGAT CGACGCACTG
CGTGCACTTG GCGCGACGGT CGAGCCCAGC GGTGACGGCG AGGGACTGCC GTTCCGTCTG
GTATCGGATG GCCTTACCGG GGGTGAAGTC GTGCTCGCCA GCGGCACCAG CAGCCAGTAC
CTGAGCGGGC TACTCATGGC CGGGCCGTTG CTGTCCAACC CGCTTACGGT GGTCGCGCCG
GAACTGGTCA GCCGGCCGTA CGTCGATATG ACGATCGCCG TGATGGCTCG CTTCGGGGCT
CAGGTCGCTG AGGCCATCCC TGGCCGCTTC ACCGTTCGCC CCGGTCGGTA CACCCGCACC
CAGTTTCTCG TCGAGCCGGA CGCCTCGACC GCTTCGTATG TTCTCGCGGC CGCCGCAGTC
ACGGGGAAGG AGGTATCCGT AGATGGGTTG GGCAGCGCCA GCCTGCAGGG CGACCGGCGG
TTCGTCGACG TGCTGTCTCA ACTGGGCGCC AAGGTGACGG CGGACCGAGA TCGGGTGACG
GTGCGAGGGC CGCGACAGTT GCGTGGGGGA TTCGCGGTCG ACATGGGGCC GATCTCCGAC
ACCTTTATGA CCCTCGCCGC CATCGCGCCG CTCGCTGACG CGCCGATTCG AATCACCGGC
GTGGGCCACG CCCGCCTCAA GGAGTCAGAC CGGATCGACG CGATAGCACA GAACCTTGTC
TCGTGTGGCG TTCCGGTGCG GACAGGAGCG GACTGGATCG AGATTTCCCC GGCGGACCCA
TCCGCGGCCC TGATCCGCTG TCGGCGGGAC CACCGCATCG CGATGTCGTT CTCGGTGCTC
GGGCTGCGGG TTCCCGGTCT GGTCCTCGAT GACCCGGCAT GCGTGTCGAA GACCTTTCCC
GGATTCCACG ACGAGTTGGC AAGACTGTTC GCCGGCGACC GCTGA
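
The gene sequence is printed in blocks of ten bases, so it has to be joined into one string before any analysis. The following is a minimal sketch of how a GC fraction such as the one reported in the GC content field could be computed from this listing; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the page does not say how its own GC value was derived or rounded.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the listing, copied as printed (six blocks of ten bases).
first_row = "ATGAGCGCAA TACGGGCCCG CGTGCCCGGA TCGAAGTCCT TGACCAACCG TGCCCTCGCC"
print(len(clean_sequence(first_row)))  # 60
print(f"{gc_fraction(first_row):.0%}")
```

To check the full gene, paste all rows of the listing into one string and pass it to `gc_fraction`.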
 
Protein sequence
MSAIRARVPG SKSLTNRALA IAAMADGVTE LDNPLVSDGT TAFADALVAL GASVERHAQR 
WTVTGSGAGT RLRSGRVWCE DAGTAARFLP PMAAAAGGVF DFDGTDQLRA RPLHPLIDAL
RALGATVEPS GDGEGLPFRL VSDGLTGGEV VLASGTSSQY LSGLLMAGPL LSNPLTVVAP
ELVSRPYVDM TIAVMARFGA QVAEAIPGRF TVRPGRYTRT QFLVEPDAST ASYVLAAAAV
TGKEVSVDGL GSASLQGDRR FVDVLSQLGA KVTADRDRVT VRGPRQLRGG FAVDMGPISD
TFMTLAAIAP LADAPIRITG VGHARLKESD RIDAIAQNLV SCGVPVRTGA DWIEISPADP
SAALIRCRRD HRIAMSFSVL GLRVPGLVLD DPACVSKTFP GFHDELARLF AGDR
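
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below is a simplified, dependency-free translator: it uses the sense-codon assignments that table 11 shares with the standard genetic code, stops at the first stop codon, and does not model alternative start codons (not needed here, since the gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative, and ambiguous bases such as N are not handled.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, as printed in this record.
print(translate("ATGAGCGCAA TACGGGCCCG CGTGCCCGGA"))  # MSAIRARVPG
```

The output matches the first ten residues of the protein sequence listed above.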