Gene Smed_3456 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3456 
Symbol 
ID5324343 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp3663656 
End bp3665026 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content65% 
IMG OID640792407 
Product3-phosphoshikimate 1-carboxyvinyltransferase 
Protein accessionYP_001329109 
Protein GI150398642 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0128] 5-enolpyruvylshikimate-3-phosphate synthase 
TIGRFAM ID[TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCACG GCTCGAACCC GCGACCCGCC ACCGCACGCA AGTCTTCCGA CCTCAAGGGC 
ACGGTGCGCA TCCCGGGCGA CAAGTCGATC TCGCACCGCT CCTTCATGTT CGGCGGCCTT
GCTTCGGGTG AGACGCGCAT CACCGGACTC CTCGAAGGCG AAGACGTGAT CAATACCGGC
AGGGCCATGC AGGCGATGGG TGCCAAAATT CGCAAGGAGG GCGACACCTG GATCATCGAC
GGCGTCGGGA ACGGTGCGCT GCTCGCGCCC GAGGCGCCGC TCGACTTCGG CAATGCGGGA
ACCGGCTGCC GTCTGACCAT GGGCCTCGTC GGCGTCTATG ATTTCGATTC CACCTTCATC
GGCGACGCCT CGCTGACGAA GCGCCCGATG GGCCGCGTGC TCGATCCGCT CCGCGAAATG
GGTGTCCAGG TGAAGTCGGC CGAGGGCGAC CGGTTGCCGG TGACGCTGCG CGGACCGAAG
ACGCCGAACC CGATCACCTA TCGCGTGCCG ATGGCTTCGG CCCAGGTGAA ATCCGCCGTG
CTGCTCGCCG GGCTGAACAC GCCCGGCATC ACCACCGTCG TCGAGCCGGT AATGACCCGC
GACCATACGG AAAAGATGCT CCAGGGCTTC GGCGCGGACC TTTCGGTCGA GACCGACAGA
GACGGCGTGC GCACCATCCG GCTCGAAGGC CGAGGCAAGC TGAGGGGTCA GGTGATCGAC
GTGCCGGGCG ATCCGTCTTC CACGGCCTTT CCGCTGGTGG CGGCGCTTCT TGTGCCGGGG
TCCGACCTTA GCATTTTCAA CGTGCTGATG AACCCGACCC GTACAGGGCT CATTCTAACG
CTTCAGGAAA TGGGTGCCAG GATCGAAGTG CTGAGCAGCC GGCTTGCCGG TGGCGAGGAT
GTCGCCGATC TGCGCGTACG CTATTCGGAA CTCAAGGGCG TGACGGTACC GGAAGAGCGC
GCACCCTCGA TGATCGACGA ATATCCCGTG CTTGCCGTCG CGGCCGCCTT CGCCGAAGGC
GCAACGGTGA TGAACGGGCT GGAGGAACTG AGGGTCAAGG AATCCGATCG CCTGTCCGCC
GTTGCCGAGG GACTGAAGCT CAACGGCGTC GATTGCGACG AGGGCGAGGC CTCGCTCGTC
GTACGCGGCC GGCCGGGCGG CAAGGGCCTG GGCAATGATT CGGGCGGCCA GGTCAAAACC
CATCTCGATC ACCGCATCGC CATGAGCTTC CTTGTCATGG GGCTTGCCTC GGAGCGTCCG
GTGACGGTCG ACGATGCGAC GATGATCGCG ACGTCCTTCC CCGAGTTCAT GGGACTGATG
ACAGGGTTGG GCGCGAAGAT CGAAGAGACG GAAAACAAGG CGGCCCTATG A
 
Protein sequence
MSHGSNPRPA TARKSSDLKG TVRIPGDKSI SHRSFMFGGL ASGETRITGL LEGEDVINTG 
RAMQAMGAKI RKEGDTWIID GVGNGALLAP EAPLDFGNAG TGCRLTMGLV GVYDFDSTFI
GDASLTKRPM GRVLDPLREM GVQVKSAEGD RLPVTLRGPK TPNPITYRVP MASAQVKSAV
LLAGLNTPGI TTVVEPVMTR DHTEKMLQGF GADLSVETDR DGVRTIRLEG RGKLRGQVID
VPGDPSSTAF PLVAALLVPG SDLSIFNVLM NPTRTGLILT LQEMGARIEV LSSRLAGGED
VADLRVRYSE LKGVTVPEER APSMIDEYPV LAVAAAFAEG ATVMNGLEEL RVKESDRLSA
VAEGLKLNGV DCDEGEASLV VRGRPGGKGL GNDSGGQVKT HLDHRIAMSF LVMGLASERP
VTVDDATMIA TSFPEFMGLM TGLGAKIEET ENKAAL