Gene Smed_0515 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0515 
Symbol 
ID5321349 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp558536 
End bp559633 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content63% 
IMG OID640789449 
Productchorismate synthase 
Protein accessionYP_001326206 
Protein GI150395739 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCACA ATACATTCGG TCATCTTTTC CGCGTAACGA CCTGGGGGGA AAGCCACGGT 
CCTGCACTTG GCTGTGTCGT TGACGGGTGC CCGCCCGGAC TTCGCTTCAC GCTCGCGGAC
ATTCAAACCT GGCTTGACAA GCGCAAGCCG GGGCAGTCCC GCTTCGTCAC TCAGCGTCGC
GAGGACGATC TTGTAAAGGT CCTTTCCGGG GTGATGCTCG ACGACGACGG TGAGACGATG
ATTTCCACCG GGACGCCGAT TTCGATGTTG ATCGAAAACA CCGACCAACG GTCCAAGGAT
TACTCGGAGA TCGCCAAGCG CTATCGCCCG GGCCATGCGG ATTATACATA TGACGTGAAA
TACGGCATCC GCGACTATCG CGGCGGCGGC CGCTCCTCGG CCCGTGAGAC GGCTGCACGC
GTGGCTGCCG GTGCGATCGC ACGCCAGGTC GTCCCCGGTC TCGTCGTGCG CGGCGCGCTC
GTCCAGATCG GCAAGCACAG AATCGACCGC GCCAACTGGG ACTGGGCGGA AGTCGGCAAA
AACCCGTTCT TTTCTCCCGA CCCGGCCGTC GTTCCTGTCT GGGAGGAATA TCTCGACGGC
ATCCGCAAAG CCGGCTCGTC GATCGGCGCA ATCGTCGAGG TGATTGCCGA AGGCGTTCCG
GCCGGTATAG GCGCACCGAT CTACGGCAAG CTCGATCAGG ACATCGCCGC CAACCTGATG
TCGATCAATG CCGTAAAGGG CGTGGAGATC GGTAATGGCT TCGCGGCCGC CGAGATCAGC
GGCGAAGACA ATGCCGACGA GATGCGTGTC GGGGCAGGGG GCGACGCGGT TTTCCTTTCC
AACAATGCCG GCGGCATTCT GGGCGGCATC TCGACCGGGC AGCCGGTGGT TGCGCGCTTC
GCCATCAAGC CGACCTCTTC CATTCTGAGC GAGCGCCGTT CGATCGATAG CGACGGCAAG
GAGGTCGACG TGCGCACCAA GGGTCGCCAC GACCCCTGCG TCGGCATCCG GGCCGTACCG
ATCGGGGAGG CGATGCTTGC CTGCGCAATC GCCGACCATT ACCTGCGTGA TCGCGGCCAG
ACAGGCCGGC TGAAGTAA
 
Protein sequence
MSHNTFGHLF RVTTWGESHG PALGCVVDGC PPGLRFTLAD IQTWLDKRKP GQSRFVTQRR 
EDDLVKVLSG VMLDDDGETM ISTGTPISML IENTDQRSKD YSEIAKRYRP GHADYTYDVK
YGIRDYRGGG RSSARETAAR VAAGAIARQV VPGLVVRGAL VQIGKHRIDR ANWDWAEVGK
NPFFSPDPAV VPVWEEYLDG IRKAGSSIGA IVEVIAEGVP AGIGAPIYGK LDQDIAANLM
SINAVKGVEI GNGFAAAEIS GEDNADEMRV GAGGDAVFLS NNAGGILGGI STGQPVVARF
AIKPTSSILS ERRSIDSDGK EVDVRTKGRH DPCVGIRAVP IGEAMLACAI ADHYLRDRGQ
TGRLK