Gene TM1040_0199 details


Gene Information

Locus tag: TM1040_0199
Symbol:
ID: 4078647
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 216374
End bp: 217726
Gene Length: 1353 bp
Protein Length: 450 aa
Translation table: 11
GC content: 63%
IMG OID: 638005493
Product: 3-phosphoshikimate 1-carboxyvinyltransferase
Protein accession: YP_612194
Protein GI: 99080040
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase
TIGRFAM ID: [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase
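
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon, neither of which the record states explicitly. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 216374
END_BP = 217726
GENE_LENGTH_BP = 1353
PROTEIN_LENGTH_AA = 450


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon


# Both checks pass for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under those assumptions the listed coordinates, gene length and protein length agree with each other.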


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGGCC ACGGCACGCC CATCCCGATG ACCTCCCGCC GCGCAAGCCC CCTCAAAGGC 
GAGGCGCATG TCCCTGGCGA CAAGTCGATT TCGCATCGCT CATTGATCCT TGGCGCAATG
GCTGTGGGTG AGACAAAAAT CTCCGGCCTC CTTGAGGGCG AAGATGTGCT CGACACCGCC
AAGGCGATGC AGGCTTTTGG GGCCGAGGTC GTCAATCACG GCGGTGGAGA ATGGTCCGTC
TTTGGCGTGG GCGTCGGCGG TTTTGCCGAG CCGGAAAACG TGATTGACTG CGGCAATTCC
GGCACTGGTG TGCGGCTCAT CATGGGCGCG ATGGCGACCT CGCCGATCAC CGCGACCTTT
ACCGGCGATG CCTCGCTCAA CAAACGCCCG ATGGCGCGTG TGACCGATCC GCTTGCGCTC
TTTGGCGCGC AATCCGTGGG CCGCGAGGGC GGCCGTCTGC CGATGACCAT CGTTGGCGCG
GCCGAGCCCG TGCCGGTGCG CTATGAGGTG CCGGTGCCCT CAGCGCAGGT GAAATCTGCC
GTTCTGCTTG CAGGCCTCAA TGCGCCCGGC AAAACCGTTG TGATTGAGCG CGAAGCCACC
CGCGACCATT CCGAGCGGAT GCTTGCGGGC TTTGGGGCTG AAATCACGGT TGAGGACACC
AAGGAAGGCC GCGTGATTAC CCTCACCGGT CAGCCTGAGC TGAAACCGCA AGTGATTGCA
GTACCGCGCG ATCCCTCCTC TGCCGCCTTC CCGGTTTGCG CCGCGCTCAT CACGCCCGGT
TCTGACGTGC TGGTGCCGGG GATTGGTCTC AACCCGACCC GCGCGGGCCT GTTCTACACC
CTGCAAGACA TGGGCGCGGA TCTGACGTTT GAGAATCCTC GGACCGAAGG CGGCGAACCT
GTGGCCGATC TGCGCGCCAA ATACTCGCCC GACATGAAAG GGATCGAGGT CCCACCAGAA
CGCGCCGCGT CGATGATTGA CGAGTATCCC GTTCTGTCTG TGGTGGCCTC TTTTGCCACC
GGAACCACCA TGATGGCTGG CGTCAAGGAA TTGCGCGTGA AGGAAAGCGA CCGCATCGAT
GCAATGGCAA AGGGCCTGCG CGCCAATGGT GTCACCGTCG AGGAAGGCGA GGACTGGTGG
AGCGTCGAAG GCTGCGGCCC CGAGGGTGTC AAAGGCGGTG GCACTGCCGA GAGCTTCCTT
GATCACCGCA TCGCCATGTC GTTCATGGTG ATGGGTATGG GCGCACAAAA CCCGGTCTCC
GTCGACGATG GCAGCCCGAT CGCGACGTCC TTTCCCATCT TCGAGCGGCT GATGGGCGAT
CTTGGGGCGT CGATCATCCG CACGGATGGC TGA
 
Protein sequence
MSGHGTPIPM TSRRASPLKG EAHVPGDKSI SHRSLILGAM AVGETKISGL LEGEDVLDTA 
KAMQAFGAEV VNHGGGEWSV FGVGVGGFAE PENVIDCGNS GTGVRLIMGA MATSPITATF
TGDASLNKRP MARVTDPLAL FGAQSVGREG GRLPMTIVGA AEPVPVRYEV PVPSAQVKSA
VLLAGLNAPG KTVVIEREAT RDHSERMLAG FGAEITVEDT KEGRVITLTG QPELKPQVIA
VPRDPSSAAF PVCAALITPG SDVLVPGIGL NPTRAGLFYT LQDMGADLTF ENPRTEGGEP
VADLRAKYSP DMKGIEVPPE RAASMIDEYP VLSVVASFAT GTTMMAGVKE LRVKESDRID
AMAKGLRANG VTVEEGEDWW SVEGCGPEGV KGGGTAESFL DHRIAMSFMV MGMGAQNPVS
VDDGSPIATS FPIFERLMGD LGASIIRTDG
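
The relationship between the two sequence blocks above can be sketched in a few lines of Python. This is a minimal illustration, not the tool that produced the record: it builds the codon-to-amino-acid mapping shared by the standard genetic code and translation table 11, strips the 10-base spacing used in the listing, and translates up to the first stop codon. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene begins with ATG. The helper names `clean`, `translate` and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base to the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGTCCGGCC ACGGCACGCC CATCCCGATG"))  # MSGHGTPIPM
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_percent` on the same input can be compared against the GC content field.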