Gene Information |
Locus tag | TM1040_0199 |
Symbol | |
ID | 4078647 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 216374 |
End bp | 217726 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638005493 |
Product | 3-phosphoshikimate 1-carboxyvinyltransferase |
Protein accession | YP_612194 |
Protein GI | 99080040 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase |
TIGRFAM ID | [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase |
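The coordinate, gene length, and protein length fields above are linked by simple arithmetic, which the following Python sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes one terminal stop codon; both conventions are inferred from the numbers in this record, not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(216374, 217726)
print(length)                  # 1353
print(protein_length(length))  # 450
```

Both results agree with the Gene Length (1353 bp) and Protein Length (450 aa) fields of this record.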
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCCGGCC ACGGCACGCC CATCCCGATG ACCTCCCGCC GCGCAAGCCC CCTCAAAGGC GAGGCGCATG TCCCTGGCGA CAAGTCGATT TCGCATCGCT CATTGATCCT TGGCGCAATG GCTGTGGGTG AGACAAAAAT CTCCGGCCTC CTTGAGGGCG AAGATGTGCT CGACACCGCC AAGGCGATGC AGGCTTTTGG GGCCGAGGTC GTCAATCACG GCGGTGGAGA ATGGTCCGTC TTTGGCGTGG GCGTCGGCGG TTTTGCCGAG CCGGAAAACG TGATTGACTG CGGCAATTCC GGCACTGGTG TGCGGCTCAT CATGGGCGCG ATGGCGACCT CGCCGATCAC CGCGACCTTT ACCGGCGATG CCTCGCTCAA CAAACGCCCG ATGGCGCGTG TGACCGATCC GCTTGCGCTC TTTGGCGCGC AATCCGTGGG CCGCGAGGGC GGCCGTCTGC CGATGACCAT CGTTGGCGCG GCCGAGCCCG TGCCGGTGCG CTATGAGGTG CCGGTGCCCT CAGCGCAGGT GAAATCTGCC GTTCTGCTTG CAGGCCTCAA TGCGCCCGGC AAAACCGTTG TGATTGAGCG CGAAGCCACC CGCGACCATT CCGAGCGGAT GCTTGCGGGC TTTGGGGCTG AAATCACGGT TGAGGACACC AAGGAAGGCC GCGTGATTAC CCTCACCGGT CAGCCTGAGC TGAAACCGCA AGTGATTGCA GTACCGCGCG ATCCCTCCTC TGCCGCCTTC CCGGTTTGCG CCGCGCTCAT CACGCCCGGT TCTGACGTGC TGGTGCCGGG GATTGGTCTC AACCCGACCC GCGCGGGCCT GTTCTACACC CTGCAAGACA TGGGCGCGGA TCTGACGTTT GAGAATCCTC GGACCGAAGG CGGCGAACCT GTGGCCGATC TGCGCGCCAA ATACTCGCCC GACATGAAAG GGATCGAGGT CCCACCAGAA CGCGCCGCGT CGATGATTGA CGAGTATCCC GTTCTGTCTG TGGTGGCCTC TTTTGCCACC GGAACCACCA TGATGGCTGG CGTCAAGGAA TTGCGCGTGA AGGAAAGCGA CCGCATCGAT GCAATGGCAA AGGGCCTGCG CGCCAATGGT GTCACCGTCG AGGAAGGCGA GGACTGGTGG AGCGTCGAAG GCTGCGGCCC CGAGGGTGTC AAAGGCGGTG GCACTGCCGA GAGCTTCCTT GATCACCGCA TCGCCATGTC GTTCATGGTG ATGGGTATGG GCGCACAAAA CCCGGTCTCC GTCGACGATG GCAGCCCGAT CGCGACGTCC TTTCCCATCT TCGAGCGGCT GATGGGCGAT CTTGGGGCGT CGATCATCCG CACGGATGGC TGA
Protein sequence | MSGHGTPIPM TSRRASPLKG EAHVPGDKSI SHRSLILGAM AVGETKISGL LEGEDVLDTA KAMQAFGAEV VNHGGGEWSV FGVGVGGFAE PENVIDCGNS GTGVRLIMGA MATSPITATF TGDASLNKRP MARVTDPLAL FGAQSVGREG GRLPMTIVGA AEPVPVRYEV PVPSAQVKSA VLLAGLNAPG KTVVIEREAT RDHSERMLAG FGAEITVEDT KEGRVITLTG QPELKPQVIA VPRDPSSAAF PVCAALITPG SDVLVPGIGL NPTRAGLFYT LQDMGADLTF ENPRTEGGEP VADLRAKYSP DMKGIEVPPE RAASMIDEYP VLSVVASFAT GTTMMAGVKE LRVKESDRID AMAKGLRANG VTVEEGEDWW SVEGCGPEGV KGGGTAESFL DHRIAMSFMV MGMGAQNPVS VDDGSPIATS FPIFERLMGD LGASIIRTDG
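The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch in Python. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons), and it assumes the gene sequence is listed in coding orientation, which is consistent with it beginning with ATG and ending with TGA even though the gene lies on the minus strand. The names `translate` and `reverse_complement` are hypothetical helpers written for this example, not part of any database tool.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def reverse_complement(dna: str) -> str:
    """Reverse complement, for a minus-strand gene cut from the forward strand."""
    dna = "".join(dna.split()).upper()
    return dna.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored;
    codons containing unexpected characters are reported as 'X'.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGTCCGGCC ACGGCACGCC CATCCCGATG"))  # MSGHGTPIPM
```

The output matches the first ten residues of the protein sequence in this record. `reverse_complement` would only be needed when starting from the forward strand of the replicon (NC_008044) instead of the gene sequence as listed here.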