Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_1332 |
Symbol | |
ID | 8415630 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 1595615 |
End bp | 1596943 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 645024301 |
Product | 3-phosphoshikimate 1-carboxyvinyltransferase |
Protein accession | YP_003181690 |
Protein GI | 257791084 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase |
TIGRFAM ID | [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.467827 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCATG AAGCAAACGC CACCGTCGTC AACCCGCTGC CGGCGCCCCT GCGCGGCTCG GCCTCGGTCC CGGGCGACAA GTCCATATCG CACCGCGCCG TGCTGTTCGC CGCCATGGCC GAGGGCACCT CGCGGCTTTC GGGCGTGCTC GACTCCGAGG ACGTGCGCTC GTCCATCAGG GCCGTAGGCC AGCTGGGCGC GCAGGTGTCG CTCGAGAAGC AGCCCGACGG CAGCTTGGCG GGAGGCGTCA CGGGGTGGGG CGCCGCCGGC CCCTCGCAGC CCGAGGCTCC CATCGACTGC GGCAACTCCG GCACCACGGT GCGCCTGCTC ATGGGCGTGC TGGCGCCGTG GAACGTGCGC GTGGAGCTGA CGGGCGACGA CTCGCTGCAG CGCCGTCCCA TGCGCCGCAT CACCGCGCCG CTCATGAAGA TGGGCGCGCG CTTCGAGCCC GAAGGGCGCG AGACGCTGCC GCTCACGGTG TGCGGCTCCG AGGGCCTGCG CGCGATAACC TACGACGCCC CCATGGCGTC GGCCCAGCTC AAAACCGCGG TGCTGCTGGC CGGCGTGTAT GCGCGGGGCA CGACGACGCT CAACGAGCCC TCGCCCTCGC GCAACCATAC CGAGCTCATG CTGCCCGAAT TCGGCGTGAC CACCACAGCG GCCGACCGCA CGGCCAGCGT GACGGGTCCC GCTGCGCTTC GGGCCTGCGA GGTGCAGGTG CCGGGCGATC CGTCTTCGGC GGCGTTCCTC GTCTGCGCTG CCGTGTTGAA GCCCGACAGC TCCATCCAGG TGGAGAACGT CAGCTTGAAC ACCGCGCGCA TCGGGTTTAC GCGCACGCTC GAGCGCATGG GCGCCGATAT CAGCGTGCGT CACACGGGAG CGGCGGGCAA GGAGCCCTAC GGCATCGTTT CGGCGTGCTA CACGCCGAAC CTGCATGGGT GCGAGGTGCC GGCCGACAAG ATCGCCACCA TCGTCGACGA AGTGCCGGTG TTGGCGGTCG TGGCGGCTCA CGCGCGCGGC GTCACCGTGT TCCGCGAGGT CAGCGAGTTG CGCGTGAAAG AGACCGACCG CCTGGCCGCC ATCGTGGAGG GGCTTGAGAC TCTCGGTGTG GACGCGTGGA TCGACGGCAA CGACCTGTTC GTGGAAGGCC AGCCGGGCTT GCAAGTGCCC GTCGGCGCCG CATTCGACTC GAAGAACGAC CACCGTCTGG CCATGACCTG GGCGCTCGCC GGCCTGTGCG GCAACGCTCC CGTCGAAGTG GAGAACTTCG ACTCCGTGAA AATCAGCTAT CCGCAGTTCC TCACCGATAT CGAAAGGTTG GCACGATGA
|
Protein sequence | MSHEANATVV NPLPAPLRGS ASVPGDKSIS HRAVLFAAMA EGTSRLSGVL DSEDVRSSIR AVGQLGAQVS LEKQPDGSLA GGVTGWGAAG PSQPEAPIDC GNSGTTVRLL MGVLAPWNVR VELTGDDSLQ RRPMRRITAP LMKMGARFEP EGRETLPLTV CGSEGLRAIT YDAPMASAQL KTAVLLAGVY ARGTTTLNEP SPSRNHTELM LPEFGVTTTA ADRTASVTGP AALRACEVQV PGDPSSAAFL VCAAVLKPDS SIQVENVSLN TARIGFTRTL ERMGADISVR HTGAAGKEPY GIVSACYTPN LHGCEVPADK IATIVDEVPV LAVVAAHARG VTVFREVSEL RVKETDRLAA IVEGLETLGV DAWIDGNDLF VEGQPGLQVP VGAAFDSKND HRLAMTWALA GLCGNAPVEV ENFDSVKISY PQFLTDIERL AR
|
| |