Gene Information |
Locus tag | Cphy_3093 |
Symbol | |
ID | 5743179 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | + |
Start bp | 3782954 |
End bp | 3784057 |
Gene Length | 1104 bp |
Protein Length | 367 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641294193 |
Product | chorismate synthase |
Protein accession | YP_001560188 |
Protein GI | 160881220 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
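
The coordinate and length fields above can be cross-checked with a short calculation. This is a minimal sketch in Python, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive (the record does not state the convention) and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
# Values copied from the record above.
start_bp = 3782954
end_bp = 3784057

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 1104 367
```

Under these assumptions the result matches the listed gene length of 1104 bp and protein length of 367 aa.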
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000101466 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGGTT CAAGTTTCGG ATCAATTTTT AAAATAGCAA CTTGGGGAGA ATCCCATGGA AAAGGTATCG GCGTTGTTGT TGACGGGTGT CCTGCAGGTC TTACTCTAAA TGAAGAAATG ATTCAGACAT TTCTAAACCG TCGAAAACCT GGGCAAACGA AATATTCGAC TCCAAGAAAA GAAGATGATC TTGTAACAAT CCTATCTGGT GTTTTTGAAG GAAAAACTAC AGGTACCCCA ATTTCCATGA TGATTGCAAA TGAGACTGCA CGTTCTGCAG ATTATAGTGA AATAGCAAGC TTTTATAGAC CTGGTCATGC AGACTATACT TTTGATGCAA AATACGGTTT TCGTGACTAT CGCGGGGGTG GACGTTCCTC AGGACGTGAA ACAATTGGAC GTGTAGCAGC AGGTGCAATC GCTGCTGCCC TCTTAAAAGA ACTAGGAATT GAAGTTTTTA CTTATACCAA ATCCATTGGT CCTATTCAAA TTGATTATCA TAAGTGCCAA AAAGAAAACT TAACTTTAAG TCCTCTTTGC ATGCCAGATT TAGAAGCATC TCAGAAAGCG GAAGATTATC TAGAGCAGTG CATTCACAAT TTAGACTCTA GTGGTGGTAT GATTGAATGC ATTATATCTG GAGTTCCAGC AGGAATTGGG GAACCAGTAT TTGATAAATT AGATGCGCAG CTTGCAAAGG CGATATTCTC TATTGGCGCT GTAAAGGGCT TTGAGATTGG ATCTGGTTTT GAAGTAGCAA AACAGTTAGG TTCCGAAAAT AATGATGGGT TTGCATTCGA TGCAAATGGA AAACTCATTA AGTTAACCAA TCATTCTGGC GGTATCCTTG GAGGAATTAG TGATGGCTCC GAAATTATCT TCCGGGCTGC AATTAAACCA ACTCCTTCTA TAAAAAAAGA ACAGCAAACC GTTAATAAAT CAGGTGAGAA CATAAATGTA TCTATAAAAG GCCGTCATGA TCCAATTATA GTCCCAAGGG CAGTTGTTGT TGTGGAAGCG ATGGCAGCCT TAACTCTAGC AGATTTGTTA CTGAGTGGTA TGTCCTCAAA AATGGATTAC GTAAAGAAAA TCTATCAAAA ATAA
|
Protein sequence | MSGSSFGSIF KIATWGESHG KGIGVVVDGC PAGLTLNEEM IQTFLNRRKP GQTKYSTPRK EDDLVTILSG VFEGKTTGTP ISMMIANETA RSADYSEIAS FYRPGHADYT FDAKYGFRDY RGGGRSSGRE TIGRVAAGAI AAALLKELGI EVFTYTKSIG PIQIDYHKCQ KENLTLSPLC MPDLEASQKA EDYLEQCIHN LDSSGGMIEC IISGVPAGIG EPVFDKLDAQ LAKAIFSIGA VKGFEIGSGF EVAKQLGSEN NDGFAFDANG KLIKLTNHSG GILGGISDGS EIIFRAAIKP TPSIKKEQQT VNKSGENINV SIKGRHDPII VPRAVVVVEA MAALTLADLL LSGMSSKMDY VKKIYQK
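
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a hedged sketch in Python, not the method used to produce the record. The names `translate` and `gc_fraction` are hypothetical helpers introduced here for illustration. The codon table is the standard genetic code in NCBI order; translation table 11 uses the same codon-to-amino-acid assignments and differs only in the permitted start codons. Only the first 30 and last 15 bases of the gene sequence are used, to keep the example short.

```python
# Standard genetic code in NCBI order (first, second, third base each in T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (hypothetical helper)."""
    seq = "".join(dna.split()).upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


head = "ATGTCCGGTT CAAGTTTCGG ATCAATTTTT"  # first 30 bases of the gene sequence
tail = "ATCTATCAAAAATAA"                   # last 15 bases, ending in the stop codon

print(translate(head))  # MSGSSFGSIF
print(translate(tail))  # IYQK
```

The two fragments translate to the first ten and last four residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give the value behind the GC content field; the result is not quoted here because it was not recomputed.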