Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPF_0689 |
Symbol | aroA |
ID | 4202833 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens ATCC 13124 |
Kingdom | Bacteria |
Replicon accession | NC_008261 |
Strand | + |
Start bp | 825206 |
End bp | 826480 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 638081574 |
Product | 3-phosphoshikimate 1-carboxyvinyltransferase |
Protein accession | YP_695141 |
Protein GI | 110799152 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase |
TIGRFAM ID | [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.533665 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAAAAGG TAATTATAAC TCCTAGTAAG TTAAAGGGAA GTGTAAAAAT ACCACCTTCT AAAAGTATGG CTCATAGAGC TATTATTTGT GCTTCTTTAA GCAAAGGAGA AAGTGTTATT TCTAACATAG ATTTTTCAGA AGATATTATT GCAACTATGG AAGGTATGAA ATCTTTAGGA GCAAATATAA AAGTAGAAAA AGATAAACTA ATTATAAATG GAGAAAATAT TTTAAAGGAT TCTAATTATA AAGTTATTGA TTGTAATGAA TCAGGTTCCA CTTTAAGATT TTTAGTTCCG ATTTCCTTAA TAAAAGATAA TAGAGTTAAT TTTATCGGTA GAGGAAATTT AGGGAAAAGA CCATTAAAAA CTTATTATGA GATTTTTGAG GAGCAAGAAG TTAAGTATTC CTATGAGGAA GAAAATCTTG ATTTGAATAT AGAAGGAAGC TTAAAAGGTG GAGAATTCAA AGTTAAGGGA AATATAAGTT CTCAATTTAT AAGTGGTTTA TTATTTACTC TTCCTTTATT AAAAGAGGAT TCTAAAATAA TAATAACTAC AGAACTTGAA TCTAAAGGAT ATATAGATTT AACTTTAGAC ATGATAGAAA AGTTTGGAGT TACAATAAAA AATAATAATT ATAGAGAATT TTTAATAAAG GGTAATCAAA GTTATAAGCC TATGAATTAT AAGGTTGAAG GTGATTACTC ACAGGCTGCT TTTTATTTTT CAGCAGGGGC TTTAGGCTCA GAAATAAATT GTCTTGATTT AGATTTAAGT TCTTATCAAG GAGATAAGGA ATGCATTGAA ATATTAGAGG GTATGGGTGC TAGGCTTATA GAAAATCAAG AAGAGTCTTT AAGTATAATT CATGGGGATT TAAATGGAAC AATTATAGAT GCTTCACAAT GCCCAGATAT AATTCCTGTT TTGACAGTGG TTGCTGCTTT AAGTAAGGGA GAGACTAGGA TTATAAACGG AGAAAGACTT AGAATAAAAG AATGTGATAG ATTAAATGCT ATATGTACAG AGCTTAATAA ACTAGGTGCA GATATAAAGG AATTAAAAGA TGGCCTTATA ATAAAGGGAG TTAAAGAATT AATAGGAGGA GAAGTATATA GTCATAAAGA TCATAGAATA GCTATGAGTT TGGCTATTGC TTCTACAAGA TGCAAGGAAG AGGTTATTAT AAAAGAACCA GATTGTGTTA AAAAATCTTA TCCAGGATTT TGGGAAGATT TTAAGAGCTT AGGTGGAATT TTAAAAGGAG AATAA
|
Protein sequence | MKKVIITPSK LKGSVKIPPS KSMAHRAIIC ASLSKGESVI SNIDFSEDII ATMEGMKSLG ANIKVEKDKL IINGENILKD SNYKVIDCNE SGSTLRFLVP ISLIKDNRVN FIGRGNLGKR PLKTYYEIFE EQEVKYSYEE ENLDLNIEGS LKGGEFKVKG NISSQFISGL LFTLPLLKED SKIIITTELE SKGYIDLTLD MIEKFGVTIK NNNYREFLIK GNQSYKPMNY KVEGDYSQAA FYFSAGALGS EINCLDLDLS SYQGDKECIE ILEGMGARLI ENQEESLSII HGDLNGTIID ASQCPDIIPV LTVVAALSKG ETRIINGERL RIKECDRLNA ICTELNKLGA DIKELKDGLI IKGVKELIGG EVYSHKDHRI AMSLAIASTR CKEEVIIKEP DCVKKSYPGF WEDFKSLGGI LKGE
|
| |