Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_0690 |
Symbol | aroA |
ID | 4205165 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 810385 |
End bp | 811659 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 642565250 |
Product | 3-phosphoshikimate 1-carboxyvinyltransferase |
Protein accession | YP_698016 |
Protein GI | 110802037 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase |
TIGRFAM ID | [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.539954 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAAAAGG TAATTATAAC TCCTAGTAAG TTAAGGGGAA GTGTAAAAAT ACCACCTTCT AAAAGTATGG CTCATAGAGC TATTATTTGT GCTTCTTTAA GCAAAGGAGA AAGTGTTATT TCTAACATAG ATTTTTCAGA AGATATTATT GCAACTATGG AAGGAATGAA ATCTTTAGGA GCAAATATAA AAGTAGAAAA AGATAAACTA ATTATAAATG GAGAAAATAT TTTAAAGGAT TCTAATTATA AAGTTATTGA TTGTAATGAA TCAGGTTCCA CTTTAAGATT TTTAGTTCCA ATTTCCTTAA TAAAAGATAA TAAAGTTAAT TTTATCGGTA GAGGAAATTT AGGAAAAAGA CCATTAAAAA CTTATTATGA GATTTTTGAG GAGCAAGAAA TTAAGTATTC CTATGAGGAA GAAAATCTTG ATTTGAATAT AGAAGGAAGC TTAAAAGGTG GAGAATTCAA AGTTAAGGGA AATATAAGTT CTCAATTTAT AAGTGGTTTA TTATTTACTC TTCCTTTATT AAAAGATGAT TCTAAAATAA TAATAACTAC AGAACTTGAA TCTAAAGGAT ATATAGATTT AACTTTAGAC ATGATAGAAA AGTTTGGAGT TACAATAAAA AATAATAATT ATAGAGAATT TTTAATAAAA GGTAATCAAA GTTATAAGCC TATGAATTAT AAGGTTGAAG GTGATTACTC ACAGGCTGCT TTCTATTTTT CAGCAGGGGC CTTAGGCTCA GAAATAAATT GTCTTGATTT AGATTTAAGT TCTTATCAAG GGGATAAGGA ATGCATTGAA ATATTAGAGG GTATGGGTGC TAGGCTTATA AAAAATCAAG AAGAGTCTTT AAGTATAATT CATGGGGATT TAAATGGAAC AATTATAGAT GCTTCACAGT GCCCAGATAT AATTCCAGTT TTGACAGTGG TTGCTGCTTT AAGTAAAGGA GAGACTAGTA TTATAAACGG AGAAAGACTT AGAATAAAAG AATGTGATAG ATTAAATGCT ATATGCACTG AGCTTAATAA ACTAGGTGCA GATATAAAGG AATTAAAAGA TGGACTTATA ATAAATGGAG TTAAAGAGTT AATAGGAGGA GAAGTATATA GCCATAAAGA TCATAGAATA GCTATGAGTT TAGCTATTGC TTCTACAAGA TGCAAGGAAG AGGTTATTAT AAGAGAACCA GATTGTGTTA AAAAATCTTA TCCAGGATTT TGGGAAGATT TTAAGAGCTT AAGTGGAATT TTAAGAGAAG AATAA
|
Protein sequence | MKKVIITPSK LRGSVKIPPS KSMAHRAIIC ASLSKGESVI SNIDFSEDII ATMEGMKSLG ANIKVEKDKL IINGENILKD SNYKVIDCNE SGSTLRFLVP ISLIKDNKVN FIGRGNLGKR PLKTYYEIFE EQEIKYSYEE ENLDLNIEGS LKGGEFKVKG NISSQFISGL LFTLPLLKDD SKIIITTELE SKGYIDLTLD MIEKFGVTIK NNNYREFLIK GNQSYKPMNY KVEGDYSQAA FYFSAGALGS EINCLDLDLS SYQGDKECIE ILEGMGARLI KNQEESLSII HGDLNGTIID ASQCPDIIPV LTVVAALSKG ETSIINGERL RIKECDRLNA ICTELNKLGA DIKELKDGLI INGVKELIGG EVYSHKDHRI AMSLAIASTR CKEEVIIREP DCVKKSYPGF WEDFKSLSGI LREE
|
| |