Gene CPR_0690 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_0690 
SymbolaroA 
ID4205165 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp810385 
End bp811659 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content29% 
IMG OID642565250 
Product3-phosphoshikimate 1-carboxyvinyltransferase 
Protein accessionYP_698016 
Protein GI110802037 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0128] 5-enolpyruvylshikimate-3-phosphate synthase 
TIGRFAM ID[TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.539954 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAAAGG TAATTATAAC TCCTAGTAAG TTAAGGGGAA GTGTAAAAAT ACCACCTTCT 
AAAAGTATGG CTCATAGAGC TATTATTTGT GCTTCTTTAA GCAAAGGAGA AAGTGTTATT
TCTAACATAG ATTTTTCAGA AGATATTATT GCAACTATGG AAGGAATGAA ATCTTTAGGA
GCAAATATAA AAGTAGAAAA AGATAAACTA ATTATAAATG GAGAAAATAT TTTAAAGGAT
TCTAATTATA AAGTTATTGA TTGTAATGAA TCAGGTTCCA CTTTAAGATT TTTAGTTCCA
ATTTCCTTAA TAAAAGATAA TAAAGTTAAT TTTATCGGTA GAGGAAATTT AGGAAAAAGA
CCATTAAAAA CTTATTATGA GATTTTTGAG GAGCAAGAAA TTAAGTATTC CTATGAGGAA
GAAAATCTTG ATTTGAATAT AGAAGGAAGC TTAAAAGGTG GAGAATTCAA AGTTAAGGGA
AATATAAGTT CTCAATTTAT AAGTGGTTTA TTATTTACTC TTCCTTTATT AAAAGATGAT
TCTAAAATAA TAATAACTAC AGAACTTGAA TCTAAAGGAT ATATAGATTT AACTTTAGAC
ATGATAGAAA AGTTTGGAGT TACAATAAAA AATAATAATT ATAGAGAATT TTTAATAAAA
GGTAATCAAA GTTATAAGCC TATGAATTAT AAGGTTGAAG GTGATTACTC ACAGGCTGCT
TTCTATTTTT CAGCAGGGGC CTTAGGCTCA GAAATAAATT GTCTTGATTT AGATTTAAGT
TCTTATCAAG GGGATAAGGA ATGCATTGAA ATATTAGAGG GTATGGGTGC TAGGCTTATA
AAAAATCAAG AAGAGTCTTT AAGTATAATT CATGGGGATT TAAATGGAAC AATTATAGAT
GCTTCACAGT GCCCAGATAT AATTCCAGTT TTGACAGTGG TTGCTGCTTT AAGTAAAGGA
GAGACTAGTA TTATAAACGG AGAAAGACTT AGAATAAAAG AATGTGATAG ATTAAATGCT
ATATGCACTG AGCTTAATAA ACTAGGTGCA GATATAAAGG AATTAAAAGA TGGACTTATA
ATAAATGGAG TTAAAGAGTT AATAGGAGGA GAAGTATATA GCCATAAAGA TCATAGAATA
GCTATGAGTT TAGCTATTGC TTCTACAAGA TGCAAGGAAG AGGTTATTAT AAGAGAACCA
GATTGTGTTA AAAAATCTTA TCCAGGATTT TGGGAAGATT TTAAGAGCTT AAGTGGAATT
TTAAGAGAAG AATAA
 
Protein sequence
MKKVIITPSK LRGSVKIPPS KSMAHRAIIC ASLSKGESVI SNIDFSEDII ATMEGMKSLG 
ANIKVEKDKL IINGENILKD SNYKVIDCNE SGSTLRFLVP ISLIKDNKVN FIGRGNLGKR
PLKTYYEIFE EQEIKYSYEE ENLDLNIEGS LKGGEFKVKG NISSQFISGL LFTLPLLKDD
SKIIITTELE SKGYIDLTLD MIEKFGVTIK NNNYREFLIK GNQSYKPMNY KVEGDYSQAA
FYFSAGALGS EINCLDLDLS SYQGDKECIE ILEGMGARLI KNQEESLSII HGDLNGTIID
ASQCPDIIPV LTVVAALSKG ETSIINGERL RIKECDRLNA ICTELNKLGA DIKELKDGLI
INGVKELIGG EVYSHKDHRI AMSLAIASTR CKEEVIIREP DCVKKSYPGF WEDFKSLSGI
LREE