Gene CPF_0690 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0690 
SymbolaroC 
ID4203801 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp826544 
End bp827617 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content32% 
IMG OID638081575 
Productchorismate synthase 
Protein accessionYP_695142 
Protein GI110800421 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.140082 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAGGAG TTTGGGGTAA TAAAATAAAA TTATCTATAT TTGGAGAATC TCATGGAGAA 
GGAATAGGAA TAGTAATAGA TGGAATAGAA CCTGGAATAA AAATAAATAT GGATAACATA
GAAAAAGATA TGGAAAGAAG AGCACCAGGA AGAAATAGTT TATCAACTCA AAGAAAAGAA
GGGGATAAAC CAGAAATTTT AAGTGGAATA TTTAATGGAA TCACCACAGG GGCTCCTATT
TCAATGATAA TAAGAAATAC AGATAAAAGA TCTAGGGATT ATTCAAAAAT AAAAGATGTA
ATGAGACCAG GCCATGCAGA TTTCCCAGGA TACATAAGAT ATAATGGCTT TAATGATTAT
AGAGGGGGAG GACATTTCTC AGGAAGAATA ACAGCGCCCT TAGTTTTTGC TGGAGCCTTA
GCTAAGGAAA TACTTAAGGA AAAAGATATA ACTATTGGTT CTCATATTAA GCAAGTTGGA
AAAGTTAAGG ATTCTTCTTT TGATGCATTA AATTTAAAGA AAGAAGATTT AGAAGAACTT
TTAACTAAAG AACTTCCAGT AATAGATACA AATAAAATAG AAGAAATTAA GGAAGAGATT
ACTTCATATA GAATGGAAGG AGATTCTATT GGAGGAATTG TTGAGTGCGC CATAGTAGGA
TTAGAGGCTG GTATAGGAAA TCCATTCTTT GATTCTTTAG AAAGTACCAT AGCTCATTTA
GCTTTTTCAG TGCCTGCTGT AAAGGGAATT GAATTTGGAG CAGGTTTTGA CTTTGCAAAT
ATGAAAGGTT CAGAAGCAAA TGACGAATAT TTCATAGAAT ATGAAAAAGT TAAGACATAC
TCTAATAATA ATGGAGGAAT AACTGGTGGA ATATCAAATG GAATGCCAGT TATATTCAGA
GTTGTTATAA AACCTACACC ATCTATTTCT AAAGAACAAA GAACTATAAA TATAAAAAAT
ATGACAGAGG AAGTTCTAAG TGTAAATGGT AGACATGATC CTTGTATAGT TCAAAGAGCC
TTAGTTGTTA TAGAAGCCAT TGCAGCTATT TCTATATTAG AGTTAATAAA ATAA
 
Protein sequence
MGGVWGNKIK LSIFGESHGE GIGIVIDGIE PGIKINMDNI EKDMERRAPG RNSLSTQRKE 
GDKPEILSGI FNGITTGAPI SMIIRNTDKR SRDYSKIKDV MRPGHADFPG YIRYNGFNDY
RGGGHFSGRI TAPLVFAGAL AKEILKEKDI TIGSHIKQVG KVKDSSFDAL NLKKEDLEEL
LTKELPVIDT NKIEEIKEEI TSYRMEGDSI GGIVECAIVG LEAGIGNPFF DSLESTIAHL
AFSVPAVKGI EFGAGFDFAN MKGSEANDEY FIEYEKVKTY SNNNGGITGG ISNGMPVIFR
VVIKPTPSIS KEQRTINIKN MTEEVLSVNG RHDPCIVQRA LVVIEAIAAI SILELIK