Gene CPF_0689 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0689 
SymbolaroA 
ID4202833 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp825206 
End bp826480 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content29% 
IMG OID638081574 
Product3-phosphoshikimate 1-carboxyvinyltransferase 
Protein accessionYP_695141 
Protein GI110799152 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0128] 5-enolpyruvylshikimate-3-phosphate synthase 
TIGRFAM ID[TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.533665 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAAAGG TAATTATAAC TCCTAGTAAG TTAAAGGGAA GTGTAAAAAT ACCACCTTCT 
AAAAGTATGG CTCATAGAGC TATTATTTGT GCTTCTTTAA GCAAAGGAGA AAGTGTTATT
TCTAACATAG ATTTTTCAGA AGATATTATT GCAACTATGG AAGGTATGAA ATCTTTAGGA
GCAAATATAA AAGTAGAAAA AGATAAACTA ATTATAAATG GAGAAAATAT TTTAAAGGAT
TCTAATTATA AAGTTATTGA TTGTAATGAA TCAGGTTCCA CTTTAAGATT TTTAGTTCCG
ATTTCCTTAA TAAAAGATAA TAGAGTTAAT TTTATCGGTA GAGGAAATTT AGGGAAAAGA
CCATTAAAAA CTTATTATGA GATTTTTGAG GAGCAAGAAG TTAAGTATTC CTATGAGGAA
GAAAATCTTG ATTTGAATAT AGAAGGAAGC TTAAAAGGTG GAGAATTCAA AGTTAAGGGA
AATATAAGTT CTCAATTTAT AAGTGGTTTA TTATTTACTC TTCCTTTATT AAAAGAGGAT
TCTAAAATAA TAATAACTAC AGAACTTGAA TCTAAAGGAT ATATAGATTT AACTTTAGAC
ATGATAGAAA AGTTTGGAGT TACAATAAAA AATAATAATT ATAGAGAATT TTTAATAAAG
GGTAATCAAA GTTATAAGCC TATGAATTAT AAGGTTGAAG GTGATTACTC ACAGGCTGCT
TTTTATTTTT CAGCAGGGGC TTTAGGCTCA GAAATAAATT GTCTTGATTT AGATTTAAGT
TCTTATCAAG GAGATAAGGA ATGCATTGAA ATATTAGAGG GTATGGGTGC TAGGCTTATA
GAAAATCAAG AAGAGTCTTT AAGTATAATT CATGGGGATT TAAATGGAAC AATTATAGAT
GCTTCACAAT GCCCAGATAT AATTCCTGTT TTGACAGTGG TTGCTGCTTT AAGTAAGGGA
GAGACTAGGA TTATAAACGG AGAAAGACTT AGAATAAAAG AATGTGATAG ATTAAATGCT
ATATGTACAG AGCTTAATAA ACTAGGTGCA GATATAAAGG AATTAAAAGA TGGCCTTATA
ATAAAGGGAG TTAAAGAATT AATAGGAGGA GAAGTATATA GTCATAAAGA TCATAGAATA
GCTATGAGTT TGGCTATTGC TTCTACAAGA TGCAAGGAAG AGGTTATTAT AAAAGAACCA
GATTGTGTTA AAAAATCTTA TCCAGGATTT TGGGAAGATT TTAAGAGCTT AGGTGGAATT
TTAAAAGGAG AATAA
 
Protein sequence
MKKVIITPSK LKGSVKIPPS KSMAHRAIIC ASLSKGESVI SNIDFSEDII ATMEGMKSLG 
ANIKVEKDKL IINGENILKD SNYKVIDCNE SGSTLRFLVP ISLIKDNRVN FIGRGNLGKR
PLKTYYEIFE EQEVKYSYEE ENLDLNIEGS LKGGEFKVKG NISSQFISGL LFTLPLLKED
SKIIITTELE SKGYIDLTLD MIEKFGVTIK NNNYREFLIK GNQSYKPMNY KVEGDYSQAA
FYFSAGALGS EINCLDLDLS SYQGDKECIE ILEGMGARLI ENQEESLSII HGDLNGTIID
ASQCPDIIPV LTVVAALSKG ETRIINGERL RIKECDRLNA ICTELNKLGA DIKELKDGLI
IKGVKELIGG EVYSHKDHRI AMSLAIASTR CKEEVIIKEP DCVKKSYPGF WEDFKSLGGI
LKGE