Gene CPR_1852 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1852 
SymbolpheS 
ID4206257 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2052873 
End bp2053892 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content32% 
IMG OID642566402 
Productphenylalanyl-tRNA synthetase subunit alpha 
Protein accessionYP_699166 
Protein GI110803656 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0016] Phenylalanyl-tRNA synthetase alpha subunit 
TIGRFAM ID[TIGR00468] phenylalanyl-tRNA synthetase, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAGATA AGTTAAATCA AATTAAAGAA TTAGCTTTAG TAGAAATTAA AAAAGCTAAA 
GATAGTACTA CTATTGATAC AATAAGAGTT AAATATCTTG GTAAAAAGGG AGAACTTACA
ACTATATTAA GAGGAATGGG ATCTCTATCT AAAGAGGAGA GACCAATAGT TGGTAAGTTA
GCTAATGAGG TAAGAGAGGT TTTAGAAGCT GAATTAGAGG CTGTAACAAA GGCTGTTAAA
GAAGCTGAAA AACAAGAAAA GCTTAAAAAT GAAGTAATAG ATATTTCAAT GCCTGGTAAA
AAACAAACAA TAGGAAAGAA ACATCCATTA GAGCAAACTT TAGATGAAAT GAAAAAAATA
TTTGTTTCAA TGGGATTTGC TATAGAAGAT GGTCCAGAGG TTGAGAAAGA TTACTATAAC
TTTGAAGCCT TAAACATTCC TAAGAATCAT CCAGCTAGAA GTGAGCAAGA TACATTCTAC
ATAAATGATA ATATAGTTTT AAGAACTCAA ACTTCTCCAG TTCAAGCTAG AGTAATGGAA
AAACAACAAC CACCAATAAA AATGATATCA CCTGGTAAGG TATTTAGATC AGATGCTGTT
GATGCTACGC ATTCACCAAT ATTCTACCAA ATGGAAGGTC TAGTTATAGA TAAAGATATA
ACTTTTGCAG ATCTTAAAGG AACTTTAGAA TTATTTGCTA AGAAAATGTT TGGTGATAAA
GTAAAAACTA AGTTTAGACC ACATCATTTC CCATTCACTG AGCCATCAGC TGAAATGGAT
GCTACATGCT TTGTATGTAA CGGAAAAGGA TGTAAAGTAT GTAAGGGAGA AGGTTGGATA
GAAATACTAG GTTGTGGTAT GGTTCACCCT CAAGTCTTAA GAAACTGTGG AATAGACCCA
GAAGTTTATA GTGGATTTGC TTTCGGCTTT GGTGTAGACA GAATGGTTAT GCTTAAGTAT
GGAATAGATG ACATAAGATT ATTATACGAA AGTGATATGA GATTCTTAAA TCAATTCTAG
 
Protein sequence
MQDKLNQIKE LALVEIKKAK DSTTIDTIRV KYLGKKGELT TILRGMGSLS KEERPIVGKL 
ANEVREVLEA ELEAVTKAVK EAEKQEKLKN EVIDISMPGK KQTIGKKHPL EQTLDEMKKI
FVSMGFAIED GPEVEKDYYN FEALNIPKNH PARSEQDTFY INDNIVLRTQ TSPVQARVME
KQQPPIKMIS PGKVFRSDAV DATHSPIFYQ MEGLVIDKDI TFADLKGTLE LFAKKMFGDK
VKTKFRPHHF PFTEPSAEMD ATCFVCNGKG CKVCKGEGWI EILGCGMVHP QVLRNCGIDP
EVYSGFAFGF GVDRMVMLKY GIDDIRLLYE SDMRFLNQF