Gene CPR_1899 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1899 
SymbolaspS 
ID4204102 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2096359 
End bp2098152 
Gene Length1794 bp 
Protein Length597 aa 
Translation table11 
GC content32% 
IMG OID642566449 
Productaspartyl-tRNA synthetase 
Protein accessionYP_699209 
Protein GI110801685 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0173] Aspartyl-tRNA synthetase 
TIGRFAM ID[TIGR00459] aspartyl-tRNA synthetase, bacterial type 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTGAAG CTTTAAATGG ATTAAAGCGT AACATAATGT GTGGCGACGC TAGAGAAAGC 
CATATTGGAC AAAAAGTAAC CGTAATGGGC TGGGTTCAAA GAAATAGAAA TCTTGGAGGT
CTTCAATTTA TAGACTTAAG AGATAGAGAA GGAATTTTAC AGGTTGTATT TAATGATGAT
TTAGGAGAAG AAATCCTAGA AAAAGCAAAA TCAATAAGAC CTGAATATTG TATTGCTGTA
ACAGGTGAAA TAGTTAAGAG AGAATCAGTA AATCCAAACA TGCCAACAGG TATGGTTGAG
TTAAAAGCTG AGGAATTAAA GATTTTATCT GAATCAGATA CTCCTCCAAT ATATATAAAA
GAAGATTTAG ATGCTGCTGA AAGTATTAGA TTAAAGTATA GATACTTAGA TTTAAGAAGA
CCAGATATGC AAAATATCTT CAAAATAAGA CATAAGACAA CTAAAGCAAT AAGAGATTAT
TTAGATCAAA ATGGTTTCTT AGAAATGGAG ACGCCAATAC TTACAAAGTC AACTCCAGAA
GGAGCTAGAG ATTATCTTGT TCCATCAAGA AACTATCCAG GAATGTTCTA TGCATTACCT
CAATCACCAC AATTATTTAA ACAATTATTA ATGGTATCAG GTTTTGATAG ATACTTCCAA
ATAGTTAAGT GTTTCAGAGA TGAGGACTTA AGAGCTAACA GACAACCAGA GTTTACTCAA
GTTGACTTAG AAATGTCATT TGTTGAGCAA GATGATGTTA TGGCTTTAAA TGAATGTTTA
ATAAAACATG TATTTAAAGA AGTTTTAGGA GTAGATGTAA AAACTCCAAT AAAGAGAATG
ACATTTAAAG ATGCTATGGA AAAATACGGT TCAGATAAAC CAGACTTAAG ATTTGGAATG
GAAATCACAA ACTTAAGTGA TGTTGTTAAA GAATGTGGAT TTAAAGTATT CACAGACGCT
GTAGCTAATG GTGGTTCTGT TAGAGGTTTA TGCTTAGAAG GCGGAGCTTC TATGGGAAGA
AAAGACATAG ATAGATTAGG AGAGTTCGTT AAAACTTTCA AAGCTAAAGG GTTAGCATGG
ATTCAATTAA AAGAAGAGGG TGTTAAATCA CCAATAGCTA AATTCTTTAG TGAAGAAGAG
CTAAACAAAA TAATTGAAAC TATGGGAGCT AAAACAGGAG ATTTAATCCT TATAGTTGCT
GATAAAAACT CAGTAGTTTT AAAAGCTTTA GGAGAATTAA GATTAGAACT TTCAAGAAAA
TTTGATCTAG TTAAAGATAA GAGTGAGTTT AACTTCACAT GGATAACAGA GTTTGATCTT
CTTGAGTACG ATGAAGAAGA AGGAAGATAC TTTGCAGCTC ACCATCCATT TACAATGCCA
ATGGATGAGG ATATAAAGTA TTTAGATACT GATCCAGGAA GAGTTAGAGC TAAGGCTTAT
GACTTAGTAT TAAATGGAGA AGAGTTAGGT GGAGGATCTA TAAGAATACA TGATACTAAA
CTTCAAGAAA AAATGTTTGA AGTATTAGGA TTTACTCAAG AATCAGCTTG GGAAAGATTT
GGATTCTTAT TAGAAGCATT TAAATTTGGA CCACCACCAC ACGGCGGATT AGCTTTCGGT
TTAGATAGAA TGATAATGTT CTTAGCAGGA ACTGAAAATA TCAAGGATGT TATAACATTC
CCTAAAAACC AAAATGCATT CTGTTATTTA ACTGAAGCAC CTAATATAGT AGATGAAGAA
CAATTAAAAG AATTAGGAAT TGAAACAATA AAGAAAGAAG ATACGGCAGA ATAA
 
Protein sequence
MGEALNGLKR NIMCGDARES HIGQKVTVMG WVQRNRNLGG LQFIDLRDRE GILQVVFNDD 
LGEEILEKAK SIRPEYCIAV TGEIVKRESV NPNMPTGMVE LKAEELKILS ESDTPPIYIK
EDLDAAESIR LKYRYLDLRR PDMQNIFKIR HKTTKAIRDY LDQNGFLEME TPILTKSTPE
GARDYLVPSR NYPGMFYALP QSPQLFKQLL MVSGFDRYFQ IVKCFRDEDL RANRQPEFTQ
VDLEMSFVEQ DDVMALNECL IKHVFKEVLG VDVKTPIKRM TFKDAMEKYG SDKPDLRFGM
EITNLSDVVK ECGFKVFTDA VANGGSVRGL CLEGGASMGR KDIDRLGEFV KTFKAKGLAW
IQLKEEGVKS PIAKFFSEEE LNKIIETMGA KTGDLILIVA DKNSVVLKAL GELRLELSRK
FDLVKDKSEF NFTWITEFDL LEYDEEEGRY FAAHHPFTMP MDEDIKYLDT DPGRVRAKAY
DLVLNGEELG GGSIRIHDTK LQEKMFEVLG FTQESAWERF GFLLEAFKFG PPPHGGLAFG
LDRMIMFLAG TENIKDVITF PKNQNAFCYL TEAPNIVDEE QLKELGIETI KKEDTAE