Gene CPF_2738 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2738 
SymbolproS 
ID4203146 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2998432 
End bp3000144 
Gene Length1713 bp 
Protein Length570 aa 
Translation table11 
GC content33% 
IMG OID638083604 
Productprolyl-tRNA synthetase 
Protein accessionYP_697117 
Protein GI110799310 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0442] Prolyl-tRNA synthetase 
TIGRFAM ID[TIGR00409] prolyl-tRNA synthetase, family II 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATGT CAAATATGTT AGTAGGAACT TTAAGAGAAG TTCCAGCTGA AGCAGAAATA 
GAAAGTCACA AGCTTATGCT TAGAGCAGGT CTTATGAGAA AGATGGCTGC AGGGATTTAT
AACTATATGC CTTTAGGATT AAAGGTTATA GAAAATGTTA AAAATATAGT AAGAGAAGAA
ATGAATAATG CAGGTGCTCA AGAATTCTTA GCATCAGCTT TAATACCAGC TGAGTTATGG
CAAGAATCAG GAAGATGGGA TGCTTATGGA GCAGAAATGT TTAGATTAAA AGATAGACAT
AACAGAGATT TTTGCTTAGG ACCAACTCAC GAAGAGGTAT TTACTGATAT AGTTAGAAAT
GAAATAAAGT CATATAAGCA ATTACCATTA AATCTTTATC AAATACAAAC TAAGTATAGA
GATGAAAGAA GACCAAGATT TGGAGTTATG AGATCAAGAG AATTCATAAT GAAAGATGGA
TATAGCTTTG ACAAAGATCA AGAAGGATTA GATTTAGCAT ATGAAAAAAT GAGAAAAGCA
TATGTTAATA TATTCAATAG ATGTGGATTA GATGCTAAGG CAGTTGCAGC TGATTCAGGA
GCTATAGGTG GATCAGGTTC AGCTGAGTTT ATGGTTAAAT CAGAAGTTGG AGAAGATGAT
GTAGTATTCT GTACAGCTTG TGATTATGCA GCTAACATAG AAAAAGCTCC ATCAACACCA
GAACATGCAG AAAAAGAAGA ATTAATGGAA GTAGAAAAAG TTGAAACTCC AGCTGTTAAA
TCAATTGAAG ATTTAGCAAA ATTCTTTGAA TGCTCACCAA AGAAAATAGC AAAAACTTTA
ATATTCCAAG CTGATGATAA AGTGGTTGCT GTTGTATTAA GAGGAGATAG AGAAGCTAAC
GAAGTTAAGA TAGCTAATGC TATTGGAGAA GTTATAGAAT TAGAAATGGC AAGTGAAGAG
GCTGTTAAAG AAGCTACTGG CGCAGCTGTT GGATTTGCAG GTCCTATGGG AATAAAAGTA
GATATGTTAT TAGTTGACCA AGAAGTAGCT AATATGTATA ACTTCATAAT TGGTGCTAAT
GAAACTGATA TGCACTTAAA AAATGTAAAC TATGGAAGAG ACTTTGAAGG AATAGTTGGT
GACTTTAGAA ATGTTACTAT AGGAGAAAAA TGTCCTGAGT GTGGAAAAGA AATAACTATT
TCAAGAGGTA CTGAGGTTGG ACATATATTC AAACTTGGAA CTAAGTATTC AGAGTCTATG
GGTGCAACAT TTATTGATGA AGATGGAAAA GCTAAACCAT TTATAATGGG ATGCTATGGA
ATAGGGGTTA CAAGAACTGT AGCTTCAATA ATAGAGCAAC ACAATGACGA AAACGGAATA
ATATGGCCAT TAGAAGTAGC TCCATACCAT GTATCAGTTA TACCAGCTAA TGTTAAAAAT
GAAGAACAAG CAACTAAAGC TGAAGAAATA TACAATGAAT TAAGAAAAAT GGGAGTTGAA
GCTCTACTTG ATGATAGAAA AGAAAGAGCA GGAGTTAAAT TCAAAGATTC TGAATTAATG
GGAATTCCAA TGAGAATAAC TGTTGGAAAG ATGATTGGTG AAGGTCAAGT TGAATTTAAA
CTTAGAAACG GTGGAGAAGT TGAGACTTTA TCTATAGAAG AAGTTTATAA TAGAGTAAGA
GAAGAATTTG AAAGAGCAAA TTTATCTTTA TAA
 
Protein sequence
MKMSNMLVGT LREVPAEAEI ESHKLMLRAG LMRKMAAGIY NYMPLGLKVI ENVKNIVREE 
MNNAGAQEFL ASALIPAELW QESGRWDAYG AEMFRLKDRH NRDFCLGPTH EEVFTDIVRN
EIKSYKQLPL NLYQIQTKYR DERRPRFGVM RSREFIMKDG YSFDKDQEGL DLAYEKMRKA
YVNIFNRCGL DAKAVAADSG AIGGSGSAEF MVKSEVGEDD VVFCTACDYA ANIEKAPSTP
EHAEKEELME VEKVETPAVK SIEDLAKFFE CSPKKIAKTL IFQADDKVVA VVLRGDREAN
EVKIANAIGE VIELEMASEE AVKEATGAAV GFAGPMGIKV DMLLVDQEVA NMYNFIIGAN
ETDMHLKNVN YGRDFEGIVG DFRNVTIGEK CPECGKEITI SRGTEVGHIF KLGTKYSESM
GATFIDEDGK AKPFIMGCYG IGVTRTVASI IEQHNDENGI IWPLEVAPYH VSVIPANVKN
EEQATKAEEI YNELRKMGVE ALLDDRKERA GVKFKDSELM GIPMRITVGK MIGEGQVEFK
LRNGGEVETL SIEEVYNRVR EEFERANLSL