Gene CPR_1900 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1900 
SymbolhisS 
ID4205504 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2098173 
End bp2099420 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content32% 
IMG OID642566450 
Producthistidyl-tRNA synthetase 
Protein accessionYP_699210 
Protein GI110802627 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGATAC AAGCTCAAAA GGGTACAAAG GATATGTTAC CTAATGACGC TTATAAATGG 
CATTATATAG AAGAAAAGTT AAGAAAAATA TCAGCTGAAT ATGGAATTAG AGAGATTAGA
ACTCCTATGT TTGAGGCAAC TGAACTTTTC AAAAGAGGAG TTGGAGAAAC TACTGACGTG
GTTCAAAAGG AAATGTATAC TTTTGAAGAC AAAGGTGGAA GAAGCATAAC TCTTAAACCA
GAGGGGACAG CTCCAGCTGT TAGAGCATTT ATTGAAAATA GTTTATATGC TGATGCTCAG
CCAACAAAAA TGTTCTATTT TACTCCATGC TTTAGATATG AAAAAATGCA AAAAGGAAGA
TTAAGAGAAT TCCATCAATA TGGAATAGAA GTTTTTGGTT CACAAGAAGC TTCTATTGAT
GCAGAAATCT TATCTTTAGT TATGAGAGCA TTAACAGAGG ATTTTGGAAT AAAAGGATTA
AGCTTAAATA TAAACAGTTT AGGATGTCCA AAATGTAGAG CAAAATTCAA TGAAGCTTTA
AAACAATATT TAAAAGAGAA CTATGATAAT CTTTGTGAAA CTTGTAAAAC AAGATTTGAA
AAGAATCCTA TGAGAATCAT AGACTGTAAA GAAAAGAGAT GTAAGGAAAT AGTTAAGGAA
GCTCCTTCAA TACTAGATTA CATCTGCGAA GAGTGCAGTG ATCACTTTAG CAAGTTAAAA
GCTTACTTAG ATGTTATGGG AATAGAATAT AACATAGATC CACAAATAGT AAGAGGATTA
GATTACTATA GTAAAACTGT TTTTGAAGTT ATAAAAGATG GATTAACAGT TTGTGGTGGA
GGAAGATATG ATTATCTAGT AGAAGAAGTA GATGGTCCTA AAACTCCAGC TATGGGATTT
GGATTAGGTT TAGAAAGACT TCTTTTAATA TTAGATGAAG AAGGAATAGA AATTCCTGAG
CCTGTTAGAT GCGAAGTTTA TATTGGATCA ATGGGAGATA GGGCTAAGCT TGAAGCTATG
AAATTAGCAT TTAATCTTAG AAAATCTGGT ATTAAGGCTG AAATAGATCA CTTAGGAAAG
AGTGTTAAGG CTCAAATGAA GTATGCTAAT AAAATAGGAG CTAAATATAC TTTTGTTATA
GGTGACTCTG AAATAGAAGA AAACAAAATT AAAATTAAGA GAATGAGCGA TGGAGAACAA
TTCGAAGTCA GCTTAGATAT AAATGAAATA GTAAATATAG TTAAGTAG
 
Protein sequence
MAIQAQKGTK DMLPNDAYKW HYIEEKLRKI SAEYGIREIR TPMFEATELF KRGVGETTDV 
VQKEMYTFED KGGRSITLKP EGTAPAVRAF IENSLYADAQ PTKMFYFTPC FRYEKMQKGR
LREFHQYGIE VFGSQEASID AEILSLVMRA LTEDFGIKGL SLNINSLGCP KCRAKFNEAL
KQYLKENYDN LCETCKTRFE KNPMRIIDCK EKRCKEIVKE APSILDYICE ECSDHFSKLK
AYLDVMGIEY NIDPQIVRGL DYYSKTVFEV IKDGLTVCGG GRYDYLVEEV DGPKTPAMGF
GLGLERLLLI LDEEGIEIPE PVRCEVYIGS MGDRAKLEAM KLAFNLRKSG IKAEIDHLGK
SVKAQMKYAN KIGAKYTFVI GDSEIEENKI KIKRMSDGEQ FEVSLDINEI VNIVK