Gene CPF_2189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2189 
SymbolhisS 
ID4201377 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2429144 
End bp2430391 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content32% 
IMG OID638083054 
Producthistidyl-tRNA synthetase 
Protein accessionYP_696613 
Protein GI110798964 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGATAC AAGCTCAAAA GGGTACAAAG GATATGTTAC CTAATGACGC TTATAAATGG 
CATTATATAG AAGAAAAGTT AAGAAAAATA TCAGCTGAAT ATGGAATTAG AGAGATCAGA
ACTCCTATGT TTGAGGCAAC TGAACTTTTC AAAAGAGGAG TTGGAGAAAC TACTGACGTG
GTTCAAAAGG AAATGTATAC TTTTGAAGAT AAAGGTGGAA GAAGTATAAC TCTTAAACCA
GAGGGGACAG CTCCAGCTGT TAGAGCATTT ATTGAAAATA GTTTATATGC TGATGCTCAG
CCAACAAAAA TGTTCTATTT TACTCCATGC TTTAGATATG AAAAAATGCA AAAAGGAAGA
TTAAGAGAAT TCCATCAATA TGGAATAGAA GTTTTTGGTT CACAAGAAGC TTCTATTGAT
GCAGAAATCT TATCTTTAGT TATGAGAGCA TTAACAGAGG ATTTTGGAAT AAAAGGATTA
AGCTTAAATA TAAACAGTTT AGGATGTCCA AAATGTAGAG CAAAATTCAA TGAAGCTTTA
AAACAATACT TAAAAGAGAA CTATGATAAT CTTTGTGAAA CTTGTAAAAC AAGATTTGAA
AAGAATCCTA TGAGAATCAT AGACTGTAAA GAAAAGAGAT GTAAGGAAAT AGTTAAGGAA
GCTCCTTCAA TACTAGATTA CATCTGCGAA GAGTGTAGTG ATCACTTTAG CAAGTTAAAA
GCTTACTTAG ATGTTATGGG AATAGAGTAT AACATAGATC CACAAATAGT AAGAGGATTA
GATTACTATA GTAAAACTGT TTTTGAAGTT ATAAAAGATG GATTAACAGT TTGTGGTGGA
GGAAGATATG ACTACTTAGT AGAAGAAGTA GACGGTCCTA AAACTCCAGC TATGGGATTT
GGATTAGGTT TAGAAAGACT TCTTTTAATA TTAGATGAGG AAGGAATAGA AATTCCTGAA
CCTGTTAGAT GCGAAGTTTA TATTGGCTCA ATGGGAGACA ATGCTAAGCT TGAAGCTATG
AAATTAGCAT TTAATCTTAG AAAAGCTGGC ATAAAGGCTG AAATAGATCA CTTAGGAAAG
AGTGTTAAGG CTCAAATGAA ATATGCTAAT AAAATAGGAG CTAAATATAC TTTTGTTATA
GGTGACTCTG AAATAGAAGA AAACAAAATT AAAATTAAGA GAATGAGCGA TGGAGAACAA
TTCGAAGTCA GCTTAGATAT AAATGAAATA GTAAATATAG TTAAGTAG
 
Protein sequence
MAIQAQKGTK DMLPNDAYKW HYIEEKLRKI SAEYGIREIR TPMFEATELF KRGVGETTDV 
VQKEMYTFED KGGRSITLKP EGTAPAVRAF IENSLYADAQ PTKMFYFTPC FRYEKMQKGR
LREFHQYGIE VFGSQEASID AEILSLVMRA LTEDFGIKGL SLNINSLGCP KCRAKFNEAL
KQYLKENYDN LCETCKTRFE KNPMRIIDCK EKRCKEIVKE APSILDYICE ECSDHFSKLK
AYLDVMGIEY NIDPQIVRGL DYYSKTVFEV IKDGLTVCGG GRYDYLVEEV DGPKTPAMGF
GLGLERLLLI LDEEGIEIPE PVRCEVYIGS MGDNAKLEAM KLAFNLRKAG IKAEIDHLGK
SVKAQMKYAN KIGAKYTFVI GDSEIEENKI KIKRMSDGEQ FEVSLDINEI VNIVK