Gene YpAngola_A0417 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A0417 
SymbolhisS 
ID5798881 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp432113 
End bp433387 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content51% 
IMG OID641338424 
Producthistidyl-tRNA synthetase 
Protein accessionYP_001605023 
Protein GI162420576 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.00143012 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGCAAAGA ACATTCAAGC CATCCGTGGT ATGAACGATT ACCTGCCAGC CGATACGGCA 
ATATGGCAGC GTATCGAAAG CATCTTGAAG CAAGTGCTAA GCGGTTACGG TTACAGTGAA
ATCCGTATGC CGATTGTAGA GCAGACCCCG TTATTCAAAC GCGCTATCGG TGAAGTGACC
GATGTGGTTG AAAAAGAGAT GTACACCTTT GATGATCGCA ATGGCGAAAG TCTGACACTG
CGCCCGGAAG GGACTGCAGG CTGTGTGCGC GCGGGTATTG AACATGGTCT GCTGTACAAT
CAGGAACAGC GTCTGTGGTA CATCGGTCCG ATGTTCCGCT ATGAACGTCC GCAAAAAGGT
CGTTATCGTC AGTTCCATCA GTTGGGGGCC GAAGTCTTTG GTCTGCCAGG GCCAGATATT
GATGCCGAGC TGATTTTACT GACAGCCCGC TGGTGGCGTG CATTAGGTAT CTTTGAACAT
GTGAAGTTGG AGCTGAACTC TATTGGTTCG TTGGCTGCCC GTGCTGACTA TCGAGAAGCG
CTGGTGGCGT TCCTGGAGCA ACATGTTGAG GTATTGGACG AAGATTGTAA GCGCCGTATG
TACAGCAATC CGTTGCGGGT GTTAGATTCC AAAAACCCTG ATGTTCAGCA GTTGCTTGAT
GATGCGCCAA AACTGTCTGA TTATCTGGAT GAAGAGTCAA AACAACATTT TGCCGGGTTG
TGTGAACTTT TAGATAAGGC CAGCATCCCA TATACCGTTA ATGAGCGTTT AGTTCGTGGT
TTAGATTATT ACAACCGTAC GGTATTTGAG TGGGTAACGC ACAGCCTGGG CGCACAAGGC
ACAGTCTGTG CGGGTGGGCG TTATGATGGG TTGGTCGAGC AACTCGGTGG TCGTGCAACA
CCGGCCGTGG GTTTTGCGAT GGGCCTTGAG CGCTTGGTTC TGCTGGTGCA GGCTGTTAAT
GCTGATTTCC AGGTGCCTGC GACAGTTGAT GCTTATGTTA TCTCCTCCGG TGAGGGGGCG
CAAAGTGCTG CGATGCTACT TGCTGAGAGC CTACGTGATG CCTTACCAAC GCTAAAAATA
ATGACCAATT ACGGTGGCGG TAATGTTAAG AAACAATTTA CACGCGCTGA TAAGTGGGGC
GCTCGTGTCG CTTTAATGCT GGGTGAAAGC GAAGTGGCCG CGCAGCAAGT CGTTGTAAAA
GATCTGCGAA ATGGTGAACA AGAAACGCTG GCGCAAGCGG ATGTTGCTGC GCGTCTGGCT
TTGATGTTGG GTTAA
 
Protein sequence
MAKNIQAIRG MNDYLPADTA IWQRIESILK QVLSGYGYSE IRMPIVEQTP LFKRAIGEVT 
DVVEKEMYTF DDRNGESLTL RPEGTAGCVR AGIEHGLLYN QEQRLWYIGP MFRYERPQKG
RYRQFHQLGA EVFGLPGPDI DAELILLTAR WWRALGIFEH VKLELNSIGS LAARADYREA
LVAFLEQHVE VLDEDCKRRM YSNPLRVLDS KNPDVQQLLD DAPKLSDYLD EESKQHFAGL
CELLDKASIP YTVNERLVRG LDYYNRTVFE WVTHSLGAQG TVCAGGRYDG LVEQLGGRAT
PAVGFAMGLE RLVLLVQAVN ADFQVPATVD AYVISSGEGA QSAAMLLAES LRDALPTLKI
MTNYGGGNVK KQFTRADKWG ARVALMLGES EVAAQQVVVK DLRNGEQETL AQADVAARLA
LMLG