Gene SNSL254_A2716 details

Gene Information

Locus tag: SNSL254_A2716
Symbol: hisS
ID: 6485081
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 2654914
End bp: 2656188
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 56%
IMG OID: 642738046
Product: histidyl-tRNA synthetase
Protein accession: YP_002041780
Protein GI: 194443808
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0124] Histidyl-tRNA synthetase
TIGRFAM ID: [TIGR00442] histidyl-tRNA synthetase
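
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that the record does not state, though it agrees with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2654914
end_bp = 2656188

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # compare with the Gene Length field
print(protein_length, "aa")   # compare with the Protein Length field
```

Both values agree with the Gene Length (1275 bp) and Protein Length (424 aa) fields.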


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 75
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCAAAAA ACATTCAAGC CATTCGCGGC ATGAACGATT ATCTGCCTGG CGAAACCGCC 
ATCTGGCAGC GCATTGAAGG CACACTCAAA AACGTGCTCG GCAGCTACGG TTACAGTGAA
ATCCGCTTGC CGATTGTAGA GCAGACCCCG TTATTCAAAC GCGCGATCGG TGAAGTTACT
GACGTGGTTG AAAAAGAGAT GTACACCTTT GAGGATCGCA ACGGCGATAG CCTGACATTG
CGCCCTGAAG GTACGGCGGG CTGTGTACGC GCCGGCATCG AACATGGTCT CCTGTACAAT
CAGGAACAGC GTTTGTGGTA TATCGGGCCG ATGTTCCGTC ATGAGCGTCC GCAAAAAGGC
CGCTACCGTC AGTTCCACCA GCTAGGTGCC GAAGTCTTTG GCTTACAAGG CCCGGATATC
GACGCGGAAC TGATTATGCT GACCGCACGC TGGTGGCGCG CGCTGGGCAT CTCTGAACAC
GTTAGCCTGG AGCTGAACTC TATTGGTTCC TTAGAGGCGC GTGCGAACTA TCGCGATGCG
CTGGTCGCGT TCCTCGAACA GCATCAAGAG ACGTTGGACG AAGACTGCAA ACGCCGTATG
TATACCAATC CGCTGCGCGT GCTGGATTCA AAAAATCCGG ACGTGCAGGC GCTGCTCAAC
GACGCGCCCG CTCTCGGCGA CTATCTCGAT GACGATTCAC GCGAGCACTT TGCCGGCCTG
TGTAAATTGC TGGACGCGGC GGGAATTGCC TACACCGTCA ACCAGCGGCT GGTACGCGGT
CTGGACTACT ACAACCGCAC CGTATTTGAA TGGGTAACAA ACAGTCTGGG GTCACAAGGC
ACCGTCTGTG CGGGTGGTCG TTATGACGGT CTGGTGGAAC AATTGGGCGG TCGCGCTACC
CCGGCAGTGG GCTTTGCGAT GGGCCTGGAA CGACTTGTTT TGTTAGTTCA GGCAGTTAAT
CCGGAATTTA TTGCCTCTCC TGTTGTCGAT ATATACCTGG TAGCTGCCGG CGCACAAACG
CAGTCTGCGG CAATGACGCT GGCGGAGCGG CTGCGCGATG AAATGCCAGG CGTGAAGCTA
ATGACAAACC ACGGCGGCGG CAACTTTAAG AAACAGTTTG CCCGCGCCGA TAAGTGGGGC
GCCCGTATTG CACTGGTTCT TGGCGAATCT GAAGTCGCCG ATGGGACTGT TGTAGTGAAG
GATTTGCGCT CCGGTGAGCA AACGGCAGTG GCGCAGGACA GCGTTGCCGC GCATTTGCGC
ACTTTATTGG GCTAA
 
Protein sequence
MAKNIQAIRG MNDYLPGETA IWQRIEGTLK NVLGSYGYSE IRLPIVEQTP LFKRAIGEVT 
DVVEKEMYTF EDRNGDSLTL RPEGTAGCVR AGIEHGLLYN QEQRLWYIGP MFRHERPQKG
RYRQFHQLGA EVFGLQGPDI DAELIMLTAR WWRALGISEH VSLELNSIGS LEARANYRDA
LVAFLEQHQE TLDEDCKRRM YTNPLRVLDS KNPDVQALLN DAPALGDYLD DDSREHFAGL
CKLLDAAGIA YTVNQRLVRG LDYYNRTVFE WVTNSLGSQG TVCAGGRYDG LVEQLGGRAT
PAVGFAMGLE RLVLLVQAVN PEFIASPVVD IYLVAAGAQT QSAAMTLAER LRDEMPGVKL
MTNHGGGNFK KQFARADKWG ARIALVLGES EVADGTVVVK DLRSGEQTAV AQDSVAAHLR
TLLG
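
The gene sequence opens with GTG, yet the protein sequence opens with M. In bacteria, an alternative start codon such as GTG is read as methionine when it initiates translation. The sketch below illustrates this on the first 30 and last 15 bases of the gene sequence. The codon table is deliberately partial (only the codons these two fragments need), and the helper names `clean` and `translate` are hypothetical, not part of any database tool.

```python
# Partial standard codon table: only the codons used by the two fragments below.
CODONS = {
    "GTG": "V", "GCA": "A", "AAA": "K", "AAC": "N", "ATT": "I",
    "CAA": "Q", "GCC": "A", "CGC": "R", "GGC": "G",
    "ACT": "T", "TTA": "L", "TTG": "L", "TAA": "*",
}

def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()

def translate(dna, is_start=False):
    """Translate an in-frame fragment; '*' marks a stop codon.

    If is_start is True, the first codon is reported as M, because an
    initiating GTG is read as methionine rather than valine.
    """
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    residues = [CODONS[dna[i:i + 3]] for i in range(0, usable, 3)]
    if is_start and residues:
        residues[0] = "M"
    return "".join(residues)

first_30 = "GTGGCAAAAA ACATTCAAGC CATTCGCGGC"   # start of the gene sequence
last_15 = "ACTTTATTGG GCTAA"                    # final line of the gene sequence

n_term = translate(first_30, is_start=True)
c_term = translate(last_15)
print(n_term)
print(c_term)
```

The two results can be compared with the first ten and last four residues of the protein sequence; the trailing `*` corresponds to the TAA stop codon. For the full sequence, a complete codon table (for example from an established bioinformatics library) would be needed in place of the partial one.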