Gene SeD_A2890 details

Gene Information

Locus tag: SeD_A2890
Symbol: hisS
ID: 6872814
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 2782547
End bp: 2783821
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 56%
IMG OID: 642785935
Product: histidyl-tRNA synthetase
Protein accession: YP_002216585
Protein GI: 198241839
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0124] Histidyl-tRNA synthetase
TIGRFAM ID: [TIGR00442] histidyl-tRNA synthetase
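
The length fields above are consistent with the coordinates if Start bp and End bp are read as 1-based, inclusive positions, and if the final codon of the CDS is a stop codon that is not counted in the protein. A minimal sketch of that arithmetic follows. The inclusive-coordinate convention is an assumption, and the helper names are illustrative rather than part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the terminal stop codon


length = gene_length(2782547, 2783821)
print(length)                  # 1275
print(protein_length(length))  # 424
```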


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.112536
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 83
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGGCAAAAA ACATTCAAGC CATTCGCGGC ATGAACGATT ATCTGCCTGG CGAAACCGCC 
ATCTGGCAGC GCATTGAAGG CACACTCAAA AACGTGCTCG GCAGCTACGG TTACAGTGAA
ATCCGCTTGC CGATTGTAGA GCAGACCCCG TTATTCAAAC GTGCGATCGG TGAAGTTACT
GACGTGGTTG AAAAAGAGAT GTACACCTTT GAGGATCGCA ACGGCGATAG CCTGACATTG
CGCCCTGAAG GTACGGCGGG CTGTGTACGC GCCGGCATCG AACATGGTCT CCTGTACAAT
CAGGAACAGC GTTTGTGGTA TATCGGGCCG ATGTTCCGTC ATGAGCGTCC GCAAAAAGGC
CGCTACCGTC AGTTCCACCA GCTAGGTGCC GAAGTCTTTG GCTTACAAGG CCCGGATATC
GACGCGGAAC TGATTATGCT GACCGCACGC TGGTGGCGCG CGCTGGGCAT CTCTGAACAC
GTTAGCCTGG AGCTGAACTC TATTGGTTCC TTAGAGGCGC GTGCGAACTA TCGCGATGCG
CTGGTCGCGT TCCTCGAACA GCATCAAGAG ACGCTGGACG AAGACTGCAA ACGCCGTATG
TATACCAATC CGCTGCGCGT GCTGGATTCA AAAAATCCGG ACGTGCAGGC GCTGCTCAAC
GACGCGCCCG CTCTCGGCGA CTATCTCGAT GACGATTCAC GCGAGCACTT TGCCGGCCTG
TGTAAATTGC TGGACGCGGC GGGGATTGCC TACACCGTCA ACCAGCGTCT GGTACGCGGT
CTGGATTACT ACAACCGCAC CGTATTTGAA TGGGTAACAA ACAGTCTGGG GTCACAAGGC
ACCGTCTGTG CGGGTGGTCG TTATGACGGT CTGGTGGAAC AACTGGGCGG TCGCGCTACC
CCGGCAGTGG GCTTTGCGAT GGGCCTGGAA CGACTTGTTT TGTTAGTTCA GGCAGTTAAT
CCGGAATTTA TTGCCTCTCC TGTTGTCGAT ATATACCTGG TAGCTGCCGG CGCACAAACG
CAGTCTGCGG CAATGACGCT GGCGGAGCGG CTGCGCGATG AAATGCCAGG CGTGAAGCTA
ATGACAAACC ACGGCGGCGG CAACTTTAAG AAACAGTTTG CCCGCGCCGA TAAGTGGGGC
GCCCGTATTG CACTGGTTCT TGGCGAATCT GAAGTCGCCG ATGGGACTGT TGTAGTGAAG
GATTTGCGCT CCGGTGAGCA AACGGCAGTG GCGCAGGACA GCGTCGCCGC GCATTTGCGC
ACTTTATTGG GCTAA
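
The gene sequence is laid out in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. A GC-content figure like the one in the Gene Information section could be recomputed with a sketch such as the one below. The function names are hypothetical, and how the database rounds its reported percentage is not stated in this record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence: 8 of 20 bases are G or C.
print(gc_percent("GTGGCAAAAA ACATTCAAGC"))  # 40.0
```

Passing the full 1275 bp sequence to `gc_percent` gives the value to compare against the reported GC content.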
 
Protein sequence
MAKNIQAIRG MNDYLPGETA IWQRIEGTLK NVLGSYGYSE IRLPIVEQTP LFKRAIGEVT 
DVVEKEMYTF EDRNGDSLTL RPEGTAGCVR AGIEHGLLYN QEQRLWYIGP MFRHERPQKG
RYRQFHQLGA EVFGLQGPDI DAELIMLTAR WWRALGISEH VSLELNSIGS LEARANYRDA
LVAFLEQHQE TLDEDCKRRM YTNPLRVLDS KNPDVQALLN DAPALGDYLD DDSREHFAGL
CKLLDAAGIA YTVNQRLVRG LDYYNRTVFE WVTNSLGSQG TVCAGGRYDG LVEQLGGRAT
PAVGFAMGLE RLVLLVQAVN PEFIASPVVD IYLVAAGAQT QSAAMTLAER LRDEMPGVKL
MTNHGGGNFK KQFARADKWG ARIALVLGES EVADGTVVVK DLRSGEQTAV AQDSVAAHLR
TLLG
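
The gene sequence starts with GTG, yet the protein starts with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of several alternative start codons, and an initiating codon is translated as methionine. The sketch below illustrates this with a hand-built codon table. It is a simplified stand-in for a library translator, and the names `CODON_TABLE`, `START_CODONS` and `translate` are illustrative only.

```python
from itertools import product

# Codon-to-residue mapping in the usual T, C, A, G ordering.
# Table 11 assigns the same residues as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons recognised by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":       # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("GTGGCAAAAA ACATTCAAGC CATTCGCGGC"))  # MAKNIQAIRG
```

The same function applied to the last 15 bases of the gene (`ACTTTATTGG GCTAA`) yields `TLLG`, matching the end of the protein sequence.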