Gene NSE_0781 details


Gene Information

Locus tag: NSE_0781
Symbol: hisS
ID: 3931975
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Neorickettsia sennetsu str. Miyayama
Kingdom: Bacteria
Replicon accession: NC_007798
Strand:
Start bp: 693305
End bp: 694546
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 40%
IMG OID: 637900937
Product: histidyl-tRNA synthetase
Protein accession: YP_506656
Protein GI: 88608104
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0124] Histidyl-tRNA synthetase
TIGRFAM ID: [TIGR00442] histidyl-tRNA synthetase
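
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the CDS ends in a stop codon. The function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Amino acids encoded by a CDS, assuming its final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(693305, 694546)
print(length)                  # 1242
print(protein_length(length))  # 413
```

Both values agree with the Gene Length and Protein Length fields of this record.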


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTATAA AAATCAATAA TGTCAAAGGG ACTAGGGATC TGTTTGGTGA GCAGTTAGAA 
AAGATGCGTC TCATTGAGCA GGTAGCCAAG AATCTTTCGA TTCGGTATTT GTTTACTGAG
CTTGAGACTC CGATAATTGA GCATACAGAG CTTTTTATTA GGAACCTTGG TGAAACGTCA
GACGTAGTAA ATAAAGAGAT CTATTCGTTT CAGGACAAAA GTGGTCACAA TATTTGTCTA
AGACCAGAAT TTACCGCTGC TGTCACTAGA GCATTCGTGG AGAATTTTCA GCATATTCAA
TCACCTGTTC GGTTATTTTC TTTTGGTCCG CTATTCAGAT ATGAAAGACC ACAAAAAGGG
AGATATAGGC AATTTCATCA GGTGAATTTT GAATGGATCG GAGCAAAGCA TTATCTTTGG
GCTGTTGAAG CTATAGTTTT AGCAAAGTCG TTCCTTAAAG AAATTGGAAT AAGGTGTGAA
ATACGTGTTA ATTCACTTGG TTGTTCTAGA ACTCGTGAAG AGTATAAACT TGCACTCATC
AACTATTTTC AACAGTACAA AGAGCACCTT TCAGCTGATA GTTTGCTCAG ATTGAAAAAG
AATCCGTTGA GAATATTAGA CTCGAAGGAT CCATCTGAGA AGGAAATTGT GGTGGGCGCG
CCAAGAATTC TGGATTACCA TACTGATGAT GCTCTAAAGG AATTTGAATC AATTTGTGAT
ATACTGAAGC TCCTCGATAT TGAGTTTTCT GTAGATCATA GGTTGGTCAG AGGATTAGAT
TATTATTCTG GTTTAATTTT CGAATTTACT AGTCCTGATC TCGGTGCGCA GGATGCCCTC
TTGGGAGGTG GAGCATATGA GCAACTTTCA GAGAATTTGG GCGGAAAAAA AGTACAATCA
ATTGGGTTTG CTGCGGGGAT TGAGCGTTTA ATCGATATAA TGCCAGTTTT GGCACCTACG
AGTGATAAGA TCGTTTCGAT TGTTCCCATC GGGGAAATTG CAGAAAGGGA GGCGCTAAAA
CTACTGTTTT ACCTGCGCAG TGAAGGATTA TGCGCCGATA TGTGCTATGG GCTCAGTGTT
AAGTCGAGAA TGAAACGTGC TGAAAGAAGC ACAGTTACAG TCATTCTTGG TGAGGAAGAA
TTTTCAAGGG GTGAGTCGAC CGTAAGAATA ATGGAGACTG GTCAACAAAT GACTGTTGCG
CACGAAAAAC TCCTATCAAC ATTGAGGGAA TTGCTCTGTT GA
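
The gene sequence is listed in space-separated blocks of 10 bases, so it needs cleaning before any computation. The following minimal sketch shows one way to do that and to compute a GC fraction like the one reported in the GC content field. The helper names `clean_sequence` and `gc_content` are hypothetical, and the example runs on a short fragment only, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence listing."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the listing above, used purely as an illustration.
fragment = clean_sequence("ATGAGTATAA AAATCAATAA")
print(len(fragment))
print(round(gc_content(fragment) * 100), "% GC in this fragment")
```

Passing the full listing through `clean_sequence` first lets the same function be applied to the whole gene.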
 
Protein sequence
MSIKINNVKG TRDLFGEQLE KMRLIEQVAK NLSIRYLFTE LETPIIEHTE LFIRNLGETS 
DVVNKEIYSF QDKSGHNICL RPEFTAAVTR AFVENFQHIQ SPVRLFSFGP LFRYERPQKG
RYRQFHQVNF EWIGAKHYLW AVEAIVLAKS FLKEIGIRCE IRVNSLGCSR TREEYKLALI
NYFQQYKEHL SADSLLRLKK NPLRILDSKD PSEKEIVVGA PRILDYHTDD ALKEFESICD
ILKLLDIEFS VDHRLVRGLD YYSGLIFEFT SPDLGAQDAL LGGGAYEQLS ENLGGKKVQS
IGFAAGIERL IDIMPVLAPT SDKIVSIVPI GEIAEREALK LLFYLRSEGL CADMCYGLSV
KSRMKRAERS TVTVILGEEE FSRGESTVRI METGQQMTVA HEKLLSTLRE LLC
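
The protein sequence corresponds to the gene sequence read codon by codon. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which start codons are permitted, so a standard codon table is enough to illustrate the mapping. The sketch below is a hedged illustration with hypothetical names (`CODON_TABLE`, `translate`); it stops at the first stop codon and ignores any incomplete trailing codon.

```python
import itertools

# Standard codon table, built in TCAG order; table 11 uses the same
# codon-to-amino-acid assignments.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(_BASES, repeat=3), _AMINO)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence up to, and excluding, the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, with the block spacing removed.
print(translate("ATGAGTATAAAAATCAATAATGTCAAAGGG"))  # MSIKINNVKG
```

The output matches the first 10 residues of the protein sequence listed above.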