Gene Shewmr4_1229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_1229 
SymbolhisS 
ID4251285 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp1429150 
End bp1430427 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content49% 
IMG OID638117814 
Producthistidyl-tRNA synthetase 
Protein accessionYP_733366 
Protein GI113969573 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.322745 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCAAAAC AGATCCAAGC GATTCGCGGA ATGAACGACA TTCTGCCAAC TCAAAGTCCT 
CTATGGCAAA AAGTGGAAGC GGTACTACGT GCGAGCGTGG CTGCTTATGG TTACAGTGAG
ATCCGCACTC CAATAGTGGA AAATACCGAT CTTTTTAAAC GCTCTATCGG TGAAGTGACT
GATATTGTTG AAAAAGAAAT GTATACCTTC GAGGACCGTA ACGGTGACAG TTTAACCCTA
CGTCCTGAAG GCACCGCCTC AACCGTGCGT GCGGGTAATG AGCATGGTCT GCTCTATAAC
CAAGAACAAC GCCTGTGGTA CATGGGCCCA ATGTTCCGCC ACGAACGTCC GCAAAAAGGT
CGTTATCGCC AATTCCACCA GTTTGGTGTG GAAGTCTACG GCATTGGTAG CGCCGATATC
GACGCCGAAG TGTTGATGTT ATCGGCCCGT CTATGGGAAA AACTCGGCAT TAGCGAGCAT
GTGACGCTTG AGCTGAACAC CTTAGGTGAC CCTGCTGAGC GCGCCGCTTA CCGTGAAGCC
TTGATTGCCT TCTTAGAGCA ACACAAAGAC AAATTAGATG AAGATAGCCA GCGCCGTATG
TATAGCAATC CGCTGCGGGT ATTGGACTCA AAAGATCCCC AAGTGCAAAG CATTTTAGGT
GATGCGCCTG CGCTGATGGA TTACTTAGGT GAAGAATCTT CACAACATTT TGCACAATTG
CGTGAACTCC TCGACGCGGT TGGCATCCAA TACCGAGTTA ACCCCCGCTT AGTCCGTGGT
TTAGATTACT ACAATCGCAC GGTTTTTGAA TGGGTTACCA ATAGCTTAGG CTCTCAAGGT
ACTGTCCTTG CGGGCGGTCG TTACGATGGT TTAGTCGCTC AATTGGGCGG TAAAGATACG
CCAGCCGTCG GTTTTGCGAT GGGATTAGAG CGCATCGTTC TATTGCTTGA AACCTTAGAA
CTGACCCAAG ATATCCCTGC GGCTGTCGAT GTGTATGTCG CAGCGATGGG GGATAGCTGT
TTGGTTGAAG CCATAAAAGT GGCACAGGAG TTACGCTCAA CCTTACCGAC ACTGCGTGTC
ATGAGCCACT GCGGCGGCGG TAACTTCAAG AAGCAAATTA AGCGTGCCGA TAAAAGCGGC
GCACAAGTGG CCTTACTCAT CGGTGAAGAA GAACTCGCCG AAGGTGTGGT TACCGTTAAA
TATTTACGCA ATGACAACGA ACAACAACGG GTCGCTCGAA ATGCACTAAG CGCATTTTTA
GCTGAACTTA CCAAATAA
 
Protein sequence
MAKQIQAIRG MNDILPTQSP LWQKVEAVLR ASVAAYGYSE IRTPIVENTD LFKRSIGEVT 
DIVEKEMYTF EDRNGDSLTL RPEGTASTVR AGNEHGLLYN QEQRLWYMGP MFRHERPQKG
RYRQFHQFGV EVYGIGSADI DAEVLMLSAR LWEKLGISEH VTLELNTLGD PAERAAYREA
LIAFLEQHKD KLDEDSQRRM YSNPLRVLDS KDPQVQSILG DAPALMDYLG EESSQHFAQL
RELLDAVGIQ YRVNPRLVRG LDYYNRTVFE WVTNSLGSQG TVLAGGRYDG LVAQLGGKDT
PAVGFAMGLE RIVLLLETLE LTQDIPAAVD VYVAAMGDSC LVEAIKVAQE LRSTLPTLRV
MSHCGGGNFK KQIKRADKSG AQVALLIGEE ELAEGVVTVK YLRNDNEQQR VARNALSAFL
AELTK