Gene Information |
Locus tag | SbBS512_E2889 |
Symbol | hisS |
ID | 6270687 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 2688747 |
End bp | 2690021 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641726832 |
Product | histidyl-tRNA synthetase |
Protein accession | YP_001881305 |
Protein GI | 187733422 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0124] Histidyl-tRNA synthetase |
TIGRFAM ID | [TIGR00442] histidyl-tRNA synthetase |
|
|
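The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the stop codon; the function names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information table above.
START_BP = 2688747
END_BP = 2690021


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1275 424
```

Under these assumptions the computed values agree with the Gene Length (1275 bp) and Protein Length (424 aa) fields.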
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 0.731565 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCAAAAA ACATTCAAGC CATTCGCGGC ATGAACGATT ACCTGCCTGG CGAAACGGCC ATCTGGCAGC GCATTGAAGG CACACTGAAA AACGTGCTCG GCAGCTACGG TTACAGTGAA ATCCGCTTGC CGATTGTAGA GCAGACCCCG CTATTCAAAC GTGCGATTGG TGAAGTCACC GACGTGGTTG AAAAAGAGAT GTACACCTTT GAGGATCGCA ATGGCGACAG CCTGACTCTG CGCCCTGAAG GGACGGCGGG CTGTGTACGC GCCGGCATCG AGCATGGTCT TCTGTACAAT CAGGAACAGC GTCTGTGGTA TATCGGGCCG ATGTTCCGTC ACGAGCGTCC GCAGAAAGGG CGTTATCGTC AGTTCCATCA GTTGGGCTGC GAAGTTTTCG GTCTGCAAGG TCCGGATATC GACGCTGAAC TGATTATGCT CACCGCCCGC TGGTGGCGCG CGCTGGGTAT TTCCGAACAC GTAACTCTTG AGCTGAATTC TATCGGTTCG CTGGAAGCAC GCGCCAATTA CCGCGATGCG CTGGTGGCAT TCCTTGAGCA GCATAAAGAA AAGCTGGACG AAGACTGCAA ACGCCGCATG TACACTAACC CGCTGCGCGT GCTGGATTCA AAAAATCCGG AAGTGCAGGC GCTTCTCAAC GACGCTCCGG CATTAGGTGA TTATCTGGAC GAGGAATCTC GTGAGCATTT TGCCGGTCTG TGCAAACTGC TTGAGAGCGC GGGGATCGCT TACACCGTAA ACCAGCGTCT GGTGCGTGGT CTGGATTACT ATAACCGTAC CGTTTTCGAG TGGGTGACTA ACAGTCTCGG CTCCCAGGGC ACCGTGTGTG CAGGCGGTCG TTATGACGGT CTTGTGGAAC AACTGGGCGG TCGTGCAACA CCGGCTGTCG GTTTTGCGAT GGGCCTCGAA CGTCTTGTAT TGTTAGTACA GGCCGTTAAT CCGGAATTTA AAGCCGATCC TATTGTCGAT ATATACCTGG TGGCTTCAGG TGCTGATACA CAATCTGCGG CTATGGCATT AGCTGAGCGT CTGCGTGATG AATTACAGGG CGTGAAATTG ATGACCAACC ACGGCGGCGG CAACTTTAAG AAACAGTTTG CCCGTGCTGA TAAATGGGGT GCCCGCGTTG CTGTGGTGCT GGGTGAGTCT GAAGTGACTA ACGGCACAGC AGTAGTGAAG GATTTGCGCT CTGGTGAGCA AACGGCAGTT GCGCAGGATA GCGTAGCCGC GCATTTGCGC ACGTTACTGG GTTAA
|
Protein sequence | MAKNIQAIRG MNDYLPGETA IWQRIEGTLK NVLGSYGYSE IRLPIVEQTP LFKRAIGEVT DVVEKEMYTF EDRNGDSLTL RPEGTAGCVR AGIEHGLLYN QEQRLWYIGP MFRHERPQKG RYRQFHQLGC EVFGLQGPDI DAELIMLTAR WWRALGISEH VTLELNSIGS LEARANYRDA LVAFLEQHKE KLDEDCKRRM YTNPLRVLDS KNPEVQALLN DAPALGDYLD EESREHFAGL CKLLESAGIA YTVNQRLVRG LDYYNRTVFE WVTNSLGSQG TVCAGGRYDG LVEQLGGRAT PAVGFAMGLE RLVLLVQAVN PEFKADPIVD IYLVASGADT QSAAMALAER LRDELQGVKL MTNHGGGNFK KQFARADKWG ARVAVVLGES EVTNGTAVVK DLRSGEQTAV AQDSVAAHLR TLLG
|
| |
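The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch under stated assumptions, not the method used to produce the record: it uses the standard codon assignments (which translation table 11 shares for elongation), reads the start codon as M when it is one of the common bacterial start codons ATG, GTG, or TTG (table 11 lists further, rarer ones), and stops at the first in-frame stop codon. The name `translate_cds` is hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: only the most common bacterial start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Walk the sequence codon by codon, dropping any trailing partial codon.
    for index in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")  # a start codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate_cds("GTGGCAAAAA ACATTCAAGC CATTCGCGGC"))  # prints: MAKNIQAIRG
```

The output matches the first ten residues of the protein sequence, including the GTG start codon being read as M.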