Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4681 |
Symbol | hisS |
ID | 5902143 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 5061285 |
End bp | 5062781 |
Gene Length | 1497 bp |
Protein Length | 498 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641565200 |
Product | histidyl-tRNA synthetase |
Protein accession | YP_001686299 |
Protein GI | 167648636 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0124] Histidyl-tRNA synthetase |
TIGRFAM ID | [TIGR00442] histidyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.892356 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACC AAGCTGACAC CCCGGCCCAG ACCTTCCGCC CCGAAGCCCG CAACCCGCGC GGCTTCGCCG ACAAGCGCGC CCGCGACCTG CGGGCCGAGC GGGCGATCCT GGAGGCCGTG TCGGCGGTCT ATGAGCGCTA CGGCTTCGAG GCGCTGGACA CCGGGGCGTT CGAATATGCC GACGCCCTGG GCAAGTTCCT GCCCGACAGC GACCGTCCCA ACGAGGGGGT GTTCGCCCTG CAGGACGACG ACGACCAGTG GATGGCGCTG CGCTACGACC TGACCGCGCC CCTGGCCCGC TTCGCCGCCC AGAACTGGGA GACCCTGCCC AAGCCGTTCC GCCGCTATGC CTTCGGGCCG GTGTGGCGCA ACGAGAAGCC GGGGCCAGGA AGGTTCCGCG AGTTCATCCA GTGCGACGCC GACACCGTCG GCTCGGCCCG TCCCGAGGCC GACGCCGAGA TCATCGCCAT GGCCGTCGAG GGCCTGCAGG CGGCGGGCCT GCCCCGAGGC GCGGCGGTGC TGAAGATCAA CAATCGCAAG CTGCTGAACG GCCTGCTGAC CGCCGCCGGG GTCGAGACCC AGGGCCAGAA GCTGGGCGTG CTGCGCGCCG TCGACAAGCT GGACCGCCTG GGCGTCGAGG GCGTGCGCCT GCTGCTGGGC GAGGGCCGGC TGGACGAGAG CGGCGCCTTC ACCAAGGGCG CGGGCCTGAT GGGCAAGGCG ATCGACTCGG TGCTCGACTT CGTCCAGGCC GGGCCGATGG GCGGTATCGG CGGGCGCTCG GACACCCTGG CCAGGATCGC CAATGTGATC GGCGGCTCGG CCGAAGGGGA CGAGGGGCTG GAGGAGCTGG CCAAGATCGA CGCGGCGCTC AAGAGCCTGG GCGTGGCCGA CGACCAGGCG CTCTTTGAGC CCTCTATCGT TCGTGGGCTC GAGTACTACA CCGGCGCGGT GTTCGAGGCC GAGCTGCTGT TGTCGACCAC GGACGACAAG GGCGCCAGCG TCAGCTTCGG CTCGATCGGC GGCGGCGGGC GCTATGACGA CCTGGTGGCG CGGTTCACCG GCCAGGTGAC GCCGGCCACG GGCTTCTCGT TCGGCGTCTC GCGCCTGGCG GCGGCGCTGC GGGCGGCGGG ACGCGAACCC GGAGGGGTCG CGCGAGGTCC GGTGGTGGTC ATCGCCTTCG ACCAGGCCCA CATGGGCGAG TACTTCGCCG TCGTCACCGA GCTGCGCAAC GCCGGCGTCG CCGCCGAGGT CTATCTGGGA ACCTCGGGTA TGCGGCCGCA GATGAAATAC GCCGACCGCC GCATGGCGCC GGCCGCCATC ATGCTGGGCG GCGACGAGAT CGCGGCCGGC ACGGTGACGA TCAAGGATCT GGACCTGGGT CGCGAGCTGG CCGCCGGGGT GGCCGACAAC GCCGCCTGGA AGGCCGAGCG GCCCGGCCAG CAGACCATCC CGCGCGGCGA GCTGGTCGCG GCGGTCAAGA AAATCATCGG GGGTTGA
|
Protein sequence | MTDQADTPAQ TFRPEARNPR GFADKRARDL RAERAILEAV SAVYERYGFE ALDTGAFEYA DALGKFLPDS DRPNEGVFAL QDDDDQWMAL RYDLTAPLAR FAAQNWETLP KPFRRYAFGP VWRNEKPGPG RFREFIQCDA DTVGSARPEA DAEIIAMAVE GLQAAGLPRG AAVLKINNRK LLNGLLTAAG VETQGQKLGV LRAVDKLDRL GVEGVRLLLG EGRLDESGAF TKGAGLMGKA IDSVLDFVQA GPMGGIGGRS DTLARIANVI GGSAEGDEGL EELAKIDAAL KSLGVADDQA LFEPSIVRGL EYYTGAVFEA ELLLSTTDDK GASVSFGSIG GGGRYDDLVA RFTGQVTPAT GFSFGVSRLA AALRAAGREP GGVARGPVVV IAFDQAHMGE YFAVVTELRN AGVAAEVYLG TSGMRPQMKY ADRRMAPAAI MLGGDEIAAG TVTIKDLDLG RELAAGVADN AAWKAERPGQ QTIPRGELVA AVKKIIGG
|
| |