Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3835 |
Symbol | |
ID | 5735700 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4814738 |
End bp | 4816138 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641280988 |
Product | histidine--tRNA ligase |
Protein accession | YP_001546599 |
Protein GI | 159900352 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0124] Histidyl-tRNA synthetase |
TIGRFAM ID | [TIGR00442] histidyl-tRNA synthetase [TIGR00443] ATP phosphoribosyltransferase, regulatory subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000618349 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGGCTATC AATTAACTCC TGAAGTGGTT CGCGGAACCC GTGACCTTTT TGCCGCTGCC GTTCACCAAC GTCAGGCGCT GATTCAGACA TTAACCGCTA CTTTTGATCA AGCTGGCTAT GACCCGATTG AGTTGCCATT GCTTGAGCAT CGTGAGTTGT ATTTAAAAAA ATCTGGTGAT GATCTCATCG CCAAATTGTA CCATTGGAAT CAAGGTGGCC GTGATTTGGC CTTACGCCCT GAATGGACAG CCTCGGTCTT ACGAGCAGTC ATCAGTGGCA TGGGCGATGC GCCAATTCCC TTACGCTTGC GCTACGCTGG GCCAGTGTTT CGCTATGAAC GCCCACGTCG CGCGACCTAT CGGCAATTTA CCCAAGTTGG CATCGAATTA ATCGGTGCAC CTGGCCCGTT GGCCGATGCT GAAGCCCTTG GTTTGGCTGT CACTGGCCTG CGCGAATTGG GCATTCAGCA ATGGACGTTA ACCATTGGCC ATATTGGTGT AATTAAGACG TTGCTCAATA GCCTCGGCTT ACCCGAACGG ATCACGTCGG CGCTCACCTG GAGCCTCGAA CGGATTCGTT CTAAAGGGCT TGATGCGGTT AAACAACAAT GGCGCGACGA CGACGACGAT CTGCCAGTTG ATTTAGCCAG CCTAGCCCAT CTCGCCGACC AAGATCTTGA GACGCTGTTG TTGCGCGTAT TACCCAGTCT TGGGGTGCGC CTCGATAGCG GCGGGCGTGA GCCACAAGCA ATCATTCAAC GCTTGGTGCG CAAGCTGCGC CGTGGCGACG ATGCGCTTAA TCTTGATCGC GCATGGCAGT TGCTTTCAGC TTTAACCGCT GCCCGTGGCT CGGCACCAGA TGTGATGCAA CAGATGCGCG AGCTATGCCA AGAATATACG GTTGCACCCG ACGCTTTGGA TGAGTTGCAA ACCACCCTGA CCTTACTTGA GGCATATGGC GTGCCAGCAG ATCAGATTGT GCTGGATTTT GGCATGGGTC GTGGCTTGCA CTACTACACT GGCCTGATTT TCGAGATCGA TGGCGCTGAT GGCTTGCAAC TCTGTGGCGG TGGCCGCTAT GATGATTTAG TTGCGGCGCT TGGTGGGCGA GCAATGCCAG CCGTTGGCTT TGCCTATGGA CTCGAACGAA TCGTCGCAGC AGTTGCGCCA GCCGAAATTG CCCCAGTTAA AAGTGTGTTG GTTGTTGGCG ATGATCATGG CTTGGTAATT CAGGCAGCAG CAGCACTCCG CCAACAAGGC TATCGCACGG CAGTTGATTT GCGCCAGCGT TCGTATGCGG CCAATTTAAA TGATGCCCGT CGGCGTGAAA TGAGCCATCT GGCCTTGGTT AGTGTTGATG GTATTCAACT GCGCGATTTA AATGAACAGA AACCCCAATG A
|
Protein sequence | MGYQLTPEVV RGTRDLFAAA VHQRQALIQT LTATFDQAGY DPIELPLLEH RELYLKKSGD DLIAKLYHWN QGGRDLALRP EWTASVLRAV ISGMGDAPIP LRLRYAGPVF RYERPRRATY RQFTQVGIEL IGAPGPLADA EALGLAVTGL RELGIQQWTL TIGHIGVIKT LLNSLGLPER ITSALTWSLE RIRSKGLDAV KQQWRDDDDD LPVDLASLAH LADQDLETLL LRVLPSLGVR LDSGGREPQA IIQRLVRKLR RGDDALNLDR AWQLLSALTA ARGSAPDVMQ QMRELCQEYT VAPDALDELQ TTLTLLEAYG VPADQIVLDF GMGRGLHYYT GLIFEIDGAD GLQLCGGGRY DDLVAALGGR AMPAVGFAYG LERIVAAVAP AEIAPVKSVL VVGDDHGLVI QAAAALRQQG YRTAVDLRQR SYAANLNDAR RREMSHLALV SVDGIQLRDL NEQKPQ
|
| |