Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2246 |
Symbol | |
ID | 5734133 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2862943 |
End bp | 2864247 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641279387 |
Product | histidyl-tRNA synthetase |
Protein accession | YP_001545014 |
Protein GI | 159898767 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0124] Histidyl-tRNA synthetase |
TIGRFAM ID | [TIGR00442] histidyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000301109 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAATTA CTCCTCGCGC CTACAAAGGC ATGCGCGATC ACTTGCCCGA AGCTATGCGC TTGCGGCGTT TTATCACCGA TACCTTGATT GGTATTTTAG AGCGCTATGG CTTTGAGCCA CTTTCCACAC CGATTGTCGA ATATTCGGAA ACGCTCGAAG GCAAGATTGG CGATGAAGAA AAATTGCTGT ATCGTTTGAA ATATGGCGAT GATGCTTTGA CCTTGCGTTA CGACCAAACT GTGCCCTTGG CGCGGGTGGT GGCCCAAAAC GAAGGCAAAT TAACTATGCC CTTCAAACGC TATGCGCTCG GCCAATCGTA TCGCGGCGAA CGCCAAGCTC GTGGTCGCTA CCGCGAATTT TGGCAGCTTG ATGCCGATAT TGTTGGGGTA GATAGCCCGA TTGCCGATGC TGAAATTGTG GCGGTGGTCG TTGAAGGCTT GCGAGCTTTA GGCTTTACTG GTGCAAAAGT CTTGCTCAAT CACCGCGAAA TCTTGAGTGG GTTGGCGCGG GTAGCAGGTG TGCCCGAAAA CGAGGCAGGT GGAGTATATC GCGCGATCGA TAAGCTCGAT AAAATTGGCA ATGACGGTGT GCGCAACGAA TTGCTCAAAA GTGGCGTGAG TGCCGAAGCT GCTGAGCGTG TGTTGCACTT CGTCGGGATT AGCGGCTCGA TCGAAGCGGT GTTGGCCGAA ATGGAAAGCG TGCTAGCCAA CGATCCGCCA GCTTTGGCAG CGGTTGCGGC CTTGCGCACC ATCTGCGAGG TATTGACGAG CTTTGGCGTG CCAGCCGATA GTTTTACGAT TGCGCCTAGC TTGGCTCGCG GCTTATCCTA CTATACGGGC TGTGTGTTCG AGGCCGTGCT CGATTCGCCA CCGATGGGTT CGTTATTGGG CGGTGGCCGC TACGACAATT TGGTGGGTAT GTTTAGCAAA CGCTCGTTGC CGACGGTTGG CTGTGCTTTT GGGCTTGAAC GCTTGTTTGA TTTGATGCTT GAGCTTAATA TGGGGCCACG CCCAGAGCGC ACGATCGATG CCTATGTTAC CTTATTTGCT GGCGATTTTC AAAATGAGAG CCTGCGCTTG GCGGGCGAAT TGCGGGTAGC TGGCCTGAGT GTATTGACCG CGTATAGTCC AGTCAAAATT GCCAACCAGT TCAAAGAGGC AGATCGCAAG GGTGCGAATT TTGCCTTGGT GCTTGGCCCC GATGATTTGG CTGCAGGCGT AGTACAGCTC AAAGATTTAC GCTCGGGTCA GCAACAAGCT GTAGCGCGTG ATGCGATCGT AGCGGCAATC AAAGCGGCCC AATAA
|
Protein sequence | MPITPRAYKG MRDHLPEAMR LRRFITDTLI GILERYGFEP LSTPIVEYSE TLEGKIGDEE KLLYRLKYGD DALTLRYDQT VPLARVVAQN EGKLTMPFKR YALGQSYRGE RQARGRYREF WQLDADIVGV DSPIADAEIV AVVVEGLRAL GFTGAKVLLN HREILSGLAR VAGVPENEAG GVYRAIDKLD KIGNDGVRNE LLKSGVSAEA AERVLHFVGI SGSIEAVLAE MESVLANDPP ALAAVAALRT ICEVLTSFGV PADSFTIAPS LARGLSYYTG CVFEAVLDSP PMGSLLGGGR YDNLVGMFSK RSLPTVGCAF GLERLFDLML ELNMGPRPER TIDAYVTLFA GDFQNESLRL AGELRVAGLS VLTAYSPVKI ANQFKEADRK GANFALVLGP DDLAAGVVQL KDLRSGQQQA VARDAIVAAI KAAQ
|
| |