Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0220 |
Symbol | |
ID | 5732115 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 256057 |
End bp | 257043 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641277344 |
Product | tryptophanyl-tRNA synthetase |
Protein accession | YP_001543000 |
Protein GI | 159896753 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0180] Tryptophanyl-tRNA synthetase |
TIGRFAM ID | [TIGR00233] tryptophanyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTTCGT TACCACGGGT TTTTTCGGGC ATTCAACCAT CAGGCAATTT GCACCTTGGC AACTATTTGG GGGCGATTCA TACCTGGGTG CAAGAGCAAA ATCAATACGA TAATTTCTTC TGTATTGTCG ATCTGCATGC GATCACCGTG CCACAAGATC CAGCCGAGTT GCGCAAAAAT GTGCGCGATT TGGCCGCGCT GTATCTGGCC GCAGGCATTA GCCTTGAGCA TGCCACGATT TTTGTCCAAT CGCATGTGCC AGCTCACGTT GAGCTAGGCT GGATTTTAAA TTGTCAAACG CCCTTGGGCT GGTTGAATCG CATGACCCAA TTTAAGGATA AATCGCAAAA GCAAGAAACC GTAGCGGCTG GCTTGATGAA TTATCCAACG CTGATGGCTG CTGATATTTT GCTCTACGAC ACCCAAGTTG TGCCAGTGGG CGACGATCAA CGTCAGCATA TTGAGCTAAC TCGCGATATT GCCGAGCGTT TTAATCATCT TTATGGCGAA ACCTTTGTCA TCCCCGAGGC CTTGATTCGG CCTGTGGCGG CGCGGGTGAT GGGCCTCGAC GACCCAACCC AGAAGATGAG CAAGAGCAAC AAAGCAGCCA ATCATTCAAT TCCTTTGTTG GGCGATTTGA AGGCAACGCG CAAGGCCATG ATGCGGGCAG TCACCGATTC GGGCAGCGAA ATCAAATTCG ACGAAACCCG ACCTGGGGTC AACAATCTTT TGGGGATTTA TCAAGCCTTG ACTGGCAAAG ACAAAGCCAC GATTGAGGCT GAATTTGAAG GCCAAGGCTA TGGCAAACTC AAAGGTGCAG TTGCCGATGT GGTGATCGCT ACCTTGGAGC CATTGCAACA ACGCTACAAT CAACTCACCG CTGATCCAGC TGAACTCGAT GGGATTCTCA AACGTGGCGC TGAACGCGCT GCCGAAGTTG CTAACGCTAC CTTGTTACGG GCTTATCAAC AACTTGGGCT GCGCTAA
|
Protein sequence | MGSLPRVFSG IQPSGNLHLG NYLGAIHTWV QEQNQYDNFF CIVDLHAITV PQDPAELRKN VRDLAALYLA AGISLEHATI FVQSHVPAHV ELGWILNCQT PLGWLNRMTQ FKDKSQKQET VAAGLMNYPT LMAADILLYD TQVVPVGDDQ RQHIELTRDI AERFNHLYGE TFVIPEALIR PVAARVMGLD DPTQKMSKSN KAANHSIPLL GDLKATRKAM MRAVTDSGSE IKFDETRPGV NNLLGIYQAL TGKDKATIEA EFEGQGYGKL KGAVADVVIA TLEPLQQRYN QLTADPAELD GILKRGAERA AEVANATLLR AYQQLGLR
|
| |