Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2855 |
Symbol | |
ID | 5736892 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3622614 |
End bp | 3625379 |
Gene Length | 2766 bp |
Protein Length | 921 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641279998 |
Product | valyl-tRNA synthetase |
Protein accession | YP_001545621 |
Protein GI | 159899374 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGCTG ACAAACATTT ACACAATGCC TTAGCCGGAG CCGATCTTGG CCAAGCCTAC GAGGCCGCCA AGGTTGAGGA ACATTTGTAC AAGTGGTGGG AAGCGGCGGG CTATTTCAAG CCAACTGCTG CCAATACGAA AGCCCCATTT GTGATTGCGA TTCCGCCGCC CAATGTGACT GGTGTGCTGC ACACTGGTCA TGGCTTGACC AACACGATCG AGGATATTTT AACCCGCTGG CATCGCATGC TCGGCCAACC AACGTTGTGG GTTCCAGGTA CCGACCACGC GGGGATCGCC ACTCAAAATG TGGTTGAAAA GCAACTTGCC AAAGTTGGCA AAACTCGCCA CGATCTCGGA CGCGAGGATT TTCTCGACGC TGTTTGGGAA TGGAAGGGCC GTTCGCATTC CACAATCACC AACCAAATTC GTCGCCTCGG CTCATCGGTC GATTGGCAGC GCGAACGCTT TACCCTCGAC GAAGGTCTAT CGCAAGCGGT TGTTGTGGCC TTCAAGCGCT TGTACGACGA TGGCTTGATC TATCGGGGCA CGCGCTTGGT CAACTGGTGT CCGCGCTGTC TTTCGGCGAT CTCCGACCTT GAAGTGGTTT ATCGCGATGA GCAAGAGCAA GGCAATTTGT GGCACATTCG CTATAAAGTT GCTAACGATG CCAACGATAC CGAGTGGCGG ATCAGCGAGG GCGATCAATC AATTACGATT GCCACGACTC GACCCGAAAC CTTGCTAGCC GACGTGGCGG TCGCGGTGCA TCCTGAAGAT GAGCGCTATG CCGATTTGGT GGGCAAATTT GTGGTGCTGC CAGCTTTGGG TCGCCAAATT CCAATTATCG CCGACACCTA TGTTGAGCGT GAGTTTGGCA CGGGTGCGCT CAAAATCACT CCAGGCCACG ATCCGAATGA CTATATTGTT GGTCAACGTC ATAACTTGCC GATTCTCAAT GCCATGAATC TTGATGCAAC GATCAATTCT GAGGGTGGCA GTTACGCTGG GCTTGATCGC TTTGAGGCTC GTAAGCGCTT GGTCGCCGAT TTGACTGAAA CTGGCAATTT GGTTGAAACC AAGCCGCACT TGATGAAGAT TGGCCGCTGC GAGCGCTGCG ATACAATCAT CGAGCCATTA ATTAGCACCC AGTGGTTTGT CAAAACGCAA CCATTGGCTG AGCCAGCGAT GGCCGCCGTG CGTGAAGGCC GCACCAAAAT CGTGCCCGAA CGCTTCAATA AAATCTATTT CCATTGGATG GAAAATATTC AGGATTGGTG TATCAGCCGC CAACTCTGGT GGGGCCATCG GATTCCGGTG TGGTACGGCC CCGATAACCA GATGTTTGTC GAGTTGAATG CTGCTGATGC GATGGCTGCG GCAACTGCGC ATTATGGTCA AGTGGTTGAG TTGCGCCAAG ACGAAGATGT GTTGGATACG TGGTTCTCAT CGGGCTTGTG GCCATTCAGC ATTTTGGGCT GGCCCGATGT TGAAAATCCT GATTTCAAAC AATTTTACCC AACCACACTG CTCGAAACCG GCTACGACAT TTTGTTTTTC TGGGTAGCGC GGATGATGAT GTTGGGGCTT TATCTTACGG GCAAAGAGCC GTTTGAATGG GTATATTTGC ATGGCCTTGT GCGCGATGAA CATGGCCGCA AGATGTCAAA ATCGTTGGGC AACCAGGTTG ATCCGATGGA TTTGATCGAG CAATATGGCA CTGATGCGCT GCGGTTTACC TTCGCCACCT CATCAACGCC AGGCCAAGAT TTTGCGCTGC AACCAACCCG TTTGGATTCG GCGCGTTCGT TCGCCAACAA AATCTGGAAT GCCACCCGCT TTGTGATTTC GAAGTTGGGC GATTTGCCGC GCACTGCCGA GAGCAAAGTT GATGCTGAAC GTTTGAATGC CCAAGCCTAT ACCGTCGCTG ATCGCTGGAT TCTTTCGCGG TTCAATCGCT TAGCGGGCGA TGTTGAACGC TTGATGAACA GCTTCAATTT GGGCGAAGCT GGTCGCCAAA TCCAAACCTT TTTCTGGGAT GAATTTGCTG ATTGGTATAT TGAAACCGCC AAAATTCAAA TTGACACTGG CGATGAACAA CAGCAATTAC GCACCCGCGA AACGCTCTAC AGCGTTTTAG AAGGAACTTT GCGGTTGCTG CACCCATTTA TGCCGTTTGT GAGCGAGGCC GCTTGGCAAA AATTACACAA TAGCGAGCAA ACCACGCCAA CTCCAGCAGC CTTGATTATC GCTGAGTATC CTTTAATTAA TGCTGCCATG CTTAACGAGC AAGCTGAGCG TGATTGGGAC TTGGTGCAAA ATATCATTCG CGGCGTGCGC AACGTGCGTA CTGAAACAGG CGTGGAAGCA GTTAAATGGA TCGAGGCATT GATCGCGGCT GGTTCAGCCA CAGCGATGTT AACCGAACAA ACTGCAATTA TCAGCCGCCT GGCGCGGATC GCTCCCGACA AACTACTGAT TAGCGAAAGC CTGAGCGAAC GGCCTGAGCA AGCCACAACC TTGGTATTTG CTCCAGCCGA AGTTGTCTTG CCATTGGCGG GTATGGTCGA TTTGGCCGCC GAGCGCGAAC GGCTCAACAA GGAGCTTGAA CGGGTCGAGG CCGATGTCGA GCGCCGCCGC ACGAAGCTGG CTAACGAAAA TTTTGTGGCT AAAGCCAAGC CCGAAGTTGT GCAAAAAGAG CGTGAGGCTT TAGCTGCTCA AGAGTTGGCC GCTACAACCT TGCGTGAACG CTTGGCAAGT TTCTAG
|
Protein sequence | MSADKHLHNA LAGADLGQAY EAAKVEEHLY KWWEAAGYFK PTAANTKAPF VIAIPPPNVT GVLHTGHGLT NTIEDILTRW HRMLGQPTLW VPGTDHAGIA TQNVVEKQLA KVGKTRHDLG REDFLDAVWE WKGRSHSTIT NQIRRLGSSV DWQRERFTLD EGLSQAVVVA FKRLYDDGLI YRGTRLVNWC PRCLSAISDL EVVYRDEQEQ GNLWHIRYKV ANDANDTEWR ISEGDQSITI ATTRPETLLA DVAVAVHPED ERYADLVGKF VVLPALGRQI PIIADTYVER EFGTGALKIT PGHDPNDYIV GQRHNLPILN AMNLDATINS EGGSYAGLDR FEARKRLVAD LTETGNLVET KPHLMKIGRC ERCDTIIEPL ISTQWFVKTQ PLAEPAMAAV REGRTKIVPE RFNKIYFHWM ENIQDWCISR QLWWGHRIPV WYGPDNQMFV ELNAADAMAA ATAHYGQVVE LRQDEDVLDT WFSSGLWPFS ILGWPDVENP DFKQFYPTTL LETGYDILFF WVARMMMLGL YLTGKEPFEW VYLHGLVRDE HGRKMSKSLG NQVDPMDLIE QYGTDALRFT FATSSTPGQD FALQPTRLDS ARSFANKIWN ATRFVISKLG DLPRTAESKV DAERLNAQAY TVADRWILSR FNRLAGDVER LMNSFNLGEA GRQIQTFFWD EFADWYIETA KIQIDTGDEQ QQLRTRETLY SVLEGTLRLL HPFMPFVSEA AWQKLHNSEQ TTPTPAALII AEYPLINAAM LNEQAERDWD LVQNIIRGVR NVRTETGVEA VKWIEALIAA GSATAMLTEQ TAIISRLARI APDKLLISES LSERPEQATT LVFAPAEVVL PLAGMVDLAA ERERLNKELE RVEADVERRR TKLANENFVA KAKPEVVQKE REALAAQELA ATTLRERLAS F
|
| |