Gene Information |
Locus tag | Ava_2982 |
Symbol | valS |
ID | 3681275 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 3698600 |
End bp | 3701608 |
Gene Length | 3009 bp |
Protein Length | 1002 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637718328 |
Product | valyl-tRNA synthetase |
Protein accession | YP_323487 |
Protein GI | 75909191 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
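
The coordinate and length fields above are mutually consistent. A minimal sketch of the arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon (the function names are illustrative, not part of any database API):

```python
# Values copied from the record above; helper names are hypothetical.
START_BP = 3698600
END_BP = 3701608


def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 3009 1002
```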
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGCAA CTATAACCAA TCTTCCCAGT CTCTACGATC CTTTTACCAC TGAAGCCAAG TGGCAAAAAT TCTGGGAAGA AAACCAAATT TACAAAGCTG ACCCTAACAA GGATGGTGAA CCTTATTGTG TGGTGATTCC GCCGCCAAAT GTCACTGGTA GTCTGCACAT GGGTCACGCC TTCGAGAGTG CGTTGATTGA TACCCTAGTG CGCTATCACC GAATGCAGGG GCGTAATACC TTGTGGCTAC CCGGAACTGA CCACGCCAGT ATTGCAGTCC ATACAATTCT GGAAAAACAA CTCAAGGCTG AGGGCAAAAC TCGCCAAGAG TTGGGACGTG ATAAATTCCT AGAACGTTCT TGGCAATGGA AGGCGGAATC AGGGGGAACC ATTGTTAATC AGCTGCGACG TTTGGGTGTT TCGGTAGATT GGTCGCGGGA GAGGTTTACT TTAGATGAGG GCTTATCTAA GGCTGTAGCC GAAGCTTTCG TCAGTCTCTA CGAAGAGGGT TTGATTTATC GTGGTGAATA TTTGGTAAAT TGGTGTCCGG CCACTCAGTC AGCTGTGTCT GATGTGGAGG TGGAATCAAA AGAAGTGGAG GGTAATCTTT GGCATTTCCG TTATCCTCTG ACCGATGGTT CTGGTTATGT GGAAGTAGCG ACGACTCGAC CGGAAACCAT GCTTGGTGAT ACGGCTGTTG CAGTTAATCC CAATGATGAC AGATATAAAC ATCTGATTGG TAAAACCCTC ACACTGCCAA TTACACAACG GGAAATTCCT ATTATTAGTG ATGAATTAGT TGACCCTGCT TTCGGTACAG GTTGCGTAAA AGTGACTCCC GCCCATGACC CCAACGATTT TGAAATGGGT AAGCGTCACA ATCTGCCGTT TATTAACATC CTAAATAAAG ACGGTACACT CAACGCCAAT GGTGGGGAGT TTGCAGGACA AGACCGCTTT GTAGCAAGGA AGAACGTAGT ATCTCGCCTA GAAACAGATG GTTTTCTGGT GAAGATAGAA GATTATAAGC ATACCGTACC CTATAGCGAT CGCGGTAAGG TTCCTGTCGA ACCTTTATTA TCTACTCAAT GGTTCGTACA AATTCGCCCC TTGGCAGATA AAGCGTTAGC ATTTCTTGAC GAGAAAAATA GCCCAGAGTT TGTTCCCCAA CGCTGGACAA AGGTTTATCG TGACTGGTTG GTAAATCTGC GGGATTGGTG TATTTCTCGA CAATTATGGT GGGGTCATCA AATCCCCGCT TGGTACGCGG TGAGTGAAAC CAACGGACAA ATTACCGATA ACACGCCTTT TGTGGTGGCA AAATCCACAA ATGAAGCCTG GGAGAAAGCT AAAGCGCAAT TTGGGGAGAA TGTCCAACTA GAACAAGACC CAGATGTACT AGATACTTGG TTTTCCTCAG GACTGTGGCC GTTTTCTACC TTAGGCTGGC CAGAACAAAC CCCAGATTTA GCCAAATACT ACCCCACTAC TACCTTAGTT ACGGGCTTTG ACATCATCTT TTTCTGGGTA GCAAGAATGA CGATGATGGC TGGTCATTTC ACAGGACAAA TGCCGTTTCA GACCGTTTAT ATTCACGGTT TGGTCAGGGA TGAAAATAAT AAAAAGATGT CCAAGTCGGC TAACAATGGA ATTGACCCAT TGTTACTGAT TGATAAATAC GGTACTGATG CCCTACGGTA TACCTTAGTT AGGGAAGTAG CCGGTGCTGG TCAAGATATC CGCTTGGAAT ATGACCGTAA AAAAGATGAA TCACCCTCGG TGGAAGCATC CCGCAACTTT 
GCCAATAAGT TGTGGAACGC TGCCAGATTT GTGATGATGA ATTTGGATGG ACAGACACCA GGGCAACTTG GTCAACCAAA TGCCACGGAA TTAAGCGATC GCTGGATTAT TTCCCGCTAT CATCAAGTTA TCAAGCAAAC TACCAATTAC ATTGATAATT ACGGTTTAGG GGAAGCAGCC AAAGGAATTT ACGAATTCAT CTGGGGCGAT TTCTGCGACT GGTATATTGA ACTAGTAAAA TCCAGACTGC AAAAGGACGC AGACCCTTTA TCACGTAAAG CAGCACAACA AACCCTCGCC TACGTCCTCG AAGGGATTCT CAAGCTACTG CATCCCTTTA TGCCCCACAT TACAGAGGAA ATTTGGCAGA CTCTCACCCA ACAACCAGAA AATTCTTCAC AAACTTTAGC TTTACAAGCC TATCCCCAAG CAGATGCAAA CTGGATAAAT CCCGCCTTGG AAACACAGTT TGATTTGTTG ATTGGTACTA TCCGCACAAT TCGTAACTTA CGCGCTGAGG CGGAGGTGAA GCCAGGGGCA AGAATCATCG CCAATTTACA AACTGATAGT GAATCAGAAA GACAAATCCT CATAGCTGGT CAATCTTATA TTAAAGATTT AGCCAAGGTG GAGACTTTGA CCATTGCTGC TGGACAACAG CCATCAACGG TGACAAAAAA GAAACCCCAA AGGGGCTTAA AAACTATCGG CTTAGTTATC GCCGGCCTGG TTTTCCTCAG GGTAGCTTTG GCGGTAGCGG ATACAGTTGA TAATGTTCCT TTCCTGGGAA ATTTCTTTGA AATTGTTGGG TTGGGTTACT CTGCTTGGTT TGTCGCCCGT AACTTATTAT CCACCCCAGC TAGACAAAGA TTTTTAGCTA AGTTCTTCGC TTCACCCACT GAGAAGAATC TTTCAGAGAC AGTACCACAA GCGCCACAAG CAGCAGAAAA GTCTATCGCT GGTGTGGTAG GAACTGTACA AGTTGTTATA CCTCTAGCTG GTGTAGTGGA CATTGAAACT CTACGTGCCA AACTAGAGAG AAGCATCAGC AAAGCGGAGG CTGAAGCTCA ATCTCTCAAA GGTCGGTTAA GTAATCCTAA GTTTGTCGAT AAAGCTCCAG CAGATGTAGT ACAAGCTGCG CGAGATGCTT TAGCTGAGGC AGAAAAACAA GTGGAAATTT TGCGCTTGCG CCTTCAGACA TTGGTGTAA
|
Protein sequence | MTATITNLPS LYDPFTTEAK WQKFWEENQI YKADPNKDGE PYCVVIPPPN VTGSLHMGHA FESALIDTLV RYHRMQGRNT LWLPGTDHAS IAVHTILEKQ LKAEGKTRQE LGRDKFLERS WQWKAESGGT IVNQLRRLGV SVDWSRERFT LDEGLSKAVA EAFVSLYEEG LIYRGEYLVN WCPATQSAVS DVEVESKEVE GNLWHFRYPL TDGSGYVEVA TTRPETMLGD TAVAVNPNDD RYKHLIGKTL TLPITQREIP IISDELVDPA FGTGCVKVTP AHDPNDFEMG KRHNLPFINI LNKDGTLNAN GGEFAGQDRF VARKNVVSRL ETDGFLVKIE DYKHTVPYSD RGKVPVEPLL STQWFVQIRP LADKALAFLD EKNSPEFVPQ RWTKVYRDWL VNLRDWCISR QLWWGHQIPA WYAVSETNGQ ITDNTPFVVA KSTNEAWEKA KAQFGENVQL EQDPDVLDTW FSSGLWPFST LGWPEQTPDL AKYYPTTTLV TGFDIIFFWV ARMTMMAGHF TGQMPFQTVY IHGLVRDENN KKMSKSANNG IDPLLLIDKY GTDALRYTLV REVAGAGQDI RLEYDRKKDE SPSVEASRNF ANKLWNAARF VMMNLDGQTP GQLGQPNATE LSDRWIISRY HQVIKQTTNY IDNYGLGEAA KGIYEFIWGD FCDWYIELVK SRLQKDADPL SRKAAQQTLA YVLEGILKLL HPFMPHITEE IWQTLTQQPE NSSQTLALQA YPQADANWIN PALETQFDLL IGTIRTIRNL RAEAEVKPGA RIIANLQTDS ESERQILIAG QSYIKDLAKV ETLTIAAGQQ PSTVTKKKPQ RGLKTIGLVI AGLVFLRVAL AVADTVDNVP FLGNFFEIVG LGYSAWFVAR NLLSTPARQR FLAKFFASPT EKNLSETVPQ APQAAEKSIA GVVGTVQVVI PLAGVVDIET LRAKLERSIS KAEAEAQSLK GRLSNPKFVD KAPADVVQAA RDALAEAEKQ VEILRLRLQT LV
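
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is illustrative only: it builds the codon table by hand rather than using a library, and relies on translation table 11 assigning the same amino acids to codons as the standard code (the two differ in which codons may act as start codons). The function names are hypothetical. Only the first 30 bases of the gene are used here; the full sequence can be pasted in the same way.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a CDS from its first base, stopping at the first stop codon.

    Note: an alternative start codon (GTG or TTG) would be read as M in
    the cell; this sketch does not handle that case. This gene starts
    with ATG, so it does not arise here.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


gene_prefix = "ATGACCGCAA CTATAACCAA TCTTCCCAGT"  # first 30 bases of the gene
print(translate(gene_prefix))  # MTATITNLPS
```

The output matches the first ten residues of the protein sequence above. Calling `gc_percent` on the full gene sequence gives a value to compare with the GC content field.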