Gene Ava_2982 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_2982 
SymbolvalS 
ID3681275 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp3698600 
End bp3701608 
Gene Length3009 bp 
Protein Length1002 aa 
Translation table11 
GC content45% 
IMG OID637718328 
Productvalyl-tRNA synthetase 
Protein accessionYP_323487 
Protein GI75909191 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0525] Valyl-tRNA synthetase 
TIGRFAM ID[TIGR00422] valyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCAA CTATAACCAA TCTTCCCAGT CTCTACGATC CTTTTACCAC TGAAGCCAAG 
TGGCAAAAAT TCTGGGAAGA AAACCAAATT TACAAAGCTG ACCCTAACAA GGATGGTGAA
CCTTATTGTG TGGTGATTCC GCCGCCAAAT GTCACTGGTA GTCTGCACAT GGGTCACGCC
TTCGAGAGTG CGTTGATTGA TACCCTAGTG CGCTATCACC GAATGCAGGG GCGTAATACC
TTGTGGCTAC CCGGAACTGA CCACGCCAGT ATTGCAGTCC ATACAATTCT GGAAAAACAA
CTCAAGGCTG AGGGCAAAAC TCGCCAAGAG TTGGGACGTG ATAAATTCCT AGAACGTTCT
TGGCAATGGA AGGCGGAATC AGGGGGAACC ATTGTTAATC AGCTGCGACG TTTGGGTGTT
TCGGTAGATT GGTCGCGGGA GAGGTTTACT TTAGATGAGG GCTTATCTAA GGCTGTAGCC
GAAGCTTTCG TCAGTCTCTA CGAAGAGGGT TTGATTTATC GTGGTGAATA TTTGGTAAAT
TGGTGTCCGG CCACTCAGTC AGCTGTGTCT GATGTGGAGG TGGAATCAAA AGAAGTGGAG
GGTAATCTTT GGCATTTCCG TTATCCTCTG ACCGATGGTT CTGGTTATGT GGAAGTAGCG
ACGACTCGAC CGGAAACCAT GCTTGGTGAT ACGGCTGTTG CAGTTAATCC CAATGATGAC
AGATATAAAC ATCTGATTGG TAAAACCCTC ACACTGCCAA TTACACAACG GGAAATTCCT
ATTATTAGTG ATGAATTAGT TGACCCTGCT TTCGGTACAG GTTGCGTAAA AGTGACTCCC
GCCCATGACC CCAACGATTT TGAAATGGGT AAGCGTCACA ATCTGCCGTT TATTAACATC
CTAAATAAAG ACGGTACACT CAACGCCAAT GGTGGGGAGT TTGCAGGACA AGACCGCTTT
GTAGCAAGGA AGAACGTAGT ATCTCGCCTA GAAACAGATG GTTTTCTGGT GAAGATAGAA
GATTATAAGC ATACCGTACC CTATAGCGAT CGCGGTAAGG TTCCTGTCGA ACCTTTATTA
TCTACTCAAT GGTTCGTACA AATTCGCCCC TTGGCAGATA AAGCGTTAGC ATTTCTTGAC
GAGAAAAATA GCCCAGAGTT TGTTCCCCAA CGCTGGACAA AGGTTTATCG TGACTGGTTG
GTAAATCTGC GGGATTGGTG TATTTCTCGA CAATTATGGT GGGGTCATCA AATCCCCGCT
TGGTACGCGG TGAGTGAAAC CAACGGACAA ATTACCGATA ACACGCCTTT TGTGGTGGCA
AAATCCACAA ATGAAGCCTG GGAGAAAGCT AAAGCGCAAT TTGGGGAGAA TGTCCAACTA
GAACAAGACC CAGATGTACT AGATACTTGG TTTTCCTCAG GACTGTGGCC GTTTTCTACC
TTAGGCTGGC CAGAACAAAC CCCAGATTTA GCCAAATACT ACCCCACTAC TACCTTAGTT
ACGGGCTTTG ACATCATCTT TTTCTGGGTA GCAAGAATGA CGATGATGGC TGGTCATTTC
ACAGGACAAA TGCCGTTTCA GACCGTTTAT ATTCACGGTT TGGTCAGGGA TGAAAATAAT
AAAAAGATGT CCAAGTCGGC TAACAATGGA ATTGACCCAT TGTTACTGAT TGATAAATAC
GGTACTGATG CCCTACGGTA TACCTTAGTT AGGGAAGTAG CCGGTGCTGG TCAAGATATC
CGCTTGGAAT ATGACCGTAA AAAAGATGAA TCACCCTCGG TGGAAGCATC CCGCAACTTT
GCCAATAAGT TGTGGAACGC TGCCAGATTT GTGATGATGA ATTTGGATGG ACAGACACCA
GGGCAACTTG GTCAACCAAA TGCCACGGAA TTAAGCGATC GCTGGATTAT TTCCCGCTAT
CATCAAGTTA TCAAGCAAAC TACCAATTAC ATTGATAATT ACGGTTTAGG GGAAGCAGCC
AAAGGAATTT ACGAATTCAT CTGGGGCGAT TTCTGCGACT GGTATATTGA ACTAGTAAAA
TCCAGACTGC AAAAGGACGC AGACCCTTTA TCACGTAAAG CAGCACAACA AACCCTCGCC
TACGTCCTCG AAGGGATTCT CAAGCTACTG CATCCCTTTA TGCCCCACAT TACAGAGGAA
ATTTGGCAGA CTCTCACCCA ACAACCAGAA AATTCTTCAC AAACTTTAGC TTTACAAGCC
TATCCCCAAG CAGATGCAAA CTGGATAAAT CCCGCCTTGG AAACACAGTT TGATTTGTTG
ATTGGTACTA TCCGCACAAT TCGTAACTTA CGCGCTGAGG CGGAGGTGAA GCCAGGGGCA
AGAATCATCG CCAATTTACA AACTGATAGT GAATCAGAAA GACAAATCCT CATAGCTGGT
CAATCTTATA TTAAAGATTT AGCCAAGGTG GAGACTTTGA CCATTGCTGC TGGACAACAG
CCATCAACGG TGACAAAAAA GAAACCCCAA AGGGGCTTAA AAACTATCGG CTTAGTTATC
GCCGGCCTGG TTTTCCTCAG GGTAGCTTTG GCGGTAGCGG ATACAGTTGA TAATGTTCCT
TTCCTGGGAA ATTTCTTTGA AATTGTTGGG TTGGGTTACT CTGCTTGGTT TGTCGCCCGT
AACTTATTAT CCACCCCAGC TAGACAAAGA TTTTTAGCTA AGTTCTTCGC TTCACCCACT
GAGAAGAATC TTTCAGAGAC AGTACCACAA GCGCCACAAG CAGCAGAAAA GTCTATCGCT
GGTGTGGTAG GAACTGTACA AGTTGTTATA CCTCTAGCTG GTGTAGTGGA CATTGAAACT
CTACGTGCCA AACTAGAGAG AAGCATCAGC AAAGCGGAGG CTGAAGCTCA ATCTCTCAAA
GGTCGGTTAA GTAATCCTAA GTTTGTCGAT AAAGCTCCAG CAGATGTAGT ACAAGCTGCG
CGAGATGCTT TAGCTGAGGC AGAAAAACAA GTGGAAATTT TGCGCTTGCG CCTTCAGACA
TTGGTGTAA
 
Protein sequence
MTATITNLPS LYDPFTTEAK WQKFWEENQI YKADPNKDGE PYCVVIPPPN VTGSLHMGHA 
FESALIDTLV RYHRMQGRNT LWLPGTDHAS IAVHTILEKQ LKAEGKTRQE LGRDKFLERS
WQWKAESGGT IVNQLRRLGV SVDWSRERFT LDEGLSKAVA EAFVSLYEEG LIYRGEYLVN
WCPATQSAVS DVEVESKEVE GNLWHFRYPL TDGSGYVEVA TTRPETMLGD TAVAVNPNDD
RYKHLIGKTL TLPITQREIP IISDELVDPA FGTGCVKVTP AHDPNDFEMG KRHNLPFINI
LNKDGTLNAN GGEFAGQDRF VARKNVVSRL ETDGFLVKIE DYKHTVPYSD RGKVPVEPLL
STQWFVQIRP LADKALAFLD EKNSPEFVPQ RWTKVYRDWL VNLRDWCISR QLWWGHQIPA
WYAVSETNGQ ITDNTPFVVA KSTNEAWEKA KAQFGENVQL EQDPDVLDTW FSSGLWPFST
LGWPEQTPDL AKYYPTTTLV TGFDIIFFWV ARMTMMAGHF TGQMPFQTVY IHGLVRDENN
KKMSKSANNG IDPLLLIDKY GTDALRYTLV REVAGAGQDI RLEYDRKKDE SPSVEASRNF
ANKLWNAARF VMMNLDGQTP GQLGQPNATE LSDRWIISRY HQVIKQTTNY IDNYGLGEAA
KGIYEFIWGD FCDWYIELVK SRLQKDADPL SRKAAQQTLA YVLEGILKLL HPFMPHITEE
IWQTLTQQPE NSSQTLALQA YPQADANWIN PALETQFDLL IGTIRTIRNL RAEAEVKPGA
RIIANLQTDS ESERQILIAG QSYIKDLAKV ETLTIAAGQQ PSTVTKKKPQ RGLKTIGLVI
AGLVFLRVAL AVADTVDNVP FLGNFFEIVG LGYSAWFVAR NLLSTPARQR FLAKFFASPT
EKNLSETVPQ APQAAEKSIA GVVGTVQVVI PLAGVVDIET LRAKLERSIS KAEAEAQSLK
GRLSNPKFVD KAPADVVQAA RDALAEAEKQ VEILRLRLQT LV