Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_11620 |
Symbol | valS |
ID | 7760104 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 1115343 |
End bp | 1118177 |
Gene Length | 2835 bp |
Protein Length | 944 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643804064 |
Product | valyl-tRNA synthetase |
Protein accession | YP_002798366 |
Protein GI | 226943293 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACAAGA CCTACCAGCC GCACGCAATC GAATCCCGTT GGTACGCCGA GTGGGAGTCG AAGAACTACT TCGCCCCGCA GGGCAGCGGC GAACCCTACA CCATCATGAT TCCGCCGCCG AACGTCACCG GCAGCCTGCA CATGGGCCAC GGCTTCAACA ACGCGATCAT GGACGCGCTG ATCCGCTTCC GCCGCATGCA GGGGCGCAAT ACCCTGTGGC AGCCGGGCAC CGACCACGCC GGCATCGCCA CCCAGATGGT AGTGGAGCGC CAACTGGCGG CCCTGGGCCT CGACCGCCAC GCGCTCGGTC GCGAGAAGTT TCTCGACAAG GTCTGGGAAT GGAAGGAGCA GTCCGGCGGC ACCATCACCC GGCAGATTCG CCGCCTCGGC AGCTCGGTGG ACTGGTCGCG GGAACGCTTC ACCATGGACG AGGGCCTTTC CGAAGCGGTC AAGGAAGCCT TCGTCCGCCT CCACGAGGAC GGCCTGATCT ACCGCGGCAA GCGCCTGGTC AACTGGGACA CCAAGCTGCA CACCGCGATC TCCGACCTGG AAGTGGAGAA CCACGACGAA AAGGGCCACC TCTGGCACCT GCGCTACCCG CTGGCCGACG ACGCCTGCAC CGCCGAAGGC AAGGACTACC TGGTGGTCGC AACCACCCGC CCGGAAACCA TGCTGGGCGA CGCCGCCGTC GCCGTACACC CGGAGGACGA GCGCTACCGG GACCTGATCG GCCGCCACGT GCTGCTGCCG CTGGTCAACC GCCTGATCCC GATCGTCGCC GACGAGTACG TCGACCGCGA ATTCGGCACC GGCTGCGTGA AGATCACCCC GGCCCACGAC TTCAACGACT ACGAGGTCGG CAAGCGCCAC CACCTGCCGC TGATCAACAT CTTCGACAAG AACGCCGGCA TCCTGGCCCA GGCCCAGGTG TTCGACATCG ACGGCACGCC GAACACCCGC GTCGCCCCCA GCCTGCCGGA CGGCTACGCC GGCATGGACC GCTTCGACGC GCGCAAGGCC ATCGTCGCCG ACTTCGAGGG CATGGGCCTT CTCGAGAAGA TCGACGACCA TGCCCTGAAG GTGCCGCGCG GCGACCGTTC CGGCACCATC ATCGAACCCT GGCTGACCGA CCAGTGGTAC GTCTCCACCA AACCGCTGGC CGAGAAGGCC ATCGCCGCCG TCGAGGACGG TTCCATCCAG TTCGTGCCCA GGCAGTACGA GAACATGTAT TTCTCCTGGA TGCGCGACAT CCAGGACTGG TGCATCAGCC GCCAGCTCTG GTGGGGCCAC CGCATCCCGG CCTGGTACGA CGAGGCCGGC AACGCCTACG TCGGCCGCGA CGAGGCGGAA GTGCGCAGCA AGTACGCGAT CCGCAACGAC GAGCCGCTGC GCCAGGACGA AGACGTGCTG GACACCTGGT TCAGCTCCGG CCTGTGGACC TTCTCCACCC TCGGCTGGCC GCAGCAGACC GAGTTCCTCA AGACCTTCCA CCCCACCGAT GTGCTGGTCA CCGGCTTCGA CATCATCTTC TTCTGGGTCG CCCGGATGAT CATGCTGTCC CTGCACCTGA CCGGGCAGAT CCCCTTCAGG ACCGTCTACG TCCATGGCCT GGTACGCGAC AGCCAAGGCC ACAAGATGTC CAAGTCCAAG GGTAACGTGC TCGACCCGCT GGACATCGTC GACGGCATCG ACCTGGAAAG CCTGGTGACC AAGCGCACCA GCGGCATGAT GCAGCCCAAG CTCGCCGAGA AGATCGCCAA GCAGACCCGC GCCGAATTCC CCGAGGGCAT CGCCAGCTAC GGCACCGACG CGCTGCGTTT CACCTTCTGC TCGCTGGCCT CCACCGGCCG CGACATCAAG TTCGACATGG GCCGGGTCGA GGGCTACCGC AACTTCTGCA ACAAGCTGTG GAACGCCGCC AACTTCGTCT TCGAGAACAC CGAGGGCAAG GATTGCGGCG CCGCCGACGA ACCCGTCGAG CTGTCCCCGG TGGACCGCTG GATCGTCTCG GCGCTGCAGC GCACCGAGCA GGAGGTGACC CGCCAGCTCG ACGCCTTCCG CTTCGACCTC GCTGCCCAGG CGCTCTACGA GTTCATCTGG GACCAGTACT GCGCCTGGTA CCTGGAGCTG GTCAAGCCGG TGCTCTGGGA CGAGACTGCT AGCGTCGAGC GCCAGCGCGG CACTCGGCGC ACCCTGGTGC GGGTGCTGGA AACCGCCCTG CGCCTGGCGC ACCCCTTCAT GCCTTTCATC AGCGAGGAAA TCTGGCAGCG CCTCGCGCCG CTGGCCGGCA AGTCCGGCCC GACCCTGATG CTGCAACCCT GGCCGCTGGC CGACGAGGCG CGCATCGACG CGGCCGCCGA GGAGGACATC GAGTGGGTCA AGGCGCTGAT GCTCGGCATA CGGCAGATCC GCGGCGAAAT GAACATCTCC ATGGCCAAGC GCATTGACGT GGCGCTGAAC AACGCCTCGG ACAGCGACCG GCGGCGCCTC GAAGAAAACC GGCCGTTGCT GACGAAACTG GCCAAGCTGG AATCCATCCG CGTGCTGGAA GCCGGCGAGG AAGCACCGCT GGCCGCCACC GCGCTGGTCG GCGAGATGCA GGTGCTGGTG CCGATGGCCG GGCTGATCGA CAAGGACGCC GAACTGGCCC GTCTGGACAA GGAGATCCAG CGCCTGGAAG GCGAGGTCAA GCGCGTCGGC GGCAAGCTGG GCAACGCCAG TTTCGTCGAC AAGGCCCCGG CCGAGGTGAT CGCCAAGGAG CGCGCCAGAC TGAACGAGGC CGAACAGGCC TTGGGCAAGC TGGGCGAACA GCGCGCGCGC ATCGCCAGCC TCTGA
|
Protein sequence | MDKTYQPHAI ESRWYAEWES KNYFAPQGSG EPYTIMIPPP NVTGSLHMGH GFNNAIMDAL IRFRRMQGRN TLWQPGTDHA GIATQMVVER QLAALGLDRH ALGREKFLDK VWEWKEQSGG TITRQIRRLG SSVDWSRERF TMDEGLSEAV KEAFVRLHED GLIYRGKRLV NWDTKLHTAI SDLEVENHDE KGHLWHLRYP LADDACTAEG KDYLVVATTR PETMLGDAAV AVHPEDERYR DLIGRHVLLP LVNRLIPIVA DEYVDREFGT GCVKITPAHD FNDYEVGKRH HLPLINIFDK NAGILAQAQV FDIDGTPNTR VAPSLPDGYA GMDRFDARKA IVADFEGMGL LEKIDDHALK VPRGDRSGTI IEPWLTDQWY VSTKPLAEKA IAAVEDGSIQ FVPRQYENMY FSWMRDIQDW CISRQLWWGH RIPAWYDEAG NAYVGRDEAE VRSKYAIRND EPLRQDEDVL DTWFSSGLWT FSTLGWPQQT EFLKTFHPTD VLVTGFDIIF FWVARMIMLS LHLTGQIPFR TVYVHGLVRD SQGHKMSKSK GNVLDPLDIV DGIDLESLVT KRTSGMMQPK LAEKIAKQTR AEFPEGIASY GTDALRFTFC SLASTGRDIK FDMGRVEGYR NFCNKLWNAA NFVFENTEGK DCGAADEPVE LSPVDRWIVS ALQRTEQEVT RQLDAFRFDL AAQALYEFIW DQYCAWYLEL VKPVLWDETA SVERQRGTRR TLVRVLETAL RLAHPFMPFI SEEIWQRLAP LAGKSGPTLM LQPWPLADEA RIDAAAEEDI EWVKALMLGI RQIRGEMNIS MAKRIDVALN NASDSDRRRL EENRPLLTKL AKLESIRVLE AGEEAPLAAT ALVGEMQVLV PMAGLIDKDA ELARLDKEIQ RLEGEVKRVG GKLGNASFVD KAPAEVIAKE RARLNEAEQA LGKLGEQRAR IASL
|
| |