Gene Avin_11620 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_11620 
SymbolvalS 
ID7760104 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp1115343 
End bp1118177 
Gene Length2835 bp 
Protein Length944 aa 
Translation table11 
GC content66% 
IMG OID643804064 
Productvalyl-tRNA synthetase 
Protein accessionYP_002798366 
Protein GI226943293 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0525] Valyl-tRNA synthetase 
TIGRFAM ID[TIGR00422] valyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACAAGA CCTACCAGCC GCACGCAATC GAATCCCGTT GGTACGCCGA GTGGGAGTCG 
AAGAACTACT TCGCCCCGCA GGGCAGCGGC GAACCCTACA CCATCATGAT TCCGCCGCCG
AACGTCACCG GCAGCCTGCA CATGGGCCAC GGCTTCAACA ACGCGATCAT GGACGCGCTG
ATCCGCTTCC GCCGCATGCA GGGGCGCAAT ACCCTGTGGC AGCCGGGCAC CGACCACGCC
GGCATCGCCA CCCAGATGGT AGTGGAGCGC CAACTGGCGG CCCTGGGCCT CGACCGCCAC
GCGCTCGGTC GCGAGAAGTT TCTCGACAAG GTCTGGGAAT GGAAGGAGCA GTCCGGCGGC
ACCATCACCC GGCAGATTCG CCGCCTCGGC AGCTCGGTGG ACTGGTCGCG GGAACGCTTC
ACCATGGACG AGGGCCTTTC CGAAGCGGTC AAGGAAGCCT TCGTCCGCCT CCACGAGGAC
GGCCTGATCT ACCGCGGCAA GCGCCTGGTC AACTGGGACA CCAAGCTGCA CACCGCGATC
TCCGACCTGG AAGTGGAGAA CCACGACGAA AAGGGCCACC TCTGGCACCT GCGCTACCCG
CTGGCCGACG ACGCCTGCAC CGCCGAAGGC AAGGACTACC TGGTGGTCGC AACCACCCGC
CCGGAAACCA TGCTGGGCGA CGCCGCCGTC GCCGTACACC CGGAGGACGA GCGCTACCGG
GACCTGATCG GCCGCCACGT GCTGCTGCCG CTGGTCAACC GCCTGATCCC GATCGTCGCC
GACGAGTACG TCGACCGCGA ATTCGGCACC GGCTGCGTGA AGATCACCCC GGCCCACGAC
TTCAACGACT ACGAGGTCGG CAAGCGCCAC CACCTGCCGC TGATCAACAT CTTCGACAAG
AACGCCGGCA TCCTGGCCCA GGCCCAGGTG TTCGACATCG ACGGCACGCC GAACACCCGC
GTCGCCCCCA GCCTGCCGGA CGGCTACGCC GGCATGGACC GCTTCGACGC GCGCAAGGCC
ATCGTCGCCG ACTTCGAGGG CATGGGCCTT CTCGAGAAGA TCGACGACCA TGCCCTGAAG
GTGCCGCGCG GCGACCGTTC CGGCACCATC ATCGAACCCT GGCTGACCGA CCAGTGGTAC
GTCTCCACCA AACCGCTGGC CGAGAAGGCC ATCGCCGCCG TCGAGGACGG TTCCATCCAG
TTCGTGCCCA GGCAGTACGA GAACATGTAT TTCTCCTGGA TGCGCGACAT CCAGGACTGG
TGCATCAGCC GCCAGCTCTG GTGGGGCCAC CGCATCCCGG CCTGGTACGA CGAGGCCGGC
AACGCCTACG TCGGCCGCGA CGAGGCGGAA GTGCGCAGCA AGTACGCGAT CCGCAACGAC
GAGCCGCTGC GCCAGGACGA AGACGTGCTG GACACCTGGT TCAGCTCCGG CCTGTGGACC
TTCTCCACCC TCGGCTGGCC GCAGCAGACC GAGTTCCTCA AGACCTTCCA CCCCACCGAT
GTGCTGGTCA CCGGCTTCGA CATCATCTTC TTCTGGGTCG CCCGGATGAT CATGCTGTCC
CTGCACCTGA CCGGGCAGAT CCCCTTCAGG ACCGTCTACG TCCATGGCCT GGTACGCGAC
AGCCAAGGCC ACAAGATGTC CAAGTCCAAG GGTAACGTGC TCGACCCGCT GGACATCGTC
GACGGCATCG ACCTGGAAAG CCTGGTGACC AAGCGCACCA GCGGCATGAT GCAGCCCAAG
CTCGCCGAGA AGATCGCCAA GCAGACCCGC GCCGAATTCC CCGAGGGCAT CGCCAGCTAC
GGCACCGACG CGCTGCGTTT CACCTTCTGC TCGCTGGCCT CCACCGGCCG CGACATCAAG
TTCGACATGG GCCGGGTCGA GGGCTACCGC AACTTCTGCA ACAAGCTGTG GAACGCCGCC
AACTTCGTCT TCGAGAACAC CGAGGGCAAG GATTGCGGCG CCGCCGACGA ACCCGTCGAG
CTGTCCCCGG TGGACCGCTG GATCGTCTCG GCGCTGCAGC GCACCGAGCA GGAGGTGACC
CGCCAGCTCG ACGCCTTCCG CTTCGACCTC GCTGCCCAGG CGCTCTACGA GTTCATCTGG
GACCAGTACT GCGCCTGGTA CCTGGAGCTG GTCAAGCCGG TGCTCTGGGA CGAGACTGCT
AGCGTCGAGC GCCAGCGCGG CACTCGGCGC ACCCTGGTGC GGGTGCTGGA AACCGCCCTG
CGCCTGGCGC ACCCCTTCAT GCCTTTCATC AGCGAGGAAA TCTGGCAGCG CCTCGCGCCG
CTGGCCGGCA AGTCCGGCCC GACCCTGATG CTGCAACCCT GGCCGCTGGC CGACGAGGCG
CGCATCGACG CGGCCGCCGA GGAGGACATC GAGTGGGTCA AGGCGCTGAT GCTCGGCATA
CGGCAGATCC GCGGCGAAAT GAACATCTCC ATGGCCAAGC GCATTGACGT GGCGCTGAAC
AACGCCTCGG ACAGCGACCG GCGGCGCCTC GAAGAAAACC GGCCGTTGCT GACGAAACTG
GCCAAGCTGG AATCCATCCG CGTGCTGGAA GCCGGCGAGG AAGCACCGCT GGCCGCCACC
GCGCTGGTCG GCGAGATGCA GGTGCTGGTG CCGATGGCCG GGCTGATCGA CAAGGACGCC
GAACTGGCCC GTCTGGACAA GGAGATCCAG CGCCTGGAAG GCGAGGTCAA GCGCGTCGGC
GGCAAGCTGG GCAACGCCAG TTTCGTCGAC AAGGCCCCGG CCGAGGTGAT CGCCAAGGAG
CGCGCCAGAC TGAACGAGGC CGAACAGGCC TTGGGCAAGC TGGGCGAACA GCGCGCGCGC
ATCGCCAGCC TCTGA
 
Protein sequence
MDKTYQPHAI ESRWYAEWES KNYFAPQGSG EPYTIMIPPP NVTGSLHMGH GFNNAIMDAL 
IRFRRMQGRN TLWQPGTDHA GIATQMVVER QLAALGLDRH ALGREKFLDK VWEWKEQSGG
TITRQIRRLG SSVDWSRERF TMDEGLSEAV KEAFVRLHED GLIYRGKRLV NWDTKLHTAI
SDLEVENHDE KGHLWHLRYP LADDACTAEG KDYLVVATTR PETMLGDAAV AVHPEDERYR
DLIGRHVLLP LVNRLIPIVA DEYVDREFGT GCVKITPAHD FNDYEVGKRH HLPLINIFDK
NAGILAQAQV FDIDGTPNTR VAPSLPDGYA GMDRFDARKA IVADFEGMGL LEKIDDHALK
VPRGDRSGTI IEPWLTDQWY VSTKPLAEKA IAAVEDGSIQ FVPRQYENMY FSWMRDIQDW
CISRQLWWGH RIPAWYDEAG NAYVGRDEAE VRSKYAIRND EPLRQDEDVL DTWFSSGLWT
FSTLGWPQQT EFLKTFHPTD VLVTGFDIIF FWVARMIMLS LHLTGQIPFR TVYVHGLVRD
SQGHKMSKSK GNVLDPLDIV DGIDLESLVT KRTSGMMQPK LAEKIAKQTR AEFPEGIASY
GTDALRFTFC SLASTGRDIK FDMGRVEGYR NFCNKLWNAA NFVFENTEGK DCGAADEPVE
LSPVDRWIVS ALQRTEQEVT RQLDAFRFDL AAQALYEFIW DQYCAWYLEL VKPVLWDETA
SVERQRGTRR TLVRVLETAL RLAHPFMPFI SEEIWQRLAP LAGKSGPTLM LQPWPLADEA
RIDAAAEEDI EWVKALMLGI RQIRGEMNIS MAKRIDVALN NASDSDRRRL EENRPLLTKL
AKLESIRVLE AGEEAPLAAT ALVGEMQVLV PMAGLIDKDA ELARLDKEIQ RLEGEVKRVG
GKLGNASFVD KAPAEVIAKE RARLNEAEQA LGKLGEQRAR IASL