Gene Avin_13010 details

Gene Information

Locus tag: Avin_13010
Symbol: trpS
ID: 7760243
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 1265328
End bp: 1266689
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 61%
IMG OID: 643804203
Product: tryptophanyl-tRNA synthetase
Protein accession: YP_002798502
Protein GI: 226943429
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0180] Tryptophanyl-tRNA synthetase
TIGRFAM ID: [TIGR00233] tryptophanyl-tRNA synthetase
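
The start, end, gene length and protein length reported in this record are mutually consistent, which a little arithmetic confirms. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of amino acids, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
START_BP = 1265328
END_BP = 1266689

length = gene_length_bp(START_BP, END_BP)
print(length)                     # 1362
print(protein_length_aa(length))  # 453
```

Both results match the Gene Length (1362 bp) and Protein Length (453 aa) fields.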


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGACCACTC GAATTCTCAC CGGCATCACC ACTACCGGTA CTCCGCATCT TGGCAATTAT 
GCCGGTGCCA TTCGCCCGGC AATCGTTGCC AGCCGCGATC CCCAGGCCGA CTCGTTCTAT
TTCCTCGCCG ACTATCACGC ATTGATCAAG TGTGACGACC CGGCGCGCAT CCAGCGCTCA
CGCCTGGAAA TTGCCGCTAC CTGGCTGGCC TGCGGACTCG ATGCGCAGAG GACGACTTTT
TACCGGCAGT CGGACATCCC CGAGATTACC GAGCTGACCT GGCTACTCAC CTGCGTGACG
GCCAAGGGGC TGCTCAACCG CGCGCATGCC TACAAGGCCG CGGTGGACAA GAATCTGGAG
GGCGGAGAAG ACCCCGATAC CGGGGTGACG ATGGGCTTGT TCAGCTATCC AGTGCTGATG
GCGGCGGACA TCCTGATGTT CAACGCGAAC AAGGTGCCGG TCGGTCGCGA TCAGATCCAG
CACGTGGAAA TGGCACGCGA CATCGGACAG CGCTTCAACC ATCTGTTCGG TCGGGGACGG
GAGCTCTTCA CCCTGCCAGA GGCGGTGATC GAGGCAGAGG TCGCTACCCT GCCGGGACTC
GATGGCCGCA AGATGTCGAA GAGTTATGAC AATACCATTC CGCTGTTCGG CTCCAGTCGC
CAGTTGAAAG ATGCCATTGC CCGCATCGTC ACCGATTCGC GGGAGCCGGG CGAGCCGAAG
GATCCGGACG GCTCGCATCT GTTCACGCTC TACCAGGCGT TCGCCACGCC CGAGCAACTC
GACGAATTTC GTGCCGATCT GCTGGCTGGT CTCGCCTGGG GCGAGGCCAA GCAGCGTTTG
CTCCAACTAT TGGAAAACGA GTTGGGCGAG GCTCGTGAGC GCTACCAAAC GCTGATTGCG
AGACCGTTTG ACTTGGAGGA CATTCTTCTC GCCGGTGCGG CAAAGGCGCG CAGGATCGCC
ACCCCTTTCC TCGGCGAGCT GCGCGAGGCG GTCGGGTTAC GCTCGTTCCG CACTGAGGCG
CACAGTCAGA CTGCCGGTGG CAAGAAGAAG GCAGTGAAAA CTGCGCGTTT CGTCAGTTTT
CGTGAGCCCG ACGGAAGTTT CCGCTTCCGT TTTCTGGCCG CCGATGGCGA GGAGTTGCTG
TTGTCGCGGC CTTTTGACGA TCCTAAGGCT GTGGGGCGGA TCAGCCAGCA GTTGATTGCC
CTGGGGGCGG ATGCGCTCGA ACTGCGTGCC GACGAAGGCG CCCAGTTCAG TCTCTGGCTG
GATGGCGAAT GCATGGCCGA CAGCCCGACG TATGCCGATC CGGATGCTTT GGAGGCAGCC
ATGTTGCGCC TGCGGGAAGC GATTTCAGGG CTTGCCGATT GA
 
Protein sequence
MTTRILTGIT TTGTPHLGNY AGAIRPAIVA SRDPQADSFY FLADYHALIK CDDPARIQRS 
RLEIAATWLA CGLDAQRTTF YRQSDIPEIT ELTWLLTCVT AKGLLNRAHA YKAAVDKNLE
GGEDPDTGVT MGLFSYPVLM AADILMFNAN KVPVGRDQIQ HVEMARDIGQ RFNHLFGRGR
ELFTLPEAVI EAEVATLPGL DGRKMSKSYD NTIPLFGSSR QLKDAIARIV TDSREPGEPK
DPDGSHLFTL YQAFATPEQL DEFRADLLAG LAWGEAKQRL LQLLENELGE ARERYQTLIA
RPFDLEDILL AGAAKARRIA TPFLGELREA VGLRSFRTEA HSQTAGGKKK AVKTARFVSF
REPDGSFRFR FLAADGEELL LSRPFDDPKA VGRISQQLIA LGADALELRA DEGAQFSLWL
DGECMADSPT YADPDALEAA MLRLREAISG LAD
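
The protein sequence can be reproduced from the gene sequence by removing the grouping spaces and translating codon by codon. The following is a minimal sketch rather than a reference implementation. It assumes the standard amino-acid assignments, which translation table 11 shares with table 1; the two tables differ in their permitted start codons, which this sketch does not model. The names `CODON_TABLE`, `clean` and `translate` are hypothetical. Only the first line of the gene sequence is translated here.

```python
from itertools import product

# Codons in the conventional TCAG order, paired with their amino acids.
# '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGACCACTC GAATTCTCAC CGGCATCACC ACTACCGGTA CTCCGCATCT TGGCAATTAT"
print(translate(first_line))  # MTTRILTGITTTGTPHLGNY
```

The output corresponds to the first two blocks of the protein sequence, MTTRILTGIT TTGTPHLGNY.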