Gene BURPS668_3371 details


Gene Information

Locus tag: BURPS668_3371
Symbol: tyrS
ID: 4884380
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 3306103
End bp: 3307344
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 65%
IMG OID: 640129299
Product: tyrosyl-tRNA synthetase
Protein accession: YP_001060382
Protein GI: 126440034
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0162] Tyrosyl-tRNA synthetase
TIGRFAM ID: [TIGR00234] tyrosyl-tRNA synthetase
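
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information fields above.
START_BP = 3306103
END_BP = 3307344


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon
    that is not counted as a residue."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    n_bp = gene_length(START_BP, END_BP)
    print(n_bp, protein_length(n_bp))  # prints: 1242 413
```

Under these assumptions the computed values agree with the reported gene length (1242 bp) and protein length (413 aa).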


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0306351
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCACCG ATCCCACTTC CAAGCCCGCC TTCCCGATCA CCGATGAAGT CCGCCATGCG 
CTCGCCGTCA CGAAGCGCGG TGTTGACGAG CTGCTGATCG AGGAAGAGTT CGCGCAGAAG
CTCGCGAAAA GCGCGGCGAC GGGCAAGCCG CTGCGCATCA AGCTGGGCCT CGATCCGACG
GCGCCCGACA TCCACCTCGG CCACACGGTC GTGCTGAACA AGATGCGCCA GTTGCAGGAT
CTCGGCCATA CGGTGATTTT CCTGATCGGC GATTTCACGT CGCTGATCGG CGATCCGTCG
GGCCGCAACG CGACCCGCCC GCCGCTCACG CGCGAGCAGA TCGAATCGAA CGCGAGGACC
TACTTCGAGC AGGCCGCGCT CGTGCTCGAT CGCGAGAAGA CCGAAATTCG CTACAACAGC
GAGTGGTCGA TGCCGCTCGG CGCGGATGGG ATGATCAAGC TCGCGTCGCG CTACACGGTC
GCGCGGATTC TCGAGCGCGA GGATTTCACG AAACGCTTCC AAGGCGGCAT CCCGATCTCG
ATCCATGAAT TCCTGTACCC GCTGATGCAG GGTTACGATT CGGTCGCGCT GAACGCCGAT
CTCGAGCTCG GCGGCACCGA CCAGAAATTC AACCTGCTCG TCGGCCGCGA GCTGCAGAAG
CAATACGGCC AGGAGCAGCA GTGCATCCTG ACGATGCCGC TGCTCGAAGG CCTCGACGGC
GTCGAGAAGA TGTCGAAATC GAAGGGCAAC TACGTCGGCA TCAGCGAGAA GCCGACCGAC
ATGTTCGGCA AGCTGATGAG CATCTCGGAC GTGCTGATGT GGCGCTACTT CGAGCTGCTG
TCGTTCCGCA GCCTCGACGA GATCGCGCGG TTCCGCGGCG AGGCCGAAGG CGGGCGCAAC
CCGCGCGACT TCAAGGTGAT GCTCGCGCAG GAGATCGTCG CGCGTTTCCA TTCGCAGGCC
GACGCCGAAC GCGCGCTCGA GGACTTCAAC CATCGCGCGA AGGGCGGCGT GCCCGACGAT
ATCCCGGCCG TGACGCTCGC CGGCGCGCCG CTCGCGATCG GCCAGTTGCT GAAGCAGGCG
GGGCTCGTGC CTTCGACGAG CGAGGCGCTG CGCAACATCG AGCAGGGCGG TGTGAAGATC
GACGGCGCGA CGGTGTCCGA CAAGGCGCTG AAAGTCGACG CGGGCGAGTT CGTCGTGCAG
GTCGGCAAGC GCCGCTTCGC GCGCGTGACG CTTACCGCAT GA
 
Protein sequence
MSTDPTSKPA FPITDEVRHA LAVTKRGVDE LLIEEEFAQK LAKSAATGKP LRIKLGLDPT 
APDIHLGHTV VLNKMRQLQD LGHTVIFLIG DFTSLIGDPS GRNATRPPLT REQIESNART
YFEQAALVLD REKTEIRYNS EWSMPLGADG MIKLASRYTV ARILEREDFT KRFQGGIPIS
IHEFLYPLMQ GYDSVALNAD LELGGTDQKF NLLVGRELQK QYGQEQQCIL TMPLLEGLDG
VEKMSKSKGN YVGISEKPTD MFGKLMSISD VLMWRYFELL SFRSLDEIAR FRGEAEGGRN
PRDFKVMLAQ EIVARFHSQA DAERALEDFN HRAKGGVPDD IPAVTLAGAP LAIGQLLKQA
GLVPSTSEAL RNIEQGGVKI DGATVSDKAL KVDAGEFVVQ VGKRRFARVT LTA
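
The sequences above are laid out in space-separated blocks of ten characters. As a rough sketch of how such a listing might be processed, the code below strips the spacing, computes the GC fraction, and translates the coding sequence. It is an illustration under stated assumptions, not the method used to generate this record: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares for elongation codons (table 11 differs in which codons may act as initiators, which does not matter here because the gene starts with ATG).

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Translation table 11 uses the
# same amino-acid assignments as the standard code for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First row of the gene sequence listed above.
    first_row = "ATGAGCACCG ATCCCACTTC CAAGCCCGCC"
    print(translate(first_row))  # prints: MSTDPTSKPA
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_fraction` on the same input can be compared against the reported GC content.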