Gene Bcep18194_A5112 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_A5112 
SymbolhisS 
ID3750320 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007510 
Strand
Start bp2154534 
End bp2155874 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content64% 
IMG OID637763408 
Producthistidyl-tRNA synthetase 
Protein accessionYP_369350 
Protein GI78066581 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.238333 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGAAC AAAAACGCAA GATCGAGAAG CTCACCGGCG TCAAGGGCAT GAACGACATC 
CTTCCGCAGG ATGCCGGCCT TTGGGAGTTC TTCGAAGCTA CGGTGAAATC GCTGCTGCGC
GCATACGGTT ACCAGAACAT CCGCACGCCG ATCGTCGAGC ACACGCAGCT CTTCACGCGC
GGCATCGGTG AGGTCACCGA CATCGTCGAA AAAGAGATGT ACAGCTTCAC CGACGCGCTG
AACGGCGAGA ACCTGACGAT GCGCCCGGAA AACACCGCGG CCGTCGTGCG CGCGTCGATC
GAGCACAACA TGCTGTACGA CGGCCCGAAG CGCCTGTGGT ACATCGGCCC GATGTTCCGT
CACGAGCGTC CGCAGCGCGG CCGCTATCGC CAGTTCCACC AGGTCGGCGT CGAGGCGCTC
GGCTTCGCGG GTCCTGATGC GGATGCCGAG ATCATCATGA TGTGCCAGCG CCTGTGGGAC
GACCTCGGCC TGACCGGCAT CAAGCTCGAG ATCAACTCGC TCGGCCTCGC CGAAGAGCGT
GCCGCGCATC GCGTCGAACT GATCAAGTAC CTCGAGCAGT TTGCCGACGT GCTCGACGAG
GATGCGAAGC GCCGCCTGTA TACGAACCCG CTGCGCGTGC TCGACACGAA GAACCCCGCG
CTGCAGGAAA TCGCGCAGAA CGCGCCGAAG CTGATCGACT TCCTCGGCGA CGAGTCGCGC
GCGCACTTCG AAGGGCTGCA GCGCCTGCTG CTCGCGAACA ACATTCCGTT CAAGATCAAC
CCGCGTCTCG TGCGTGGCCT GGACTACTAC AACCTGACCG TGTTCGAGTG GGTGACCGAC
AAGCTCGGCG CGCAGGGCAC CGTCGCAGCC GGTGGCCGCT ACGATCCGCT GATCGAGCAG
CTCGGCGGCA AGCCGACCGC CGCGTGCGGC TGGGCAATGG GCATCGAGCG GATCCTCGAA
CTGCTGAAGG AAGACGATCT CGCTCCCGAG CAGGAAGGCG TCGACGTATA CGTCGTGCAT
CAGGGCGAGA CCGCGCGCGA ACAGGCGTTC ATCGCGGCCG AGCGCCTGCG CGATACGGGC
CTCGACGTGA TCTTCCACTG CAGCGCCGAC GGCGCGCCGG CGAGCTTCAA GTCGCAAATG
AAACGGGCCG ACGCAAGCGG CGCCGCGTTC GCGGTGATCT TCGGTGAAGA AGAGGTTGCA
AACGGCACGG TGGGCGTCAA AGCGCTGCGC GGTGCGGGCG GAGACGGGGA AAAGAACGTT
CAGCAGACCG TACCGGTCGA AGGCTTGACC GAATTCCTAA TCAATGCGAT GGTTGCATCC
GCCGAAGACG GCGACGACTG A
 
Protein sequence
MTEQKRKIEK LTGVKGMNDI LPQDAGLWEF FEATVKSLLR AYGYQNIRTP IVEHTQLFTR 
GIGEVTDIVE KEMYSFTDAL NGENLTMRPE NTAAVVRASI EHNMLYDGPK RLWYIGPMFR
HERPQRGRYR QFHQVGVEAL GFAGPDADAE IIMMCQRLWD DLGLTGIKLE INSLGLAEER
AAHRVELIKY LEQFADVLDE DAKRRLYTNP LRVLDTKNPA LQEIAQNAPK LIDFLGDESR
AHFEGLQRLL LANNIPFKIN PRLVRGLDYY NLTVFEWVTD KLGAQGTVAA GGRYDPLIEQ
LGGKPTAACG WAMGIERILE LLKEDDLAPE QEGVDVYVVH QGETAREQAF IAAERLRDTG
LDVIFHCSAD GAPASFKSQM KRADASGAAF AVIFGEEEVA NGTVGVKALR GAGGDGEKNV
QQTVPVEGLT EFLINAMVAS AEDGDD