Gene Daro_2984 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_2984 
SymbolhisS 
ID3568518 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3221828 
End bp3223117 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content60% 
IMG OID637681455 
Producthistidyl-tRNA synthetase 
Protein accessionYP_286184 
Protein GI71908597 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones65 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCAAA CCTTGCAAGC CGTGCGCGGG ATGAATGATG TCCTGCCCGA CGAAGCTGAA 
TTCTGGGAAC TGTTTGAGGA CACCATCCGT TCTTGGCTGA AGGGCTACGG CTATCGTCCG
ATCCGTATGC CGATCGTCGA GCCGACGCCG CTGTTCAAGC GTGCCATCGG TGAGGTGACC
GATATCGTCG AGAAGGAAAT GTATTCCTTT GTCGATGGTT TGAACGGTGA GGCACTGACG
CTGCGTCCAG AAGGTACCGC TGGCTGCGTG CGAGCCGTCA TCGAACACAA CCTGGCCGCA
CGCCAGACGC AGCGGCTCTA CTACATTGGC CAGATGTTCC GACACGAGCG GCCACAAAAA
GGGCGCTATC GCCAGTTCCA CCAGGTCGGT GTCGAGTCTT TTGGCATGGC CGGACCGGAC
ATCGATGCCG AAATGATCCT GATGGGCGCA CGCCTGTGGG CCGATCTCGG CCTGGATGGC
ATCGAACTGC AGCTCAACAG TCTTGGCCAG CCGGAAGAAC GGGCCCTGCA CCGTGCCGCG
CTGATCACCT ATTTCGAGGA AAACGCCGAA CTGCTCGACG AGGATGCCAA ACGTCGCCTG
CATACCAATC CGCTGCGTAT TCTTGATACC AAGAATCCGG CGATGCAGGA ACTGTGCGCT
GCGGCCCCGA AACTGATCGA TTACCTCGGC GCCGAGTCGC TGGCGCATTT CGAGGGCGTC
CAGCGCGTCC TGCGCGATGC CGGCGTGCCA TTCACGATCA ACCCGCGTCT AGTGCGTGGC
CTCGACTATT ACAACCTGAC CGTCTTCGAA TGGGTGACCG ACAAACTCGG TGCCCAAGGC
ACGGTCTGCG CTGGCGGCCG TTACGACGGA CTGGTCGAGC AACTGGGTGG CAAGCCAACG
CCGGCCTGCG GTTTTGCCAT GGGGGTCGAG CGCCTGATCG CCTTGATCCG GGAATCAGGC
GGCGAACCGG CGGCGCCGGC CCCTGACGTT TACCTTGTGC ATCAGGGTGA AGCGGCTGCC
CGCCAGGCTT TCCGGGTTGC CGAAGGCCTG CGTGACCAGG GTATCAATGT ATTGCAGCAT
TGCGGCGGCG GCAGCTTCAA GTCGCAGATG AAAAAGGCCG ACGGCAGCGG TGCGACCTTT
GCTGTCATCA TTGGTGATGA CGAAGCGGCG ACCGGAGAGG CGCAACTGAA ATCGTTGCGT
GCAGAAGGCT CGGCACAATT GAAACTGAAA GTCGATGATC TGGCCGAGGC CATCATCGGA
CAACTGATTG ATTCGGACGA AGAGGAATAA
 
Protein sequence
MSQTLQAVRG MNDVLPDEAE FWELFEDTIR SWLKGYGYRP IRMPIVEPTP LFKRAIGEVT 
DIVEKEMYSF VDGLNGEALT LRPEGTAGCV RAVIEHNLAA RQTQRLYYIG QMFRHERPQK
GRYRQFHQVG VESFGMAGPD IDAEMILMGA RLWADLGLDG IELQLNSLGQ PEERALHRAA
LITYFEENAE LLDEDAKRRL HTNPLRILDT KNPAMQELCA AAPKLIDYLG AESLAHFEGV
QRVLRDAGVP FTINPRLVRG LDYYNLTVFE WVTDKLGAQG TVCAGGRYDG LVEQLGGKPT
PACGFAMGVE RLIALIRESG GEPAAPAPDV YLVHQGEAAA RQAFRVAEGL RDQGINVLQH
CGGGSFKSQM KKADGSGATF AVIIGDDEAA TGEAQLKSLR AEGSAQLKLK VDDLAEAIIG
QLIDSDEEE