Gene Achl_2023 details

Gene Information

Locus tag: Achl_2023
Symbol: hisS
ID: 7293484
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter chlorophenolicus A6
Kingdom: Bacteria
Replicon accession: NC_011886
Strand:
Start bp: 2281149
End bp: 2282522
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 66%
IMG OID: 643590427
Product: histidyl-tRNA synthetase
Protein accession: YP_002488086
Protein GI: 220912777
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0124] Histidyl-tRNA synthetase
TIGRFAM ID: [TIGR00442] histidyl-tRNA synthetase
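
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names gene_length and protein_length are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2281149
END_BP = 2282522
REPORTED_GENE_LENGTH = 1374      # bp
REPORTED_PROTEIN_LENGTH = 457    # aa


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len):
    """Residue count, assuming the final codon of the CDS is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print("gene length:", length, "reported:", REPORTED_GENE_LENGTH)
    print("protein length:", protein_length(length),
          "reported:", REPORTED_PROTEIN_LENGTH)
```

Under these assumptions the computed values agree with the reported gene length and protein length.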


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.00000000257055
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCACGCA CCGCCTCCCT GTCCGGATTC CCCGAGTGGC TTCCCGAGGA GCGGCTGGTG 
GAGATTCATG TCCTGGATAC CCTGCGCCGG GTCTTCGAAC TGCACGGTTT CGCCTCCATT
GAGACGCGCT CGGTGGAAAC AGTGGGACAG CTGCTGCGCA AGGGCGAGAT CGACAAGGAA
GTGTACGGAC TCAGCCGCCT GCAGGAGGAC GAGGGCGAGA ACCCGGTCAA AGGCGGCAAA
GCGGACCCGC ACGCGCTTGC CCTGCACTTT GACCTCACGG TTCCCTTCGC CCGCTACGTC
GTCGAGAATG CCGGCTACCT GGCCTTCCCG TTCCGGCGCT ACCAGATACA GAAAGTCTGG
CGCGGCGAAC GCCCCCAGGA AGGCCGCGCC CGTGAATTCA CCCAGGCGGA TATTGACGTC
GTCGGCGATG GCGAGTTGCC GTTCCGCTAT GACGTTGAGA TCGCCCTGGT CATCGCCGAG
GCACTCAGCG CGCTTCCCAT CCCGGACTTC CAGCTGCGGG TCAACAACCG CAAACTGGCA
GAGGGCTTCT ACCAGGGCAT CGGACTGACG GACACCGCAG GGGTCCTGCG CAGCATCGAC
AAACTGGAAA AAATCGGTCC GGCCAAGGTT GCCGAACTCC TGAAATCCGA ACTTGGTGCC
ACCGACGAGC AGGCACAGAA GGCCCTGCAG CTTGCCGGTA TCCGCACCGG GGACCTGTCC
TTCGTGGCCC AGGTCCGTGC CCTCGGCGTC AGCAACGACC TGCTCGAGGA GGGCCTTAGC
GAGCTGGAGC AGGTCATCGA CGCCGCCGTC CAGCGGGCTC CCGGCAAGGT GCTGGCGGAC
CTCAGTATTG CCCGCGGACT GGACTACTAC ACGGGCACCG TGGTGGAGAC CGTCCTGTTG
GGTCATGAAC AGCTGGGTTC CATCTGCTCC GGCGGAAGGT ATGACGCCCT GGCCTCCAAG
GGCAACCGGA AGTTCCCCGG CGTCGGCCTG TCCATCGGTG TGACCCGGCT GGTGTCCCGG
ATCTTGAGCC AGGAGCTGGC CAAAGCCTCC CGTTCCGTTC CCACCGCCGT GCTGGTGGCC
CTGTCGCACG ACGACAGCTG GGGCGCTGCG CAGGACGTCG CCGCCCAGTT GCGCAGCCGG
GGGATTCCCA CCGAGGTCGC CGCCAAAGCG GAAAAGTTCG GCAAGCAGAT CAAGTTCGCC
GACCGCCGGG GCATCCCGTT CGTCTGGTTC ACGGACGACG ACGGCACGCA CCAGGTCAAG
GACATCCGGT CCGGTGAACA GGTGGTCGCT GCCCCGGAGA CGTGGATGCC GCCGGCCGCC
GACCTCGTGG TACAGGTGGC CACCGCCGGC CCCGTTCCCG CCCAGGTCTC CTGA
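
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any statistic such as GC content can be computed. The following is a minimal sketch of that step, not the method used by the source database. The helper names clean_sequence and gc_fraction are hypothetical. Only the first row of the sequence is used, so the printed value is not expected to equal the 66% reported for the whole gene.

```python
def clean_sequence(text):
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence as printed in this record.
FIRST_ROW = "ATGGCACGCA CCGCCTCCCT GTCCGGATTC CCCGAGTGGC TTCCCGAGGA GCGGCTGGTG"

if __name__ == "__main__":
    seq = clean_sequence(FIRST_ROW)
    print(len(seq), "bases, GC fraction:", gc_fraction(seq))
```

Passing the full gene sequence, all rows concatenated, to clean_sequence and gc_fraction gives a value that can be compared with the GC content field.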
 
Protein sequence
MARTASLSGF PEWLPEERLV EIHVLDTLRR VFELHGFASI ETRSVETVGQ LLRKGEIDKE 
VYGLSRLQED EGENPVKGGK ADPHALALHF DLTVPFARYV VENAGYLAFP FRRYQIQKVW
RGERPQEGRA REFTQADIDV VGDGELPFRY DVEIALVIAE ALSALPIPDF QLRVNNRKLA
EGFYQGIGLT DTAGVLRSID KLEKIGPAKV AELLKSELGA TDEQAQKALQ LAGIRTGDLS
FVAQVRALGV SNDLLEEGLS ELEQVIDAAV QRAPGKVLAD LSIARGLDYY TGTVVETVLL
GHEQLGSICS GGRYDALASK GNRKFPGVGL SIGVTRLVSR ILSQELAKAS RSVPTAVLVA
LSHDDSWGAA QDVAAQLRSR GIPTEVAAKA EKFGKQIKFA DRRGIPFVWF TDDDGTHQVK
DIRSGEQVVA APETWMPPAA DLVVQVATAG PVPAQVS
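
The protein sequence can be checked against the gene sequence by translating the CDS codon by codon. The sketch below is illustrative only. It builds the codon table from the standard amino acid assignments, which translation table 11 shares with the standard code as far as the author of this sketch is aware, since the two differ in permitted start codons. The names CODON_TABLE, clean_sequence and translate are hypothetical, and only the first row of each sequence is compared.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First row of the gene sequence and the first two protein blocks of this record.
GENE_ROW = "ATGGCACGCA CCGCCTCCCT GTCCGGATTC CCCGAGTGGC TTCCCGAGGA GCGGCTGGTG"
PROTEIN_START = "MARTASLSGF PEWLPEERLV"

if __name__ == "__main__":
    print(translate(GENE_ROW) == clean_sequence(PROTEIN_START))
```

The same comparison can be run on the complete sequences by concatenating all rows before calling translate.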