Gene Namu_3356 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3356 
SymbolhisS 
ID8448971 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp3693273 
End bp3694547 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content72% 
IMG OID645042433 
Producthistidyl-tRNA synthetase 
Protein accessionYP_003202673 
Protein GI258653517 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.00613181 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.200159 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGATT TTGTCGCCCC CAAGGGCATC CCCGAGTACT ACCCGCCCGT CTCCCGCACC 
TTCGAGGCGA TCCGCCGGAC GTTGCTGGCC GCGGCCGACC GGGCCGGGTA CGGACCCATC
GAGCTGCCGG TGTTCGAGGA CACCGGGCTG TTCGCCCGCG GTGTCGGCGA GTCCACCGAC
GTGGTCTCCA AGGAGATGTA CACCTTCGCC GACCGCGGTG GCCGCTCGGT CACCCTGCGC
CCGGAGGGCA CCGCCGGCGT CATGCGGGCG GTGATCGAGC ACAGCCTGGA TCGCGGTCCG
CTGCCGGTCA AACTGGTCTA CGCCGGCCCG TTCTTCCGCT ACGAGCGGCC GCAGGCCGGG
CGGTACCGGC AGTTGCAGCA GGTCGGCGTG GAGGCCATCG GCAGCGACGA TCCGGCCCTG
GACGCCGAGG TGATCGCCAT CGCCGACGAG GGGTTCCGCG ACGTCGGGCT GTCCCAGTTC
CGGCTGGACC TGACCTCGCT GGGTGATGCG GTCTGCCGGC CCGCCTACCG GGAGCGGCTC
ATCGCCTTCC TGGACGGCCT GGACCTGGAC GAGCCCACCC GCGAGCGGGC CCGGCTCAAC
CCGTTGCGGG TGCTGGACGA CAAACGGCCG GCCATGCAGG AGCAGCTGGC CGGGGCTCCG
CTGATGCTCG ACCACCTGTG CGACGCCTGC CGCGAGCACT TCGACCGGGT CCGGCAGGTG
CTGGACGCGC TGTCGGTGCC CTACGAGCTC AACCCGCGGA TGGTTCGCGG CCTGGATTAC
TACACCCGCA CCACCTTCGA GTTCGTGCAC CCATTGCTGG GCGCGCAGTC GGGGATGGGC
GGCGGTGGGC GCTACGACGG GCTGATGGCC GAGCTGGGCG GACAGTCGTT GTCCGGCATC
GGGTTCGGCC TGGGGGTGGA CCGCACGCTG CTGGCCGCCC AGGCCGAAGG CCTGACCGTC
GGCCACCCGG CCCGCTGCGA GATCTTCGGC GTGCCCATGG GTCCGGACTC CTCCCTGCGG
CTGGCCACCC TGGCCGGGGA GCTGCGGCGG GCCGGCTACC GGGTCGACAT GGCCTACGGC
GGGCGCGCGC TCAAGACCGC GATGAAGATG GCCGACGCGT CCGGGGCGGC GCTGGCCCTG
GTGCTCGGTG ATCGGGAGAT CGTCGACGGC ACGGTGGTCG TGCGCGACCT GCGTTCGGGC
GAGCAGAACG CTTACCCGAT GGGCGATCTG GTGCAGGTGG TGGGCCCGCT GCTGACGGCG
GTACCGGTCG TCTGA
 
Protein sequence
MPDFVAPKGI PEYYPPVSRT FEAIRRTLLA AADRAGYGPI ELPVFEDTGL FARGVGESTD 
VVSKEMYTFA DRGGRSVTLR PEGTAGVMRA VIEHSLDRGP LPVKLVYAGP FFRYERPQAG
RYRQLQQVGV EAIGSDDPAL DAEVIAIADE GFRDVGLSQF RLDLTSLGDA VCRPAYRERL
IAFLDGLDLD EPTRERARLN PLRVLDDKRP AMQEQLAGAP LMLDHLCDAC REHFDRVRQV
LDALSVPYEL NPRMVRGLDY YTRTTFEFVH PLLGAQSGMG GGGRYDGLMA ELGGQSLSGI
GFGLGVDRTL LAAQAEGLTV GHPARCEIFG VPMGPDSSLR LATLAGELRR AGYRVDMAYG
GRALKTAMKM ADASGAALAL VLGDREIVDG TVVVRDLRSG EQNAYPMGDL VQVVGPLLTA
VPVV