Gene M446_2647 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_2647 
SymbolhisS 
ID6135341 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp2937536 
End bp2939134 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content73% 
IMG OID641642861 
Producthistidyl-tRNA synthetase 
Protein accessionYP_001769520 
Protein GI170740865 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.708997 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.141234 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAACT CCGCGAAGGC CAGTTCCGCC AAGGCCAGCC CCGCCAAGGC CAGCGCCGCC 
AAGGCAGGCT CCGGGAAGCC CGGCTCCGGG AAGCCCGGCA CCCTCAAGCC CCGCCTGCCC
CGCGGCCTCG TCGACCGCGG CCCGGCCGAG ATCGCCGCGA CCCAGCGGAT GCTGCAGACC
ATCCGCGAGA GCTTCGAACT CTACGGCTTC GAGGCGGTGG AGACCCCCTT CATCGAGTAC
ACCGAGGCGC TCGGGAAATT CCTGCCCGAC CTCGACCGCC CGAACGAGGG CGTGTTCTCG
TTTCAGGACG ACGACGAGGC GTGGCTGTCC CTGCGCTACG ACCTGACCGC GCCGCTCGCC
CGCTACGTCG CCGAGCATTT CGACGCCCTG CCCAAGCCCT ACCGCAGCTA CCGGGCGGGC
TACGTCTTCC GCAACGAGAA GCCGGGGCCC GGCCGCTTCC GCCAGTTCAT GCAGTTCGAC
GCCGACATCG TCGGCGCCCC GACCGTCGCG GCCGATGCGG AGATCTGCAT GATGGCCGCC
GACACGCTGG AGCGGGTCGG CATCGCCCGC GGCGACTATG TGGTGAAGGT CAACAACCGC
AAGGTGCTCG ACGGCGTCAT GGAGGCGATC GGGCTCGGCG GTGACGACCA GGCCGGGCGC
CGCCTCACGG TGCTGCGGGC CATCGACAAG CTCGACCGCC TCGGGGTGGA CGGGGTGCGG
CTGCTGCTGG GGCCGGGCCG CCGGGACGAG AGCGGCGACT TCACCAAGGG GGCCGGGCTC
CCGGACGAGG CGATCGACCG CATCATCCGC TACGTCTCCT TCCAGGCGGC GCCCACGGAC
GGGGGCGACC GGCTCGCCTT CTGGGAGAAT TTCTTCGGCG GCTGGGCCGA GGTGGTGGGC
GGCTCCGAGA CCGGCCGCCA GGGCATCGCC GAATTGCACG CGATCCTGCG CCTGTGCGAG
GCGGCGGGCT ACGGCCACGA CCGGGTGCGG GCCGACCCCT CGGTGGTGCG CGGCCTCGAA
TACTACACGG GGCCGGTCTA CGAGGCCGAG CTGACCTTCC CGGTCGTGGG CGAGGACGGG
CAGACCGTGC GCTTCGGCTC GGTGGCCGGG GGAGGGCGCT ACGACGGCCT GGTCGGGCGC
TTCCGGGCCG AGCCCGTCCC GGCGACGGGC TTCTCGATCG GCGTGTCGCG CCTGTTCGCG
GCCCTGCAGA TCGTCGGGAG CCCGATCGTG GCGGGCGCGG CCGGGCCCGG ACCCGTGGTG
GTGCTGGTGC TCGACCGCGA GGAGATCGCG AGCTACCAGG CCCTGGTCGC CGCCCTGCGC
CAGGCCGGCA TCCGGTCCGA ACTCTATCTC GGGGCCGCCG GCCTCAAGGC GCAGATGAAG
TACGCCGACC GCCGCGGCGC GCCGGCCGTG GTGATCCAGG GCAGCGACGA GCGGGCCCGC
GGCGAGGTCC AGATCAAGGA CCTGATCGAG GGCGCCCGCG CCGCCGAGGC GATCGCCAGC
AACGCCGAGT GGAAGGCCGC CCGGCCGGCG CAGTTCTCGG TGCCGGGGGC CGAGATGGTC
GCCCGCCTGC GCGAGGTGCT GGGCCGGCAT TTCGGGTGA
 
Protein sequence
MSNSAKASSA KASPAKASAA KAGSGKPGSG KPGTLKPRLP RGLVDRGPAE IAATQRMLQT 
IRESFELYGF EAVETPFIEY TEALGKFLPD LDRPNEGVFS FQDDDEAWLS LRYDLTAPLA
RYVAEHFDAL PKPYRSYRAG YVFRNEKPGP GRFRQFMQFD ADIVGAPTVA ADAEICMMAA
DTLERVGIAR GDYVVKVNNR KVLDGVMEAI GLGGDDQAGR RLTVLRAIDK LDRLGVDGVR
LLLGPGRRDE SGDFTKGAGL PDEAIDRIIR YVSFQAAPTD GGDRLAFWEN FFGGWAEVVG
GSETGRQGIA ELHAILRLCE AAGYGHDRVR ADPSVVRGLE YYTGPVYEAE LTFPVVGEDG
QTVRFGSVAG GGRYDGLVGR FRAEPVPATG FSIGVSRLFA ALQIVGSPIV AGAAGPGPVV
VLVLDREEIA SYQALVAALR QAGIRSELYL GAAGLKAQMK YADRRGAPAV VIQGSDERAR
GEVQIKDLIE GARAAEAIAS NAEWKAARPA QFSVPGAEMV ARLREVLGRH FG