Gene Mext_1555 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_1555 
SymbolhisS 
ID5832177 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp1735257 
End bp1736768 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content70% 
IMG OID641367353 
Producthistidyl-tRNA synthetase 
Protein accessionYP_001639025 
Protein GI163850982 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.84291 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAAGG CCGACACCCT GAAACCCCGC CTGCCGCGCG GCTTCCCCGA CCGCACCGAG 
GCCGACATCC TGGCCCAGGG GCGGATGCTC GACACGATCC GGCAGACCTT CGAACTCTAC
GGCTTCGAGG CGCTGGAAAC CCCCTTCGTC GAGTACACCG AGTCGCTCGG CAAGTTCCTG
CCGGATCTCG ACCGGCCGAA CGAGGGCGTG TTCTCGTTCC AGGACGACGA CGAGCAGTGG
CTCTCGCTGC GCTACGATCT CACCGCGCCG CTCGCCCGCC ACGTCGCCGA GAACTTCGAC
GCGATCCCGA AGCCCTATCG CAGCTACCGG GCGGGCTACG TCTTCCGCAA CGAGAAGCCG
GGGCCGGGCC GCTTCCGCCA GTTCATGCAG TTCGATGCCG ACATCGTCGG CGCGGGCTCC
GTCGCGGCCG ACGCCGAGAC CTGCATGCTG ATGGCCGACA CGCTGGAGCG GCTCGGGCTC
GCGGGCCAGT ACGTGGTCAA GGTCAACAAC CGCAAGGTGC TCGACGGCGT CATGGAAGCG
ATCGGCCTCG CCGGTCCGGA CAAGGCCGGC CAGCGCCTCA CCGTGCTGCG CGCCATCGAC
AAGCTCGACC GCCTCGGCGC CGACGGCGTG CGCCTGCTGC TCGGTCCCGG CCGCAAGGAC
GAGAGCGGCG ACTTCACCAA GGGCGCCGGG CTTGGCGACG ACGCGATTGA GCGCATCCTC
GCCTATGTCG GCTTCGAGGC GAGCCCGCAC GAGGGCGCCG ACCGGATGGC GTTCTGGGAG
AAGTTCTTTG GCTCCTGGCA GGAGGTCGTC GGCACCTCCG AGACCGGCCG CGAGGGTATC
GCCGAACTCC ACGCAATCAT GCGGCTCTGC GAGGCGGCGG GCTACGGCCA TGACCGGGTG
CGGGCCGACC CCTCCGTGGT GCGCGGCCTC GAATACTACA CCGGCCCCGT CTATGAGGCG
GAGCTGACCT TCCCCGTCAC CAACGAGGAC GGACAGACCG TCCGCTTCGG CTCGGTGGCC
GGCGGCGGGC GCTATGACGG GCTCGTCGGT CGCTTCCGCA GCGAGCCGGT GCCGGCGACC
GGCTTTTCCA TCGGCGTCTC GCGGCTGTTC TCGGCCCTGC GGCTGACCAA AAGCCCGTTG
GTGGAAGGCG CGGCCAAGCC CGGCCCGGTC GTGGTGCTGG TGCTCGATCG CGAGAACATC
GCCGAGTATC AGGCGCTGGT CGCGCAGCTG CGCGCGGAGA ACATCCGGGC CGAGCTCTAC
CTCGGTGCGG CCGGGATGAA GGCGCAGATG AAATATGCCG ACCGCCGCCG CGCGCCGGCG
GTGGTGATCC AGGGCTCGAA CGAGCGCGAG GCCGGCGAGG TCCAGATCAA GGACCTGATC
GCGGGCGCCC GCGCCGCCGA AGCCATCGCC AGCAATGCCG AATGGAAGGC CGCCCGCCCG
GCCCAGGTGT CGGTGCCGGT GGAACGGATG GTCGAGACGG TTCGCGAAAC GCTGGCCCGG
CATTTCGGGT GA
 
Protein sequence
MAKADTLKPR LPRGFPDRTE ADILAQGRML DTIRQTFELY GFEALETPFV EYTESLGKFL 
PDLDRPNEGV FSFQDDDEQW LSLRYDLTAP LARHVAENFD AIPKPYRSYR AGYVFRNEKP
GPGRFRQFMQ FDADIVGAGS VAADAETCML MADTLERLGL AGQYVVKVNN RKVLDGVMEA
IGLAGPDKAG QRLTVLRAID KLDRLGADGV RLLLGPGRKD ESGDFTKGAG LGDDAIERIL
AYVGFEASPH EGADRMAFWE KFFGSWQEVV GTSETGREGI AELHAIMRLC EAAGYGHDRV
RADPSVVRGL EYYTGPVYEA ELTFPVTNED GQTVRFGSVA GGGRYDGLVG RFRSEPVPAT
GFSIGVSRLF SALRLTKSPL VEGAAKPGPV VVLVLDRENI AEYQALVAQL RAENIRAELY
LGAAGMKAQM KYADRRRAPA VVIQGSNERE AGEVQIKDLI AGARAAEAIA SNAEWKAARP
AQVSVPVERM VETVRETLAR HFG