Gene MCA2889 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA2889 
SymbolhisS 
ID3103527 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp3079946 
End bp3081214 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content64% 
IMG OID637172017 
Producthistidyl-tRNA synthetase 
Protein accessionYP_115282 
Protein GI53802991 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGAAA AAATCCAAGC CATCCGCGGC ATGCACGACA TCCTGCCGGA TATGACACCA 
CGCTGGCAAC AGGTCGAACA CCAGATCCGC AGTGTCATGG CCAATTACGG CTACCGCGAG
ATCCGTCTGC CCATCGTCGA AAAGACCGAG CTGTTCAAGC GCTCCATCGG CGAAGTCACC
GACATCGTCG AAAAGGAGAT GTACGTTTTC GAGGACCGCA ACGGCGACTC GCTGACCCTG
CGCCCCGAAG GCACCGCAGG GTGTCTGCGC GCCTGCCTCG AACATGGCCT GCTCCACAAT
CAGACTCATC GGCTGTGGTA TGCCGGCCCC ATGTTCCGCC ACGAGCGCCC CCAGAAAGGG
CGTTACCGGC AGTTCCACCA GGTCGGTGTC GAGGTCTTCG GCATGCCCGG ACCGGATATC
GACGCGGAGC TGATCTTCCT GAGCCGCCGG CTCTGGCAGC GCTTAGGGGT GGCCCACAAA
CTGCGGCTGG AGCTGAACTC CCTGGGCACG CCCGACGAAC GGCTGGCCTA CCGCCAGATC
CTGGTGGATT ATCTGCGCGA GAATTACGAC GCGCTGGACG CAGACAGCGC TCGGCGCCTG
GAAACCAACC CGCTGCGCAT CCTCGACAGC AAGAATCCGG CGATGAAGGA TCTGCTGGCC
GGCGCGCCGG TGCTCGCTGA CCATCTGGGC GGCGCCTCGC GGGCCCACTT CGAGGGACTG
ACCGAGCTGC TCGATGCCGC CGAAATCCCT TATGTGATCA ACCCCCGCCT GGTGCGCGGC
CTGGATTATT ACTGCAACAC CGTGTTCGAA TGGGTGACCG ACGAACTGGG CACCCAGGGC
ACTGTCTGCG CCGGCGGCCG CTATGACGGC CTGGTGGCCC AGCTCGGCGG CCGTGACTCC
AGTGCCATCG GCTTCGCCTT GGGGATGGAG CGCCTGATCG AGCTGACCGG CGAGACCTTC
GCCGAAAACG CCCCGCACAT CTATTTCATC ACGGTGGGCA CGGCCGCGGA ACGGCGCGGC
GCCGTGCTGG CAGAAATGCT CAGGGATGCG TTTCCCAGCC TGCGGCTGGT GGTCAACTGC
GGCGGCGGCA GCTTCAAGAA CCAGTTCAAA CGCGCCGACA AGAGCGCGGC GGCTTTCGCC
CTCGTCCTGG GCGAGGACGA AATGCGCGAC GACAGCATCA GTCTGAAACC CTTACGCTCG
GACGATGCCC AGCAGACGGT CGCCCAGGCC GACCTGGTCC GGCTCATCCA ATCCTGTGTC
CCAATCTGA
 
Protein sequence
MSEKIQAIRG MHDILPDMTP RWQQVEHQIR SVMANYGYRE IRLPIVEKTE LFKRSIGEVT 
DIVEKEMYVF EDRNGDSLTL RPEGTAGCLR ACLEHGLLHN QTHRLWYAGP MFRHERPQKG
RYRQFHQVGV EVFGMPGPDI DAELIFLSRR LWQRLGVAHK LRLELNSLGT PDERLAYRQI
LVDYLRENYD ALDADSARRL ETNPLRILDS KNPAMKDLLA GAPVLADHLG GASRAHFEGL
TELLDAAEIP YVINPRLVRG LDYYCNTVFE WVTDELGTQG TVCAGGRYDG LVAQLGGRDS
SAIGFALGME RLIELTGETF AENAPHIYFI TVGTAAERRG AVLAEMLRDA FPSLRLVVNC
GGGSFKNQFK RADKSAAAFA LVLGEDEMRD DSISLKPLRS DDAQQTVAQA DLVRLIQSCV
PI