Gene Nmul_A2376 details

Gene Information

Locus tag: Nmul_A2376
Symbol: hisS
ID: 3784967
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2702469
End bp: 2703728
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 59%
IMG OID: 637812465
Product: histidyl-tRNA synthetase
Protein accession: YP_413057
Protein GI: 82703491
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0124] Histidyl-tRNA synthetase
TIGRFAM ID: [TIGR00442] histidyl-tRNA synthetase
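
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following Python sketch is an illustration only and is not part of the original record. It assumes that the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields above.
start_bp = 2702469
end_bp = 2703728

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1260 419
```

Under these assumptions the result agrees with the listed Gene Length (1260 bp) and Protein Length (419 aa).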


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCACTA GGGGGATTCA GGCTGTACGC GGGATGAACG ACATCCTGCC CGATCAGATT 
GACCGGTGGG AGTTCTTCGA GCAAAGTGTC CGCGACTGGA TGGCGGCTTA TGGCTACCGC
AATATCCGTA TGCCGATAGT CGAGCAGACC GACCTGTTTG TGCGTTCCAT CGGTGCGGTC
ACAGATATCG TCGAGAAAGA GATGTACACC TTCGTGGATC ATCTCAACGG CGAGAGCCTG
ACACTGCGGC CGGAAGGAAC CGCGTCCTGC GTGCGTGCGG TGCTTGAGCA TAATCTGCTC
TATTCCGGCC CGCAACGGCT ATATTATTCG GGTGCGATGT TCCGCCACGA GCGTCCGCAA
AAAGGACGTT ACCGGCAATT CCATCAGGTC GGCGCCGAAG CTCTGGGGTA TGGCGGACCC
GACATCGATG CCGAGCTCAT CATCATGGGC GCCGACTTGT GGAAGCGGCT CGGCGTTTCC
GGGGTGCGGC TCGAAATCGG GACGCTTGGC AGTGCGGAGT CGCGCTCGGT GCACCGTACC
CGCCTGATCG ATTACCTGCA GCGGCATCTA TGCAAGCTGG ATGAAGATGC ATCCAGGCGC
CTGCACAGCA ATCCCCTACG CATACTCGAC AGCAAGAATG CGGGGATGAG AGAGATTATC
GAGGGCGCTC CGCGGTTACT GGATGACCTG GACGAGGACT CTCTCATTCA TTTTGAACGC
TTGCAGCAAA TCCTGCGCGA GCAGGGGGTC GACTTCGAGA TCAACCCGCG GCTGGTACGG
GGGCTGGATT ATTATAATCG CACCGTATTC GAGTGGGTTA CCGACAAGCT GGGGGCGCAG
GGAACCGTCT GCGCAGGTGG ACGTTATGAC GGACTGGTAG AACAGGTTGG CGGCAAGGCT
ACCCCCGCAT GCGGATTTGC CCTGGGCGTG GAACGAGTGC TGGCACTGGT GATGGACAGT
ATCATCCCTC AGGCTCCTCC TGATGTCTAT GTGGTTCACA AGGGCGATGC CGCGGCCGGG
TTTGCCTGGA AAACGGCAAG ACACTTGCGG GATCGTGGGT TCCAGGCAAT TCTGCATTGC
GGAGAGGGCA GCTTCAAGGC GCAGATGAGA AAAGCCGACG CCAGCGGAGC GCGTTTTGCG
ATCATCATCG GAGATGATGA AGCGCAAGCC GGCGAAATAA GCATCAAGCC GCTGCGGGAA
GCGGCGGAGC AGGTCCGGGT AGGCCTTGCG GAAGCTGCCG ACCTGCTGAA AAGGGCCTGA
 
Protein sequence
MPTRGIQAVR GMNDILPDQI DRWEFFEQSV RDWMAAYGYR NIRMPIVEQT DLFVRSIGAV 
TDIVEKEMYT FVDHLNGESL TLRPEGTASC VRAVLEHNLL YSGPQRLYYS GAMFRHERPQ
KGRYRQFHQV GAEALGYGGP DIDAELIIMG ADLWKRLGVS GVRLEIGTLG SAESRSVHRT
RLIDYLQRHL CKLDEDASRR LHSNPLRILD SKNAGMREII EGAPRLLDDL DEDSLIHFER
LQQILREQGV DFEINPRLVR GLDYYNRTVF EWVTDKLGAQ GTVCAGGRYD GLVEQVGGKA
TPACGFALGV ERVLALVMDS IIPQAPPDVY VVHKGDAAAG FAWKTARHLR DRGFQAILHC
GEGSFKAQMR KADASGARFA IIIGDDEAQA GEISIKPLRE AAEQVRVGLA EAADLLKRA
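
The gene and protein sequences above are related by translation, and the GC content is a simple base count. The Python sketch below shows one way to reproduce both from the space-separated blocks as printed here. It is an illustration, not the method used to produce this record. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon table is the standard one, on the assumption that translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons; this gene starts with ATG, so that difference does not matter here.

```python
from itertools import product

# Standard codon assignments in TCAG order.
# Assumption: translation table 11 uses these same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record above.
first_bases = "ATGCCCACTA GGGGGATTCA GGCTGTACGC"
print(translate(first_bases))  # prints: MPTRGIQAVR
```

The translated fragment matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the listed GC content and the complete protein sequence to be checked in the same way.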