Gene Noca_2397 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_2397 
SymbolhisS 
ID4599497 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp2554390 
End bp2555745 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content71% 
IMG OID639777000 
Producthistidyl-tRNA synthetase 
Protein accessionYP_923589 
Protein GI119716624 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.165814 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCAAAC CCACCCCGCT GAGCGGGTTC CCCGAGCTGC TGCCCGAGCA GCGGGTCGTC 
GAGCAGCAGG TCGTCGACAC CCTCCGGCAC ACGTTCGAGC TCTACGGCTT CGCGGGCATC
GAGACCCGGG CCGTCGAACC GCTCGACCAG CTGCTGCGCA AGGGCGACAC CTCCAAGGAG
GTCTACCTCC TGCGCCGGCT GCACGAGGAG TCGGCGGAGG GGCACTCCGG CATCGGGCTG
CACTTCGACC TGACCGTGCC GTTCGCGCGC TACGTGCTCG AGAACGCCGG CAGGCTGGAG
TTTCCGTTCC GCCGCTACCA GATCCAGAAG GCGTGGCGCG GCGAGCGGCC GCAGGAGGGC
CGCTACCGCG AGTTCACCCA GGCCGACATC GACGTCATCG GCCGCGACGA GCTGCCCTTC
CACCACGACG TCGAGGTCAC CCGGGTGATG GTCGACGCAC TGAACCGGCT CGACTTCTTG
CCGGCGTTCC GGCTCCAGGT GAACAACCGC AAGCTGATCC AGGGTTTCTA CGCCGGCCTG
GGCGTCGCCG ACACCGACGA GGCGATGCGG ATCGTCGACA AGCTCGACAA GCTCCCGGTC
GAGAAGGTGC GCGCGATGCT GGTCGCCGAG GCCGGCGTCG ACGAGGCGAC CGCCGACCGC
GTGCTGTCGC TGGCGACGAT CCGCGCCACC GACGACTCCT TCGTCGACGC CGTCCGGGCG
CTGGGCGTCG AACACCCGCT GCTCGACGAG GGCCTGGCCG AGCTCGGTGC ACTGGTGCGG
GCGTGTGCCG ATCTCGTCAG CGACCGGGTG CAGGTCGTCG CGGACCTCTC GATCGCGCGC
GGCCTCGACT ACTACACCGG CACGGTCTTC GAGACCCGGC TCGACGGCTA CGAGTCGCTG
GGATCGATCT GCTCCGGCGG TCGCTACGAC GCGCTCGCGT CCGACGGCCG CACGACGTAC
CCCGGCGTCG GCATCTCGCT CGGCGTCAGC CGGGTCGTCG TGCCGCTGAT GGCGCGCACC
GGCCTGGCGG CCAGCCGCAA GGTGCCGAGC GCCGTCGTGG TCGCGGTCGT CTCCGAGGAG
TCGCGACCCG AGAGCGAGGC GGTCGCCGCC GCCCTGCGTG CCCGCAACAT CCCCTGCGAG
GTCGCGGCGA GCGCGCAGAA GTTCGGCAAG CAGATCCGGT ACGCCGAGCG GCGGGGCATC
CCGTACGTCT GGTTCCCGGA CCAGGCCGAG GTGAAGGACA TCCGGTTGGG GGAGCAGGTG
GCGGCCGACC CCGCCTCCTG GACCCCACCC ATCGAGGACC TGCGACCGCA GGTCGTTGCG
ACGAGCCCGA CAAGCGAGAT GGAGAGTACG AAGTGA
 
Protein sequence
MAKPTPLSGF PELLPEQRVV EQQVVDTLRH TFELYGFAGI ETRAVEPLDQ LLRKGDTSKE 
VYLLRRLHEE SAEGHSGIGL HFDLTVPFAR YVLENAGRLE FPFRRYQIQK AWRGERPQEG
RYREFTQADI DVIGRDELPF HHDVEVTRVM VDALNRLDFL PAFRLQVNNR KLIQGFYAGL
GVADTDEAMR IVDKLDKLPV EKVRAMLVAE AGVDEATADR VLSLATIRAT DDSFVDAVRA
LGVEHPLLDE GLAELGALVR ACADLVSDRV QVVADLSIAR GLDYYTGTVF ETRLDGYESL
GSICSGGRYD ALASDGRTTY PGVGISLGVS RVVVPLMART GLAASRKVPS AVVVAVVSEE
SRPESEAVAA ALRARNIPCE VAASAQKFGK QIRYAERRGI PYVWFPDQAE VKDIRLGEQV
AADPASWTPP IEDLRPQVVA TSPTSEMEST K