Gene Noca_1019 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_1019 
Symbol 
ID4599680 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp1070611 
End bp1071789 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content75% 
IMG OID639775618 
Producthistidine kinase, dimerisation and phosphoacceptor region 
Protein accessionYP_922225 
Protein GI119715260 
COG category[T] Signal transduction mechanisms 
COG ID[COG4585] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.181556 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCCAGT TCCGGCGGCA GCTCTCGACG TACGGTCTCG ACGCCCTCCT CGACGCCCTC 
CTCGGCGTGG CCGCCGTGTG GAGTGCCGTC GGCACCCTGC GCCGCGACGA TCGGTACCTC
CCGGCCGGGG CCGCGGCGTG GTGGGAGGCC GCGGCGATCG CCGCGATCAT CCTCGTCCTC
GTGCTGCGCC GTCGGTTCCC GTTCGGTGCG CCTGCCGGTG TCTGGCTGAC GTGTGCGGCG
CTCTCCTTCG CCGACGGGCG GATGATCCCC AGCCAGGCCG GCCTCTTCGT CGCCGGGCTC
GGCGCCGCGC TGCTTCTCGG CAACCAACGC AACGGAGTGC AGGCGCGGGT CGGCCTGGCC
ATCGTGGTCG GCAGCGGCGC GATCGTCATG TACAACGACC CCACGCACTC GTCCGGTGCC
CTGGTCTCCA CTCCGCTGCT GTTCGCGATG GCCTGGCTGG TCGGCTACGC GCTGCGCGAG
CGCACCGAGC GGACCGAGGC CGCGGAGGAG CGCGCCGCTC GTGCCGAGCG CGACCGCGAG
GTGGCGGCGC GCGTGGCCGT GGCGGAGGAG CGCGGCCGGA TCGCGCGGGA GCTCCACGAC
GTCGTGGCGC ACGCGGTCAG CGTGATGGTC CTCCAGGTCG GCGCCGTCCG GCACCGGATG
TCCGACTCCG ACGCGGAGAA CCGCGAGGCG CTCGAGAACG TCGAGCGGGC CGGGCGGGCC
GCCCTCGCCG AGATGCGCCG CCTGCTCGGG GCGATGCGGC GCGACGGCGA GCAGCCCGAG
CTGGTGCCGC ATCCGGGCCT GGCCGACCTG GACAGCCTGC TCGCGGACGT GCGGGCTGCC
GGGCTGCCCG TCCGGCTGCA GGTCCACGGC GAGCCGGTCG AGCTGCCGCC GGGGCTCGAT
CTCTCGGCGT ACCGCATCGT GCAGGAGGCC ATCACCAACA CCCTCAAGCA CGCCCGGGCG
CACCGCGCGG ACGTGGACGT GTACTACGAG CCCCACGACC TTCGGGTGGA GGTCCGCGAC
GACGGCCGGG GCTCGACGTC CGGTGCTGGG CTGGGGCACG GGCTGGTGGG CCTGCGCGAG
CGGGTCAAGA TCTACGGCGG GGAGATGACG GCGGGCCGAG GTCCTGCCGG AGGGTTCGCG
GTGCGCGCAC GGCTTCCGTT GGACGGTGAC GGGTCATGA
 
Protein sequence
MSQFRRQLST YGLDALLDAL LGVAAVWSAV GTLRRDDRYL PAGAAAWWEA AAIAAIILVL 
VLRRRFPFGA PAGVWLTCAA LSFADGRMIP SQAGLFVAGL GAALLLGNQR NGVQARVGLA
IVVGSGAIVM YNDPTHSSGA LVSTPLLFAM AWLVGYALRE RTERTEAAEE RAARAERDRE
VAARVAVAEE RGRIARELHD VVAHAVSVMV LQVGAVRHRM SDSDAENREA LENVERAGRA
ALAEMRRLLG AMRRDGEQPE LVPHPGLADL DSLLADVRAA GLPVRLQVHG EPVELPPGLD
LSAYRIVQEA ITNTLKHARA HRADVDVYYE PHDLRVEVRD DGRGSTSGAG LGHGLVGLRE
RVKIYGGEMT AGRGPAGGFA VRARLPLDGD GS