Gene Noca_4901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4901 
Symbol 
ID4595276 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008697 
Strand
Start bp233319 
End bp234470 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content71% 
IMG OID639772686 
Producthistidine kinase, dimerisation and phosphoacceptor region 
Protein accessionYP_919346 
Protein GI119714204 
COG category[T] Signal transduction mechanisms 
COG ID[COG4585] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value0.316421 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.344545 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGCGT CCGCGATCGC CGGGAGGTAC CGCGACCAGA TTCTGGCCGC GGGCCTCGCA 
GCGTTCTACG CGCTCGAGGT GCTGTTCAGT AGCGAAGTCG AGCACCACCG GGGCTCAGCA
GCCGCCGTCG CGGTGCTGAT GGCCGTCAGC CTCGTCGTAC GCCTCACGAT GCCCCTGCTG
CCGCTGGTGG CCGTGATCGC CGTCATCCAG CTCAACCACA CCGTCCTCCC CGGCTTGGCC
GAGGGAGGCG CGTTCATGAT CGCCTTGATC GTCACGATTT TCTCCGCCGG CCACTACCTG
CACGGGCGGA TGCTGGCCCT GGGCGGCGTG ATCGTCGCGG GCATCATCCC GCTGGCGGCG
CTCGACCCCC GCCAGCCGCC AGCGGTCGGC GACTGGATCT TCTTCATCGT GTATCTCGGT
ACGCCATTCG TGGCGGGAGT CGTGTTCCGC CGCCGCCGCG AACGCGACCG GGAGATGACC
GAGATGGCCC GGCGCGCAGA GGAGGAGGGC GAGACGCGGG CCGGTGAGGC TGTCGCCGCG
GAGCGCGCCC GAATCGCTCG GGAACTGCAC GACGTGGTTG CTCACGCGAT CAGCGTCATC
GTGGTCCAGG CTCGCGGCGG ACGTCGGGTC CTGGCCGACG ACACCGGAGG GGCGCGGAGT
GCGTTCGACG TCATCGAGCA CGCCGGGGAG CAGGCACTGA CCGAGATGCG GCGATTGCTG
GCGCTCTTAC GAGAGACGGA GCCGGAGGCA GCGGCGTTAC AGCCGCAGCC GAGCCTGGGC
CGCATCGACG TGCTCGCCAC CGAAGTGGCG GCGTCCGGTT TGCCGGTTGA GGTCGTCCGC
GAGGGCGACC CGGTCGAACT GCCGCCCGGG GTGGATCTCT CGGCGTACCG GATCGTGCAG
GAAGCACTGA CCAACGCCCT CAAGCACGCC GGGCCGGCTC GCGCCCGAGT GGTGCTGCGC
TACCTGCCGC GGGCATTCGA GGTGGAGGTG CTCGACGACG GTCACGGGAC CGGCGCGGGC
GGCGGTTCGG GGCACGGGCT GACCGGCGTC CGCGAGCGCG TCGAGGTCTA CGGCGGTCAG
CTCTCGGCGG GCACTCGGCC CGAGGGTGGG TTTGCCGTGC GAGCGCGGCT GCCGATCGAG
ATACCGTCAT GA
 
Protein sequence
MDASAIAGRY RDQILAAGLA AFYALEVLFS SEVEHHRGSA AAVAVLMAVS LVVRLTMPLL 
PLVAVIAVIQ LNHTVLPGLA EGGAFMIALI VTIFSAGHYL HGRMLALGGV IVAGIIPLAA
LDPRQPPAVG DWIFFIVYLG TPFVAGVVFR RRRERDREMT EMARRAEEEG ETRAGEAVAA
ERARIARELH DVVAHAISVI VVQARGGRRV LADDTGGARS AFDVIEHAGE QALTEMRRLL
ALLRETEPEA AALQPQPSLG RIDVLATEVA ASGLPVEVVR EGDPVELPPG VDLSAYRIVQ
EALTNALKHA GPARARVVLR YLPRAFEVEV LDDGHGTGAG GGSGHGLTGV RERVEVYGGQ
LSAGTRPEGG FAVRARLPIE IPS