Gene Noca_4311 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4311 
Symbol 
ID4596829 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4555839 
End bp4559111 
Gene Length3273 bp 
Protein Length1090 aa 
Translation table11 
GC content73% 
IMG OID639778921 
ProductPAS/PAC sensor hybrid histidine kinase 
Protein accessionYP_925495 
Protein GI119718530 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTCGG ACGGGTACGA CGACGCCGTC GGCGACCCCG GTCCCGACCT GCGCTCGTTC 
GAGCTGCTGC CCGACGGCAT CTGGATCTTC GACGACCAGG GGATCACCTC CTACGCCAAC
CGGCGGATGG CCGAGATGCT CGGCCGCGAC CAGGCCGAGA TGCCGGGGAC CCGCGTGGTC
GACTTCTTCG ACGAGGAGGG CGTCCACAAC TTCTGGGAGT GGCTCGGCGA GATGATCCGC
ACCGGGCACG GCCGGGAGAA CTTCGAGGCG TTCTTCTACC GACCCGACGG CTCCACCGTG
TGGGGCCTGG TCAGCTCGGC GCCACTCCAC GACGCCGACG GCCGCCGGAC CGGCTGGCTG
CACCGGATCG CGCCGTACAC CGAGCGCAAG GAGCTGCTCC AGCGACTCCA GGCGAGCGAG
CACCAGCTCG CCAACGCCCA GCGCATCGCC CGGCTCGGGA GCTGGGAGCG CGACCTGCGC
ACCGACGAGA TGTCCTGGTC CGACGAGATG TACCGGCTGG TCGGTCTGTC GCCGCAGGAG
GAGCCGGCCT CGATCGAGCT GATGATCTCC TTCGCCCACC CCGAGGACCG CGACCAGATG
ATCGCCGCGG CCCGCCACGA TCTCACCCGG GGCGACCAGT ACTCCTTCGA CGGCCGGATC
GTGCACCGCG ACGGCAGCGT CCGGTGGGGC CGGGTGCTCG GCTACGTCGA GCGGGACGCG
AGCGGGGCCC CGATCCGCAC CGGCGGCACC GTGCAGGACG TCACCGACCT CAAGAACGCC
GACCGGGCCG CGGACGAGGC GACCCGCCGC CTGCGGCTGC TGCAGCAGAT GGCGAGCGCC
GCGAACGTCG CGACCTCGCT GCGCGAGGCG ATGTTGATGG TCGCCCACGG GCTGCCCGGG
CTGACGAGCT GGTACGCCGT CGGCGCCTGG GTGTGCGGCG ACGACCTGAG TCGGACCGAG
ATCGACCTCG AGTCCCCGCC CGCGCCTCCG GGCATCACGC TGCACTTCGA CCACGACCTC
GCGGAGCAGG CCCGCGCGAC GCGGGCGGTC GTCGTCGGGT CGCTGCCGGC GCTGGCGGCC
ACCCACAGCC TGGTCGCGGT GCCGGTGATC GTCCAGGACC AGGTCCTCGC GGTGGCGGAG
CTGCTCGCCG ACGAGGTGCC GCCCGACGAG GAGTCCCGTC AGCTCGTCGA GCAGGTCGCC
ACCCAGCTGG CGGTCGTCGC GGAACGCGAG CGGAGCGCCG CCCAGCTGGC CGAGGCCCGC
GACGACGCCA TGGAGGCGTC CCGGCTGAAG TCGGAGTTCC TGGCGACCAT GAGCCACGAG
ATCCGCACCC CGATGAACGG CGTCATCGGG CTGACCGACC TGCTGCTGCT CTCCGAGCTG
GACGACCACC AGCGCCGGCT GGCCGAGAAC CTCCAGGGCG CGGGGCTGAC CCTCCTCGGG
ATCATCAACG ACATCTTGGA CCTCTCCAAG ATCGAGTCCG GCAAGCTCGA GCTCGAGTCG
GCCGACTTCG ACGTGCGCGC GGTCTTCGAC CAGGTCGCCT CCGTGCTCAG CGGGCCGGCC
CACGCCAAGG GGCTCGAGCT CGTGGTGGCC TGCCACCCCG AGGTGCCCGT CCAGCTGCGC
GGCGACGCCG TACGGTTCGG CCAGATCCTC ACCAACCTCG GCTCGAACGC GGTCAAGTTC
ACCGACCAGG GCGAGGTGGT CGTCCAGGCG CGGGTGCTGG CCGAGCACGA GCACGAGCAC
GAGTCCGAGT CCGAGCCCGG GCCCGGGCCC GGGCACGAGG TGGTGCTGCA GGTCGACGTC
GCCGACACCG GCGTCGGCAT CGAGCCGCAC TCCCGCGAGC GGCTCTTCGA CGCGTTCACC
CAGGCCGACC CGTCGACCAC CCGGCGGCAT GGCGGCACCG GGCTCGGCCT GGCGATCTCC
CGGCAGCTCG CGATCGCCCT CGGTGGCGAG ATCTGGGTCG AGAGCGAGCC CGGTCGGGGC
AGCACCTTCT CGTTCACGGC CCGCTTCGGG CGCGGCTCCG GTGCCACCGG AGCGAGCCGG
GAGCACGCGC GGCACCTCGC GGGGCGGCGC GCCCTGGTGA TCGACGACAA CGAGACGAAC
CGGTTCATCC TCGAGGAGCA GCTCGGCGCG TGGCGGATGC GCCCCGTGGC GGTGGCCTGC
GCCACCGAGG CGATCGCCAC GCTGCGCGAG GCGGCGCGCT CGGGCGACCC GTACGACGTC
GCCCTGTTGG ACCTGATGAT GCCGGGAACC GACGGACTGA TGCTGGCCCG CCAGATCCGT
GCGGACCCGT CCGTGGGCGC CCCCGCGATG CTGCTGCTCA CCTCGGACCA GACGGTGACC
CGGGAGGAGG TCGAGGGCGC CGGCGTGCAC GCCTCGCTGA GCAAGCCGGT GCGGCACGGC
GAGCTGCGCG GCGCCCTGCA GGCGCTGCTC GGCGACGCGA CCACCCGGCC GGCGCCGGCC
TCGCCGGCCG GACCCGGGCT GGGCATCCGG GTCCTGGTCG TGGAGGACAA CCAGGTCAAC
CAGCTGGTCG CCGCCGGGCT GTTGGAGAAC CTCGGTTGCA CCGTCGACGT CGTCTCCGAC
GGGGTCGAGG CGGTGCGGCT GCTCACCCGG CCGCACGAGT ACGCCGCGGC CCTGATGGAC
TGCCGGATGC CGCGACTCGA CGGGTTCGAC GCGACCCGGC AGGTCCGCCG CCACGAGCCG
GTGGGCCGGC GGGTGCCGAT CATCGCGCTG ACCGCCTCGG CCATGGAGGG CGAGCGCGAG
CGCTGCCTGG ACGCGGGCAT GGACGACTAC CTCACCAAGC CCGTGGACAC CGCCGAGCTG
GAGCGGGTGA TCCGTGAGTG GGCGGTCCCG GAGCGCGACC GGCGGACGGC GTCGCCGGCG
TCCCCCGAGC CGGCCGACGG GCTCCTCAGC GGGATCGCGG ACGGGATCCT CGACGCGGAG
CGGATCGCGA TGCTCGAGGG CCTGCGCAAG GACGGCATCA GCTTCTTCGA GCGCACCGCC
GCCTCGTTCC TCGGTCGGGT CGGCAGCCAG CTGCTCGCGA TCCGCGCCGC GGTCGATCGC
GGCGACGCGA TGGGACTGCT CACCTCGTCG CACCAGCTCA AGGGCAGCGC CCTCAACCTG
GGGCTGCCGC GGGTGGCCGA GGCCGCCGCG CGCCTCGAGG CGCTGGGCAT CGCCGGCTCC
ACGACCGGCG CCGAGCCGCT GTTCACGACG GCGACCGCCG AGGTGGAGCT CGCGGTGGCC
GCGCTCCAGC AGGCGACCAC CCGGGACCGC TGA
 
Protein sequence
MSSDGYDDAV GDPGPDLRSF ELLPDGIWIF DDQGITSYAN RRMAEMLGRD QAEMPGTRVV 
DFFDEEGVHN FWEWLGEMIR TGHGRENFEA FFYRPDGSTV WGLVSSAPLH DADGRRTGWL
HRIAPYTERK ELLQRLQASE HQLANAQRIA RLGSWERDLR TDEMSWSDEM YRLVGLSPQE
EPASIELMIS FAHPEDRDQM IAAARHDLTR GDQYSFDGRI VHRDGSVRWG RVLGYVERDA
SGAPIRTGGT VQDVTDLKNA DRAADEATRR LRLLQQMASA ANVATSLREA MLMVAHGLPG
LTSWYAVGAW VCGDDLSRTE IDLESPPAPP GITLHFDHDL AEQARATRAV VVGSLPALAA
THSLVAVPVI VQDQVLAVAE LLADEVPPDE ESRQLVEQVA TQLAVVAERE RSAAQLAEAR
DDAMEASRLK SEFLATMSHE IRTPMNGVIG LTDLLLLSEL DDHQRRLAEN LQGAGLTLLG
IINDILDLSK IESGKLELES ADFDVRAVFD QVASVLSGPA HAKGLELVVA CHPEVPVQLR
GDAVRFGQIL TNLGSNAVKF TDQGEVVVQA RVLAEHEHEH ESESEPGPGP GHEVVLQVDV
ADTGVGIEPH SRERLFDAFT QADPSTTRRH GGTGLGLAIS RQLAIALGGE IWVESEPGRG
STFSFTARFG RGSGATGASR EHARHLAGRR ALVIDDNETN RFILEEQLGA WRMRPVAVAC
ATEAIATLRE AARSGDPYDV ALLDLMMPGT DGLMLARQIR ADPSVGAPAM LLLTSDQTVT
REEVEGAGVH ASLSKPVRHG ELRGALQALL GDATTRPAPA SPAGPGLGIR VLVVEDNQVN
QLVAAGLLEN LGCTVDVVSD GVEAVRLLTR PHEYAAALMD CRMPRLDGFD ATRQVRRHEP
VGRRVPIIAL TASAMEGERE RCLDAGMDDY LTKPVDTAEL ERVIREWAVP ERDRRTASPA
SPEPADGLLS GIADGILDAE RIAMLEGLRK DGISFFERTA ASFLGRVGSQ LLAIRAAVDR
GDAMGLLTSS HQLKGSALNL GLPRVAEAAA RLEALGIAGS TTGAEPLFTT ATAEVELAVA
ALQQATTRDR