Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_5202 |
Symbol | |
ID | 9249095 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | + |
Start bp | 349257 |
End bp | 353693 |
Gene Length | 4437 bp |
Protein Length | 1478 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | GAF sensor hybrid histidine kinase |
Protein accession | YP_003683088 |
Protein GI | 297564115 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0874625 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGAGG CGGTCAAGGA ACTGGCGGCC GACGACCAGC TCGACGAGAT TCTGCGGGCC CTGTACCGCA TGCGCGACGG CGACTTCAGC GTCCGTCTGC GCAAGCGCGG CGGCGGCACC ATGCGGGAGA TCGCGTCCGT CTTCAACGAG GTGGTCGACC ACAGCGAACA GCTCAGCGAC GAACTCCAGC GCGTCGGCGA CGTCGTCCGC AACGAGGGCA GGCTCAACGA GCGCGTCAAC GTCAACCCCG CCCGGGGCGC CTGGGGCGAG AGCGCCCAGG CGCTCAACCT CCTGCTCGAC GAGGTCGCCG AACCCGTGAC CGACGTCGCC GGGGTCCTGG ACTCGGTCGC GGAGGGCCAG CTCAGCCGCC GCGCCTCCAC CGAGGGCCGC CGCGGCGAGC TCAAGGGCGA CCTGCTGCGC CTGGCCACCA CGGTCAACCG CATGGCCGAC CAGATGAGCG GCTTCGCCGG GGAGGTCACC CGTGTGGCCC GCGAGGTGGG CACCGAGGGC AAGCTCGGCG GCACCGCGCA GGTCGAGGGG GTCTCCGGAG CCTGGCGCGA GGTGACCGAG TCGGTCAACC AGATGTCCTC GCGCCTGACC GACCAGATGC GCGACATCTC CGAGGTGACC ACCGCCGTCG CCCGCGGCGA CCTCAGCCGC AAGATCACGG TGGACGTACA GGGCGAGATG CTCGACCTCA AGGACACCGT CAACACCATG GTGGACCAGC TGTCCACCTT CGCCGACGAG GTCACCCGTG TGGCCCGCGA GGTGGGCACC GAGGGCAAGC TCGGCGGCCG CGCCAACGTG CGCGGCGTGC GGGGCATCTG GAAGGACCTC ACCGAGAACG TCAACTCGAT GGCGGACAAC CTCACCAACC AGGTGCGCGA CATCTCCCAG GTGACGACCT CCGTGGCCCG CGGCGACCTC ACCCAGAAGG TCCAGGTCGA CGTGCAGGGC GAGATGCTCG CCCTGAAGAA CACCGTGAAC ACCATGGTCG ACCAGCTCGA CTCCTTCGCC GACGAGGTCA CGCGCGTGGC GCGCGAGGTC GGTACGGAGG GCAAGCTGGG CGGCCGGGCC AACGTCAAGG GCGTGTCGGG CATCTGGAAG GACCTGACCG ACAACGTCAA CTCCATGGCC AACAGCCTCA CCTACCAGGT CCGCAACATC TCCCAGGTGA CGACGGCGGT CGCCACCGGT GACCTCACCA AGAAGATCAC CGTGGACGCC CAGGGGGAGA TGCTCGACCT CAAGGACACC ATCAACAAGA TGGTCGACCA GCTGGACTCC TTCGCCGGCG AGGTGACCAG GGTCGCCCGT GAGGTGGGCA CGGAGGGCAA GCTGGCCGGA CAGGCCCACG TCCGCGACGT GTCCGGGGTC TGGAAGGACC TGACCGACAA CGTCAACTCC ATGGCCAACA ACCTGACCTA CCAGGTGCGC CAGATCTCCA TGGTCACGCG CGCGGTGGCC GCGGGCGACC TGACCAAGAA GGTCACGGTC AACGCCAAGG GCGAGATCCT GGAGCTGAAG GACACCATCA ACGTCATGGT GGACCAGCTC TCCGCGTTCG CCGACGAGGT CACCCGGGTG GCCCGCGAGG TCGGCACCGA GGGCAAACTG GGCGGCCGAG CCGACGTCAA GGGCGTCTCC GGCATCTGGA ACGACCTCAC CGAGAACGTC AACTCGATGT CGCACAACCT CACCACGCAG GTGCGCAACA TCTCCGAGGT GACCACGGCC GTGGCGGCGG GCGACCTCAA CAAGAAGATC GACGTCAACG CCCAGGGGGA GATCCTGGAG CTCAAGACGA CGGTCAACAC CATGGTCGAC CAGCTGTCGG CGTTCGCCAC CGAGGTCACC CGCGTGGCCC ACGAGGTGGG CAGCCAGGGC CAGCTGGGCG GCCAGGCCAA GGTCGAGGGC GTGACCGGCA CCTGGAAGCA GCTCACCGAC AGCGTGAACG GCCTGGCGGG CAACCTGACC ACGCAGGTGC GGGCGATCGC CGAGGTCGCC AACGCCGTCG CCAAGGGCGA CCTGACCCGC AACATCCAGG TCGACGCCCG CGGTGAGATG GAGCAGCTCA AGCACAACAT CAACCTGATG GTGTCCAACC TGCGCGAGAC CACCGCCACC CAGCGCGACG CCGACTGGCT CAAGTCCAAC GTGGCCCGTA TCTCCGGCCA CATCCAGGGC CACCGCGACC TCAAGGAGCT GGCCCGCCTC ATCATGACCG AGGTGACGCC GCTCATCGGC GCCCAGCACG GCGCCTGCTA CCTGCCCGAG GACCAGGACG ACCAGGAGAA CTTCCTGTTC TACGCGGGCT TCGGCTTCGA CCCCGACGAG GACCGCCGCC GCGTCCGCGC GGGCATCGGC CTGGTCGGCG AGGCCCTCGC CCAGCAGGTG GAACAGCAGA TCTCCAACAT CCCGCCGGAC TACGTCAAGG TCCGGTCGGG CCTGGGCGAG GCCTCCCCGC GCAACCTGTA CATCCTGCCG ATCGTCTCCG AGGGGCGCTC GCTGGGCGCC ATCGAGTTCG CCTCCTACGA CGACTTCCGC GAGAGCCACA AGAACTTCCT GCGCCAGCTG GTGAGCCTGC TCGGCACCAC CATCAACACC ATCCTGGCCA ACAACCGCAC CGAGGACCTG CTCGAACAGT CCCAGCAGCT CACCAGCCAG CTCAGGGAAC GCTCCAACGA GCTCCAGCGC CAACAGGAGG AGCTGCGCGG CAAGAACGCC GAGCTCCGCC AGAAGGCCAC CCAGCTCGCC AACCAGAACC GGGCGATCGA GCTCCAGAAC CAGCAGATCC AGCGCTCCCG CAACGCCCTG GAGGAGCGCG CCCACCAGCT CCAGGTCTCC TCCAAGTACA AGTCCGAGTT CCTGGCGAAC ATGTCCCACG AGCTGCGCAC GCCGCTCAAC AGCCTGCTGA TCCTGGCCCG CCTGCTCGCC GACAACGCCG AGCAGAACCT GTCCGCCAAG CAGGTCGAGT TCGCCCAGAC CATCCACAAG GCGGGCAGCG ACCTGCTGCT GCTCATCGAC GAGATCCTCG ACCTGTCCAA GGTGGAGGCC GGGCGCGCCG AGGTGCAGCC CAGCGAGGTC TCCATCGCCC AGCTGGTCGA CTACGTCGAG GCCACCTTCC GCCCGGTCAC CGGGGACCAG GGGCTCGCCT TCGCCGTGGA CGTGTCGCCG GACATCCCCG GCACGCTGTG GACCGACGAG CAGCGGCTCC AGCAGATACT GCGCAACCTG CTCTCCAACG CCGTCAAGTT CACCCCGCAG GGGGAGGTGC GGCTGCTCAT CGAGCCCGCG TGGGCGCTGG AGGACGCCGA CCTGGAGATG TTCGTCGAGA ACGAGGAGGT CATCGCGTTC ACCGTGGCCG ACACCGGCAT CGGGATCGCC GAGGACAAGC TCCAGGTCAT CTTCGAGGCC TTCCACCAGG GCGACGGCGG CACCTCGCGG CGCTTCGGCG GAACCGGCCT CGGCCTGTCC ATCAGCCGCA ACTTCGCCCG GCTGCTGGGC GGCGAGATCC GCGTGCAGAG CGTGCCCAAC CAGGGCTCCA CCTTCACGCT GCTGCTCCCG GTGCGCCTGC CCGACGACGC GGGGGAGCGG GCCGAGGAGG CCGGGTCCCC GATCCGCGGC CTGAGCGGCA TGGACTCGGC GCCGATGCTC GATTCGGGCC GGGGCGCGGA CACCGGCGGC TTCGGCTCCT CCCCGCTGGA CGACGGCTTC TCCATGCCGG ACGAGGACTT CGACGCGGCA CTGGCAGCCC TCACCGACAT CAGCGACGAG CCGCTGCCCA CGGTCCCGGT GCTCAAGGCG CCCGAGACCC ACGCGGCGGC CGCCGAGGGC GAGGCGCCCG AGCAGCCGGC CGAGTCCGAG CCGCGCACCG ACCCCGAGCG GCGGGCCGTG CTGAGCGGCC AGCGGGTCCT CATCGTCGAC GACGACGTCC GCAACGTCTT CGCGCTCACC AGCGCTCTGG AGGCGCAGGG CCTGGAGGTC CTCTACGCTG ACAACGGACA CGCCGGAATC GCCAAGCTGG AGGCCAACGA GGACATCGCG CTCGTCCTGA TGGACGTGAT GATGCCCGAG CTGGACGGCA ACCAGACCAC CCAGCGGATC CGGGAGATGC CGCAGTTCGC CGGTCTGCCG ATCATCTCGC TCACCGCCAA GGCGATGCAG GGGGACCGCG AGCGCAGCCT CGCGGCGGGA GCCACGGACT ACGTGACCAA GCCGGTCGAC CTGGACCACC TGCTGGACGT CATGCGGCGC TGGCTCACCG CGGGACGCGA GGAGACCATC GCCGGAGCGG CGGCCGACGG CGGGGGCGGC GCGGTCCCGG ACGGGCACAG CCGCGCGACG TCCGGGCCGG AGGAGGCCGG CGCCGGGCAT GACAATGATC CGAACGGGGA GACTGGCACC AGCACCCCGA GCGAAGACCT GGAGTAA
|
Protein sequence | MPEAVKELAA DDQLDEILRA LYRMRDGDFS VRLRKRGGGT MREIASVFNE VVDHSEQLSD ELQRVGDVVR NEGRLNERVN VNPARGAWGE SAQALNLLLD EVAEPVTDVA GVLDSVAEGQ LSRRASTEGR RGELKGDLLR LATTVNRMAD QMSGFAGEVT RVAREVGTEG KLGGTAQVEG VSGAWREVTE SVNQMSSRLT DQMRDISEVT TAVARGDLSR KITVDVQGEM LDLKDTVNTM VDQLSTFADE VTRVAREVGT EGKLGGRANV RGVRGIWKDL TENVNSMADN LTNQVRDISQ VTTSVARGDL TQKVQVDVQG EMLALKNTVN TMVDQLDSFA DEVTRVAREV GTEGKLGGRA NVKGVSGIWK DLTDNVNSMA NSLTYQVRNI SQVTTAVATG DLTKKITVDA QGEMLDLKDT INKMVDQLDS FAGEVTRVAR EVGTEGKLAG QAHVRDVSGV WKDLTDNVNS MANNLTYQVR QISMVTRAVA AGDLTKKVTV NAKGEILELK DTINVMVDQL SAFADEVTRV AREVGTEGKL GGRADVKGVS GIWNDLTENV NSMSHNLTTQ VRNISEVTTA VAAGDLNKKI DVNAQGEILE LKTTVNTMVD QLSAFATEVT RVAHEVGSQG QLGGQAKVEG VTGTWKQLTD SVNGLAGNLT TQVRAIAEVA NAVAKGDLTR NIQVDARGEM EQLKHNINLM VSNLRETTAT QRDADWLKSN VARISGHIQG HRDLKELARL IMTEVTPLIG AQHGACYLPE DQDDQENFLF YAGFGFDPDE DRRRVRAGIG LVGEALAQQV EQQISNIPPD YVKVRSGLGE ASPRNLYILP IVSEGRSLGA IEFASYDDFR ESHKNFLRQL VSLLGTTINT ILANNRTEDL LEQSQQLTSQ LRERSNELQR QQEELRGKNA ELRQKATQLA NQNRAIELQN QQIQRSRNAL EERAHQLQVS SKYKSEFLAN MSHELRTPLN SLLILARLLA DNAEQNLSAK QVEFAQTIHK AGSDLLLLID EILDLSKVEA GRAEVQPSEV SIAQLVDYVE ATFRPVTGDQ GLAFAVDVSP DIPGTLWTDE QRLQQILRNL LSNAVKFTPQ GEVRLLIEPA WALEDADLEM FVENEEVIAF TVADTGIGIA EDKLQVIFEA FHQGDGGTSR RFGGTGLGLS ISRNFARLLG GEIRVQSVPN QGSTFTLLLP VRLPDDAGER AEEAGSPIRG LSGMDSAPML DSGRGADTGG FGSSPLDDGF SMPDEDFDAA LAALTDISDE PLPTVPVLKA PETHAAAAEG EAPEQPAESE PRTDPERRAV LSGQRVLIVD DDVRNVFALT SALEAQGLEV LYADNGHAGI AKLEANEDIA LVLMDVMMPE LDGNQTTQRI REMPQFAGLP IISLTAKAMQ GDRERSLAAG ATDYVTKPVD LDHLLDVMRR WLTAGREETI AGAAADGGGG AVPDGHSRAT SGPEEAGAGH DNDPNGETGT STPSEDLE
|
| |