Gene Ndas_5202 details

Gene Information

Locus tag: Ndas_5202
Symbol:
ID: 9249095
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 349257
End bp: 353693
Gene length: 4437 bp
Protein length: 1478 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: GAF sensor hybrid histidine kinase
Protein accession: YP_003683088
Protein GI: 297564115
COG category:
COG ID:
TIGRFAM ID:
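
The coordinates and lengths listed above are consistent with each other, and the arithmetic can be checked with a minimal sketch. It assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields above.
length = gene_length_bp(349257, 353693)
print(length)                     # 4437
print(protein_length_aa(length))  # 1478
```

Both results match the reported gene length (4437 bp) and protein length (1478 aa).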


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0874625
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGAGG CGGTCAAGGA ACTGGCGGCC GACGACCAGC TCGACGAGAT TCTGCGGGCC 
CTGTACCGCA TGCGCGACGG CGACTTCAGC GTCCGTCTGC GCAAGCGCGG CGGCGGCACC
ATGCGGGAGA TCGCGTCCGT CTTCAACGAG GTGGTCGACC ACAGCGAACA GCTCAGCGAC
GAACTCCAGC GCGTCGGCGA CGTCGTCCGC AACGAGGGCA GGCTCAACGA GCGCGTCAAC
GTCAACCCCG CCCGGGGCGC CTGGGGCGAG AGCGCCCAGG CGCTCAACCT CCTGCTCGAC
GAGGTCGCCG AACCCGTGAC CGACGTCGCC GGGGTCCTGG ACTCGGTCGC GGAGGGCCAG
CTCAGCCGCC GCGCCTCCAC CGAGGGCCGC CGCGGCGAGC TCAAGGGCGA CCTGCTGCGC
CTGGCCACCA CGGTCAACCG CATGGCCGAC CAGATGAGCG GCTTCGCCGG GGAGGTCACC
CGTGTGGCCC GCGAGGTGGG CACCGAGGGC AAGCTCGGCG GCACCGCGCA GGTCGAGGGG
GTCTCCGGAG CCTGGCGCGA GGTGACCGAG TCGGTCAACC AGATGTCCTC GCGCCTGACC
GACCAGATGC GCGACATCTC CGAGGTGACC ACCGCCGTCG CCCGCGGCGA CCTCAGCCGC
AAGATCACGG TGGACGTACA GGGCGAGATG CTCGACCTCA AGGACACCGT CAACACCATG
GTGGACCAGC TGTCCACCTT CGCCGACGAG GTCACCCGTG TGGCCCGCGA GGTGGGCACC
GAGGGCAAGC TCGGCGGCCG CGCCAACGTG CGCGGCGTGC GGGGCATCTG GAAGGACCTC
ACCGAGAACG TCAACTCGAT GGCGGACAAC CTCACCAACC AGGTGCGCGA CATCTCCCAG
GTGACGACCT CCGTGGCCCG CGGCGACCTC ACCCAGAAGG TCCAGGTCGA CGTGCAGGGC
GAGATGCTCG CCCTGAAGAA CACCGTGAAC ACCATGGTCG ACCAGCTCGA CTCCTTCGCC
GACGAGGTCA CGCGCGTGGC GCGCGAGGTC GGTACGGAGG GCAAGCTGGG CGGCCGGGCC
AACGTCAAGG GCGTGTCGGG CATCTGGAAG GACCTGACCG ACAACGTCAA CTCCATGGCC
AACAGCCTCA CCTACCAGGT CCGCAACATC TCCCAGGTGA CGACGGCGGT CGCCACCGGT
GACCTCACCA AGAAGATCAC CGTGGACGCC CAGGGGGAGA TGCTCGACCT CAAGGACACC
ATCAACAAGA TGGTCGACCA GCTGGACTCC TTCGCCGGCG AGGTGACCAG GGTCGCCCGT
GAGGTGGGCA CGGAGGGCAA GCTGGCCGGA CAGGCCCACG TCCGCGACGT GTCCGGGGTC
TGGAAGGACC TGACCGACAA CGTCAACTCC ATGGCCAACA ACCTGACCTA CCAGGTGCGC
CAGATCTCCA TGGTCACGCG CGCGGTGGCC GCGGGCGACC TGACCAAGAA GGTCACGGTC
AACGCCAAGG GCGAGATCCT GGAGCTGAAG GACACCATCA ACGTCATGGT GGACCAGCTC
TCCGCGTTCG CCGACGAGGT CACCCGGGTG GCCCGCGAGG TCGGCACCGA GGGCAAACTG
GGCGGCCGAG CCGACGTCAA GGGCGTCTCC GGCATCTGGA ACGACCTCAC CGAGAACGTC
AACTCGATGT CGCACAACCT CACCACGCAG GTGCGCAACA TCTCCGAGGT GACCACGGCC
GTGGCGGCGG GCGACCTCAA CAAGAAGATC GACGTCAACG CCCAGGGGGA GATCCTGGAG
CTCAAGACGA CGGTCAACAC CATGGTCGAC CAGCTGTCGG CGTTCGCCAC CGAGGTCACC
CGCGTGGCCC ACGAGGTGGG CAGCCAGGGC CAGCTGGGCG GCCAGGCCAA GGTCGAGGGC
GTGACCGGCA CCTGGAAGCA GCTCACCGAC AGCGTGAACG GCCTGGCGGG CAACCTGACC
ACGCAGGTGC GGGCGATCGC CGAGGTCGCC AACGCCGTCG CCAAGGGCGA CCTGACCCGC
AACATCCAGG TCGACGCCCG CGGTGAGATG GAGCAGCTCA AGCACAACAT CAACCTGATG
GTGTCCAACC TGCGCGAGAC CACCGCCACC CAGCGCGACG CCGACTGGCT CAAGTCCAAC
GTGGCCCGTA TCTCCGGCCA CATCCAGGGC CACCGCGACC TCAAGGAGCT GGCCCGCCTC
ATCATGACCG AGGTGACGCC GCTCATCGGC GCCCAGCACG GCGCCTGCTA CCTGCCCGAG
GACCAGGACG ACCAGGAGAA CTTCCTGTTC TACGCGGGCT TCGGCTTCGA CCCCGACGAG
GACCGCCGCC GCGTCCGCGC GGGCATCGGC CTGGTCGGCG AGGCCCTCGC CCAGCAGGTG
GAACAGCAGA TCTCCAACAT CCCGCCGGAC TACGTCAAGG TCCGGTCGGG CCTGGGCGAG
GCCTCCCCGC GCAACCTGTA CATCCTGCCG ATCGTCTCCG AGGGGCGCTC GCTGGGCGCC
ATCGAGTTCG CCTCCTACGA CGACTTCCGC GAGAGCCACA AGAACTTCCT GCGCCAGCTG
GTGAGCCTGC TCGGCACCAC CATCAACACC ATCCTGGCCA ACAACCGCAC CGAGGACCTG
CTCGAACAGT CCCAGCAGCT CACCAGCCAG CTCAGGGAAC GCTCCAACGA GCTCCAGCGC
CAACAGGAGG AGCTGCGCGG CAAGAACGCC GAGCTCCGCC AGAAGGCCAC CCAGCTCGCC
AACCAGAACC GGGCGATCGA GCTCCAGAAC CAGCAGATCC AGCGCTCCCG CAACGCCCTG
GAGGAGCGCG CCCACCAGCT CCAGGTCTCC TCCAAGTACA AGTCCGAGTT CCTGGCGAAC
ATGTCCCACG AGCTGCGCAC GCCGCTCAAC AGCCTGCTGA TCCTGGCCCG CCTGCTCGCC
GACAACGCCG AGCAGAACCT GTCCGCCAAG CAGGTCGAGT TCGCCCAGAC CATCCACAAG
GCGGGCAGCG ACCTGCTGCT GCTCATCGAC GAGATCCTCG ACCTGTCCAA GGTGGAGGCC
GGGCGCGCCG AGGTGCAGCC CAGCGAGGTC TCCATCGCCC AGCTGGTCGA CTACGTCGAG
GCCACCTTCC GCCCGGTCAC CGGGGACCAG GGGCTCGCCT TCGCCGTGGA CGTGTCGCCG
GACATCCCCG GCACGCTGTG GACCGACGAG CAGCGGCTCC AGCAGATACT GCGCAACCTG
CTCTCCAACG CCGTCAAGTT CACCCCGCAG GGGGAGGTGC GGCTGCTCAT CGAGCCCGCG
TGGGCGCTGG AGGACGCCGA CCTGGAGATG TTCGTCGAGA ACGAGGAGGT CATCGCGTTC
ACCGTGGCCG ACACCGGCAT CGGGATCGCC GAGGACAAGC TCCAGGTCAT CTTCGAGGCC
TTCCACCAGG GCGACGGCGG CACCTCGCGG CGCTTCGGCG GAACCGGCCT CGGCCTGTCC
ATCAGCCGCA ACTTCGCCCG GCTGCTGGGC GGCGAGATCC GCGTGCAGAG CGTGCCCAAC
CAGGGCTCCA CCTTCACGCT GCTGCTCCCG GTGCGCCTGC CCGACGACGC GGGGGAGCGG
GCCGAGGAGG CCGGGTCCCC GATCCGCGGC CTGAGCGGCA TGGACTCGGC GCCGATGCTC
GATTCGGGCC GGGGCGCGGA CACCGGCGGC TTCGGCTCCT CCCCGCTGGA CGACGGCTTC
TCCATGCCGG ACGAGGACTT CGACGCGGCA CTGGCAGCCC TCACCGACAT CAGCGACGAG
CCGCTGCCCA CGGTCCCGGT GCTCAAGGCG CCCGAGACCC ACGCGGCGGC CGCCGAGGGC
GAGGCGCCCG AGCAGCCGGC CGAGTCCGAG CCGCGCACCG ACCCCGAGCG GCGGGCCGTG
CTGAGCGGCC AGCGGGTCCT CATCGTCGAC GACGACGTCC GCAACGTCTT CGCGCTCACC
AGCGCTCTGG AGGCGCAGGG CCTGGAGGTC CTCTACGCTG ACAACGGACA CGCCGGAATC
GCCAAGCTGG AGGCCAACGA GGACATCGCG CTCGTCCTGA TGGACGTGAT GATGCCCGAG
CTGGACGGCA ACCAGACCAC CCAGCGGATC CGGGAGATGC CGCAGTTCGC CGGTCTGCCG
ATCATCTCGC TCACCGCCAA GGCGATGCAG GGGGACCGCG AGCGCAGCCT CGCGGCGGGA
GCCACGGACT ACGTGACCAA GCCGGTCGAC CTGGACCACC TGCTGGACGT CATGCGGCGC
TGGCTCACCG CGGGACGCGA GGAGACCATC GCCGGAGCGG CGGCCGACGG CGGGGGCGGC
GCGGTCCCGG ACGGGCACAG CCGCGCGACG TCCGGGCCGG AGGAGGCCGG CGCCGGGCAT
GACAATGATC CGAACGGGGA GACTGGCACC AGCACCCCGA GCGAAGACCT GGAGTAA
 
Protein sequence
MPEAVKELAA DDQLDEILRA LYRMRDGDFS VRLRKRGGGT MREIASVFNE VVDHSEQLSD 
ELQRVGDVVR NEGRLNERVN VNPARGAWGE SAQALNLLLD EVAEPVTDVA GVLDSVAEGQ
LSRRASTEGR RGELKGDLLR LATTVNRMAD QMSGFAGEVT RVAREVGTEG KLGGTAQVEG
VSGAWREVTE SVNQMSSRLT DQMRDISEVT TAVARGDLSR KITVDVQGEM LDLKDTVNTM
VDQLSTFADE VTRVAREVGT EGKLGGRANV RGVRGIWKDL TENVNSMADN LTNQVRDISQ
VTTSVARGDL TQKVQVDVQG EMLALKNTVN TMVDQLDSFA DEVTRVAREV GTEGKLGGRA
NVKGVSGIWK DLTDNVNSMA NSLTYQVRNI SQVTTAVATG DLTKKITVDA QGEMLDLKDT
INKMVDQLDS FAGEVTRVAR EVGTEGKLAG QAHVRDVSGV WKDLTDNVNS MANNLTYQVR
QISMVTRAVA AGDLTKKVTV NAKGEILELK DTINVMVDQL SAFADEVTRV AREVGTEGKL
GGRADVKGVS GIWNDLTENV NSMSHNLTTQ VRNISEVTTA VAAGDLNKKI DVNAQGEILE
LKTTVNTMVD QLSAFATEVT RVAHEVGSQG QLGGQAKVEG VTGTWKQLTD SVNGLAGNLT
TQVRAIAEVA NAVAKGDLTR NIQVDARGEM EQLKHNINLM VSNLRETTAT QRDADWLKSN
VARISGHIQG HRDLKELARL IMTEVTPLIG AQHGACYLPE DQDDQENFLF YAGFGFDPDE
DRRRVRAGIG LVGEALAQQV EQQISNIPPD YVKVRSGLGE ASPRNLYILP IVSEGRSLGA
IEFASYDDFR ESHKNFLRQL VSLLGTTINT ILANNRTEDL LEQSQQLTSQ LRERSNELQR
QQEELRGKNA ELRQKATQLA NQNRAIELQN QQIQRSRNAL EERAHQLQVS SKYKSEFLAN
MSHELRTPLN SLLILARLLA DNAEQNLSAK QVEFAQTIHK AGSDLLLLID EILDLSKVEA
GRAEVQPSEV SIAQLVDYVE ATFRPVTGDQ GLAFAVDVSP DIPGTLWTDE QRLQQILRNL
LSNAVKFTPQ GEVRLLIEPA WALEDADLEM FVENEEVIAF TVADTGIGIA EDKLQVIFEA
FHQGDGGTSR RFGGTGLGLS ISRNFARLLG GEIRVQSVPN QGSTFTLLLP VRLPDDAGER
AEEAGSPIRG LSGMDSAPML DSGRGADTGG FGSSPLDDGF SMPDEDFDAA LAALTDISDE
PLPTVPVLKA PETHAAAAEG EAPEQPAESE PRTDPERRAV LSGQRVLIVD DDVRNVFALT
SALEAQGLEV LYADNGHAGI AKLEANEDIA LVLMDVMMPE LDGNQTTQRI REMPQFAGLP
IISLTAKAMQ GDRERSLAAG ATDYVTKPVD LDHLLDVMRR WLTAGREETI AGAAADGGGG
AVPDGHSRAT SGPEEAGAGH DNDPNGETGT STPSEDLE
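
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup together with a simple GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first line of the gene sequence, so its GC value is not expected to equal the 69% reported for the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First line of the gene sequence listed above (six blocks of ten bases).
first_line = "ATGCCCGAGG CGGTCAAGGA ACTGGCGGCC GACGACCAGC TCGACGAGAT TCTGCGGGCC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"GC fraction of this fragment: {gc_fraction(seq):.2f}")
```

Applying `clean_sequence` to the full gene sequence should yield a string whose length equals the reported gene length, which is a quick way to confirm that no blocks were lost when copying.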