Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4166 |
Symbol | |
ID | 9248040 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4974455 |
End bp | 4976752 |
Gene Length | 2298 bp |
Protein Length | 765 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | putative PAS/PAC sensor protein |
Protein accession | YP_003682067 |
Protein GI | 297563093 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.299427 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACGGT GGGGTGAGCC CTTCCCCAAG GGTGACGCGC TCGCCGCGGC CGCCGTGGCG GACCAGGCGC AGATCGCCAT CGTCGTCATC GACCGGGCCA GCCTCCTGCG CTACTGGAAC CCCTTCGCGC GCGAGCTGTT CGGGTTCGTG GGCGGTCGCG ACTACGTCGG CCTCTCCCTG CTCGACATCG GTATCCACGA GTCCGACCGG GAGCACGCCT CCCAGCTGGC CCGGCGCGTC CTGCGCGGCG AGCCGTGGGA GGGCACCTTC GCGGTGCTCC GGGGCGACTC GACCTGGATC CACGTGCGCG CCCAGGCCGT GCCCATGCAC AACGAGGCCG GCGAGATCGA CGGCATCACC CTGATCGCCA GGGAGGCCCT GCGCAGCGGA CGGGTGGAGG AGCAGTACGG CCTGCTGGAG CGCATCGGCA GCCGCCTGAC CAGCTCCCTG GAGTTCGACT CCACGGTCAG GGGGGTCGCG GGCATCCTCG TGCCGCAGTT CGCCGACCAC TGCTTCATCG ACCTCTACGA CCGCGACCGC CTGGTCAGAC AGGTGTCGGT GCACGCCGAG GGGTGGACGC CCCCGCCGCG CACCTGGTTC GAGGTCGGCG ACGAGGTCCG CTACCCCGAA CGCCACTTCG TCACCCAGGC CCTGCGCCGC CTGGAGACGG TGGTCAGCAG CGAGTACCTG TTCGAGAACT CGCCCAGCAC GCGCTCCGAC CAGGTCTCGC GCCAGGTCGG CGTCACCTCC GCCATCGCCG CCCCGCTGCG GGCCCGCGGC GAGGTGCTGG GCGTGCTCAC CCTGGCGCTG TCCGGGCTCT CCCCGCGGCA GAAGAGCACC TACGGGGGCT TCGACCGCGA CCTCGTCGGC GCCATCGCCT CCCGCGTGGC CCTGGCCATC GACAACGCCC GCCTGTTCGA GCAGGAGCGC AGCACCGCGC TGGCCTTCCA GCGCAGCCTG CTGCCCAGCA GCCTGCCCCG GCTGGACGGG CTGACCGCGG CCCACCGCTA CCTGCCCGCG GGACCGCTGC GCTCCGACGG GTACGGGGTC CAGACCCAGA TCGGCGGGGA CTTCTACGAC GCCATCCCGC TCTCCGCCGG ACGCGTGGGC CTGGTCATCG GCGACGTCGA GGGCCGCGGC CCGCACGCCG CCGCCGTCAT GGGCCAGCTG CGCGCCGCGC TGCGCGCCTT CGCCCAGGCC GACCGCGAGC CCGCCGACAT CCTGCGCGAA CTCGACGAGT GGGTCCGCCA GCTGGGCCAG GAGGACGAGG AGGGCGGCAC CTGGATCCCC AGCGTCAGCT GCCTGTACAT GGTCTACGAC GCCTGGTCGC GCGAGCTGTC CTACGCCAAC GCCGGGCACG CGCCCCCGCT GCTGGTCACC TCCGAGTCGG TGGAGAAGAT CGACCTGGAG GTCACCGACC GCATGCTGGG CGTGCGGGCC AAGGGCGGAT CGGGCGAGGA CGTGGTCTAC CACCAGGCCA ACCTGCGCCT GCCCATCGGC GCCACCCTGG TCCTGTACAC CGACGGGCTC GTGGACCGGC GCCCGGCCGG GGGCAGGGCG GACCCCGAGA GCGCCTTCGA GCTGCTGGCG GAGCGGGTGG CCGAGGTCGC CGACAAGGAC GTGGACCAGA TCGCCGAGGC GGCCGTGCAC AGCGTCCCGG GCGAACACGA CGACGACACC GCTCTGCTGG TCGTGCGCAC CCACTCCGAG GAGCTGGCGC TGCGCGAGGG CTGGTTCCCC TCGGAGGCCT CCACCGTGGG CGAGGCCCGC CACATGGCGG CCCACACCTT CAGCGAGTGG GGGGTGGACC GCGACCAGGC CGAGCTGGCC TGCCTGCTGG TCTCGGAGAT CGTCACCAAC GTCGTCATCC ACGCCACCCC GCACCCGGTT CACCGCGAGT TCACCGGCGG AGGGGTGCTG GACTCCGAGA CCCCGCTCGA CGCGGTGGCC GACGAGTTCG ACGAGGACTG GACCGACCTG CTGGAGGCGG TGGCCGAGGA GGCCGACGAA CCCGCCGAGG AGGCCGGGGA GAAGGAGTTC CTGCTGCGGC TGCGGCGGGG CGCGAACACG GTGTGGGTGG AGGTCTTCGA CAACGACCTG CGCCTGCCCC GCATCCGCAG CGCCGCCGCC GACGACGAGG GCGGCCGCGG CCTGTACCTG GTCGAGCAGC TGGCCAGCCG GTGGGGGGCG CGGCCGACCC CGGACGGCAA GGCGGTCTGG TTCGAGATGC CCATGCACTC CGAGGAGACC GGGGAGCAGC ACAGGGCCGA CGACGGGGAG AAGGCCGGGC GGGACTAG
|
Protein sequence | MTRWGEPFPK GDALAAAAVA DQAQIAIVVI DRASLLRYWN PFARELFGFV GGRDYVGLSL LDIGIHESDR EHASQLARRV LRGEPWEGTF AVLRGDSTWI HVRAQAVPMH NEAGEIDGIT LIAREALRSG RVEEQYGLLE RIGSRLTSSL EFDSTVRGVA GILVPQFADH CFIDLYDRDR LVRQVSVHAE GWTPPPRTWF EVGDEVRYPE RHFVTQALRR LETVVSSEYL FENSPSTRSD QVSRQVGVTS AIAAPLRARG EVLGVLTLAL SGLSPRQKST YGGFDRDLVG AIASRVALAI DNARLFEQER STALAFQRSL LPSSLPRLDG LTAAHRYLPA GPLRSDGYGV QTQIGGDFYD AIPLSAGRVG LVIGDVEGRG PHAAAVMGQL RAALRAFAQA DREPADILRE LDEWVRQLGQ EDEEGGTWIP SVSCLYMVYD AWSRELSYAN AGHAPPLLVT SESVEKIDLE VTDRMLGVRA KGGSGEDVVY HQANLRLPIG ATLVLYTDGL VDRRPAGGRA DPESAFELLA ERVAEVADKD VDQIAEAAVH SVPGEHDDDT ALLVVRTHSE ELALREGWFP SEASTVGEAR HMAAHTFSEW GVDRDQAELA CLLVSEIVTN VVIHATPHPV HREFTGGGVL DSETPLDAVA DEFDEDWTDL LEAVAEEADE PAEEAGEKEF LLRLRRGANT VWVEVFDNDL RLPRIRSAAA DDEGGRGLYL VEQLASRWGA RPTPDGKAVW FEMPMHSEET GEQHRADDGE KAGRD
|
| |