Gene Information |
Locus tag | Ndas_2200 |
Symbol | |
ID | 9246050 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 2630410 |
End bp | 2632719 |
Gene Length | 2310 bp |
Protein Length | 769 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | putative sensor with HAMP domain |
Protein accession | YP_003680128 |
Protein GI | 297561154 |
COG category | |
COG ID | |
TIGRFAM ID | |
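
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS is complete and ends in a stop codon; the function names are illustrative and are not part of the source database.

```python
# Values copied from the Gene Information fields above.
start_bp = 2630410
end_bp = 2632719
gene_length_bp = 2310
protein_length_aa = 769


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus the terminal stop codon.

    Assumes a complete, unspliced CDS whose last codon is a stop.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

# Both computed values should agree with the listed fields.
print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under these assumptions the listed Gene Length and Protein Length are consistent with the coordinates: the span gives the gene length, and dividing by three and dropping the stop codon gives the protein length.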
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.930898 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000377518 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGCCCCACA ACAAGCGCCG TTCGCTTCGC GCCCGCATGA TCACGCTGGT CCTGTTCCCC AGCACCGTGC TGCTGGCCCT GTGGGCCGCC TTCACCGCCC TGCTCGTCGG CGACATCGGT GAACTGCGCA CCACCGCCAC CTTCACCGAA CAGGTCGGCG TCCCCGTCGA GGAGACCATC AGCGCGCTCC AGCGCGAACG CCGCGCCACC ATGGACGCGG CCTGGGGCAC CGAGCGCACC GCCCTGTACC TGACCTACTC GCGCCAGCGC ACCGACGAGG CCGTGGACGC CCTCAGCCGG AGCCTGGACG CCTTCGACGT GCAGGACCTG CCCGAACAGG CCCTGGCCTT CCGCGCCGCC GTCAACCGCG TCGACACCCT GCGCGCCCGG GTGGACGCCT CCTCCCCCAC CGAGATCGAC CTGGAGGAGA CCTCCACCGT CTACGACGAG GTCATCGAAC AGGGCCTGCG CGTGTGGGAC GGCCAGGTCG AGCGCGCCGA CCCCGCCCAG GCCCCGCACC TGCGCTCCCT GACCTCGCTG ATGCGCACCC GCGAACTCCT CAACCAGCAG GACACCGTCC TGGCCCACGC GGTGGCCACC AACTCCTTCC CCGTCCAGGC CCACGCACAG TTCGCCGCCG CGGCCGGAGC CCAGCACTAC ACCTGGGACC GGGTCGTGGT CGAGATGAGC GAGGAGGACG GCCAGGACTA CATCGAGCTG GAGTCCTACA GCGAGGTGCA GCGCATCTAC CAGCTCCAGG AGAGCATCAT CTCCATGCCC GTGCGGGGCG GGGCCACCGT GCCGGTCAAC GCCACCGCCT GGCAGGGCGC CGCCGAGGCC GTGGACGCGC GCATGCGCGA CGTCGAACAG GGCCAGATCG ACCGGGTCAT CGCCTTCAGC CGCTCGCAGT CGGCCGAGCT GCGCACCAGC GTCCTGCTGG TGAGCGTGCC CGCCCTGGTG GCCGCCCTGG TCTCCTCCGC GGTCGCGGTG GGCGCCACCC AGCGCCTGGG CAGGCGCCTG CAGGACCTGC GCACGGCCAC CCTGGAGCAC GCCCGGGTGC GCCTGCCGGA GGTGACCGCA CGGCTGCGCG CGGGCGGCAG CGTGGACGTG GACGCCGAGG TGCCGCGGCT GCGGGTGGAG AGCACCGACG AGGTCGGGCA GGTGGCCGAG GCCTTCAACG ACGCCCAGCG CGCGGCCGTG GCGGCGGCGG TGGAGGAGGC GCAGGTGCGC GCGGGCGTGC GCAACATGTT CCGCAACATC GCCCGGCGCA CGCAGGCGCT GGTGCACCGC CAGCTGGGGC TGTTGGACGC GCTGGAGCGC GCCGAGACCG ACCCCAAGGT GCTGGAGTCG CTCTTTCGCA TCGACCACTT CAGCACGCAG ATGCGGCGCA ACGCCGAGAA CCTGATGCTG CTCAGCGGGG ACGCCCCCGT GCGGCGGGGA CTGGCGCCGG TGCGCCTGCA CGAGGCGGTG CGGGCCGCGG CCAGCGAGAT CGAGGACTAC GTGCGGGTGC GGGTGCTGGC GGTGCCGGCG GTGGCGCTGC GCGGCGAGGT GGGCGCGGAC ACCGTGCGGC TGCTGGCGGA GCTGCTGGAG AACGCCACGT CCTTCTCGCC CCCGGGCACG GAGGTGACGG TGCGCGGCCG GGCCGGGGAG GCGGGCGGGT GCGTGCTGGA GGTGCGCGAC CGGGGGCTGG GGATGACGCG GGAGCAGTTG GAGGCGTCCA ACGCGCTGCT GCGCGACCCG CCGCGCTTTG ATCTGGCCCG GATGCGCGAG GACTCCCAGC TGGGGTTGTT CGTGGTGGCC 
ACCATCGCCG CGCGCCACGG CCTGGAGGTG ACGCTGAGCG CCGGGCCCGA GCGCGGCGCG TGCGCGGTCG TGGCCGTTCC CGCCGACGCG CTGGCGGACC AGGGCGCGGG CGCCGCGGAG CCGGGCGGGG ACCGGGAGGC CGCGACCGCG CGGTCGCAGC GGCTGGTCCT GGCGGGCGGG GTCGAGGAGC GGGCGGCGCG GGAGGACGGG GCCCGGGCCG CGCCCGAGAC CGCGGACGCG GCCGGGGACG AGGACACGGG CGATACCTAC AAGGGTCTGC CCCGGCGGCG GCGCAAGGGC GTGCAGCCCT CCCCCGCGCC CGCCCCGGCC GCGCCCGGGC ACCCCGGGGA GTCCGGGGGC CGGGAGGCGG CGGCCGAGCG GTCGCTGACC GAGATCCGGT CGATGATGAG TGCTTTCCAG TCCGGGAGCC TGCGCGGGCG CGCACAGCCC CTGGACGGGG ATGGAGACGG CGTGCGCGGT GCGCGCCCCG GTGAGAGCTC GGAAGGGTGA
Protein sequence | MPHNKRRSLR ARMITLVLFP STVLLALWAA FTALLVGDIG ELRTTATFTE QVGVPVEETI SALQRERRAT MDAAWGTERT ALYLTYSRQR TDEAVDALSR SLDAFDVQDL PEQALAFRAA VNRVDTLRAR VDASSPTEID LEETSTVYDE VIEQGLRVWD GQVERADPAQ APHLRSLTSL MRTRELLNQQ DTVLAHAVAT NSFPVQAHAQ FAAAAGAQHY TWDRVVVEMS EEDGQDYIEL ESYSEVQRIY QLQESIISMP VRGGATVPVN ATAWQGAAEA VDARMRDVEQ GQIDRVIAFS RSQSAELRTS VLLVSVPALV AALVSSAVAV GATQRLGRRL QDLRTATLEH ARVRLPEVTA RLRAGGSVDV DAEVPRLRVE STDEVGQVAE AFNDAQRAAV AAAVEEAQVR AGVRNMFRNI ARRTQALVHR QLGLLDALER AETDPKVLES LFRIDHFSTQ MRRNAENLML LSGDAPVRRG LAPVRLHEAV RAAASEIEDY VRVRVLAVPA VALRGEVGAD TVRLLAELLE NATSFSPPGT EVTVRGRAGE AGGCVLEVRD RGLGMTREQL EASNALLRDP PRFDLARMRE DSQLGLFVVA TIAARHGLEV TLSAGPERGA CAVVAVPADA LADQGAGAAE PGGDREAATA RSQRLVLAGG VEERAAREDG ARAAPETADA AGDEDTGDTY KGLPRRRRKG VQPSPAPAPA APGHPGESGG REAAAERSLT EIRSMMSAFQ SGSLRGRAQP LDGDGDGVRG ARPGESSEG