Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3089 |
Symbol | |
ID | 9246945 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3697262 |
End bp | 3699292 |
Gene Length | 2031 bp |
Protein Length | 676 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | putative PAS/PAC sensor protein |
Protein accession | YP_003681004 |
Protein GI | 297562030 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACGGAA AACGCGCTTC AGGCGATGGT TCCGAGAGCG CTCTCGCGGA GGCGTTCTCC CACGCCCCCG AGCCCATGGT GCTCACCCTG GCCGACGGCG CGATCGTGCA CGCCAACCTG GCCTTCACCG AACTCTTCGG CCGCGACGCC TCCGACCTCG ACGGAGGGGC CTGGTTCGAC CTCCTCGACG GTGACGACGC CCAGCGCGGC GCCCGCGCCC ACCACGAGGC CCTCGCCGGG AACCGGGGTC GCCGGTCCCG CCTGCGCCTG AAGGTCGCCG ACAAGGCCGC CCACCTGGTG GAGGCCGAGC TGCGCCCCCT GCCCGGGGAC GGCCCGGGGG ACGCCCGCGC GGTCGTCCTG CTGCACCTGC TCTCCACCGA GGAGGCCGAC CTGCGCGTGA TCGGCGAGCT GCGCACCGAC AACTCCGACT CCGTGCTGTG GAGCCTGGAC CCGGCCAGCG GACGCCTGCG CGAGATGTTC GGCCCCACCC CGCTCGGCGC GCTGCTGGCC GGGAACGACC GCGAACTCGA ACGCATCCTG GAGCGCGTCC ACCCCGACGA CATCGCCCGC GTGCGCGACG CCATCTCCGC CTCCTTCGCC GGGCGCGACT ACGAACAGCG CTTCCGCGTC TTCGACCGCC TCGGCGACGA GCGCTCCCTG CACGTGCGCG CCCGCTACGT GCCAGGCGAT CCCGACCGCC TCGTGGGCAT CGTCGACGAC GTCACCGAGC ACGTCCAGCT CGTACGCCGC CTCGCCGACC GCCGCCGCAC CGAGGCCGAG CACGGCCGCC TGGTCACCGA ACTGTCCTCC AAACTCGTCT CGGCCACCAC CGTCGACAAG GTCATGGACC TGCTCAGCGA GGAGTTCCTG CCCATCTTCG GCGGAGTCCA GGCCGTCGCC CTCTACGTCG AGAACGGACT GCTGCGCAGC TCGCCCAGCG CACACGCCAA GAGCGCCACC AGCAACATCG AGCGCATAGA CGGCCGCCGC GCCGACGACA CCGACTTCCC CATGGGCGCG GTCATCCAGG ACCGCCAGCC CCGCTTCTTC GAGAGCCGCG CCGAGATCAT CAGCCGCTTC CCCGCCGCCG TCGAACTCAT GCGCCAGGTG CGCGGCCAGG CCTGGGCCAC CGTGCCCATC TTCGGCGACG GCAAGGTCGC CCTGGGCGTG TGGCAGATGG TCTGGGACCG CCCCCACCAC GCCAGCCGCG ACGAGCGGGC CCTCATGCTC ACCTTCGCCG GACTCGCCGG GCAGGCCCTC CAACGGATCA AGGCGCAGCA GGCCGAGCTG GAGCTGGCCG ACGCGCTCCA GCGCCGCATG CTGCCCCGCC AGGTGGCCAG CTTCCCGGAC ATGGACATCG CCACCAAGTA CCTGCCCTCC CGCGCGGACT GGCGCATCTG CGGCGACTTC TACGACGTCA TCGAACTCCC CGAGAACAAG GTCGGCCTCC TGGTCGGCGA CGTCCAGGGG CACGGCGTCG AGGCGGCGGC CGCCATGGGC CAGATCCGCG TGGCCTTCCG CGCCTACGCC ACCAACCAGT CCGACCCCGG CGTCGTCCTG GGCGAGACCA ACCGCCTGCT CACCGAGACC GGCGAGATCG TCTTCGCCAC CTGCGGCTAC CTCGTCGTGG ACCGCGAGAG CGGCGTCATG CAGGCGGCCT GGGCCGGACA GCCGCCGGTC GTCCTGGCCA CCCGGAGCGG CTACGAGCTC TGGGAGCCCG AGACCGGCCC GCCGCTGGGC GTGCTCTCCG ACCCCGAGTA CGCCGTCACC ACCCGCATGC TCCCGCCCGG CACCGCCCTG CTGCTGTGCT CCGACGGACT CGTGGAGAGC TCCGAGGTGC CCATGGGCGA GGGGCTGGCC CGGGTCGGCG CGGCGCTCTC CGAGCACCAC GAGGACCCCG AGGCCGCCGC GCACGTCATC GCCGAGATGG CCCCGGCCGG GCGCGGCGAC GACATCGCCC TGCTCATCAC CCGCATGATC CCCTCCACCG AGCCGGCGCG GGACCGCGCC GCCGTGGCGA CCGCGACCTG A
|
Protein sequence | MNGKRASGDG SESALAEAFS HAPEPMVLTL ADGAIVHANL AFTELFGRDA SDLDGGAWFD LLDGDDAQRG ARAHHEALAG NRGRRSRLRL KVADKAAHLV EAELRPLPGD GPGDARAVVL LHLLSTEEAD LRVIGELRTD NSDSVLWSLD PASGRLREMF GPTPLGALLA GNDRELERIL ERVHPDDIAR VRDAISASFA GRDYEQRFRV FDRLGDERSL HVRARYVPGD PDRLVGIVDD VTEHVQLVRR LADRRRTEAE HGRLVTELSS KLVSATTVDK VMDLLSEEFL PIFGGVQAVA LYVENGLLRS SPSAHAKSAT SNIERIDGRR ADDTDFPMGA VIQDRQPRFF ESRAEIISRF PAAVELMRQV RGQAWATVPI FGDGKVALGV WQMVWDRPHH ASRDERALML TFAGLAGQAL QRIKAQQAEL ELADALQRRM LPRQVASFPD MDIATKYLPS RADWRICGDF YDVIELPENK VGLLVGDVQG HGVEAAAAMG QIRVAFRAYA TNQSDPGVVL GETNRLLTET GEIVFATCGY LVVDRESGVM QAAWAGQPPV VLATRSGYEL WEPETGPPLG VLSDPEYAVT TRMLPPGTAL LLCSDGLVES SEVPMGEGLA RVGAALSEHH EDPEAAAHVI AEMAPAGRGD DIALLITRMI PSTEPARDRA AVATAT
|
| |