Gene Information |
Locus tag | Ndas_1998 |
Symbol | |
ID | 9245848 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 2419122 |
End bp | 2420126 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | Helix-turn-helix type 11 domain protein |
Protein accession | YP_003679930 |
Protein GI | 297560956 |
COG category | |
COG ID | |
TIGRFAM ID | |
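
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return length_bp // 3 - 1


# Values taken from the record: Start bp, End bp.
length = gene_length_bp(2419122, 2420126)
print(length, protein_length_aa(length))
```

Under these assumptions the results agree with the reported Gene Length (1005 bp) and Protein Length (334 aa).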
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.556593 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGATGA AGACTTCCGC CCGCCTGCTG GCGCTGCTCT CCATACTCCA GACCCGCCGG GACTGGTCGG GGCAGGACCT GGCCGACCGG CTGGACGTCA GTGCGCGCAC GGTCCGACGT GACGTGGACC GTCTGCGCGA ACTCGGCTAC CCGATCACGA CCTTCAAGGG GCCCGACGGC GGGTACCGGC TCGACGCGGG GTCGCGGATG CCCCCGCTGC TGTTCGACGA CGAGCAGGCC GTCGCGCTGG CCGTCGCGCT CCAGGCGGCC ACGGCCACCG GCGCCGGGAT CGGGGAGGCC GCGGCGCGCG CCCTCAACAC CGTTCGACAG GTCATGCCCG CCCGCCTGCG CCAACGGATC AACGCGGTCC GGGTCGCCGT GGTCGCCCCG CCCTCCGCCC CGCGGGCGCG GGCCGACGGC GGGCTGCTCA CGGCGATCAG CGCCGCCGTG CACGCCCGCG AGGAACTGCG CCTGGACTAC GCCCCCGCCT TCCGGTCCGC CTCCCGGGAC GAGGCCGCCG TGGGCCCGCG CCGGGTCCAG CCCCACCACC TGGTCACCTG GGCGGGGCAC TGGTACCTCC TGGCCTGGGA CGTCGAGCGC GAGGACTGGC GGACCCTGCG GGTGGACCGC ATCGCGCTGC GCAGTCCCAA CGGCCCCCGG TTCACCCCTC GGGAGGTGCC CGGGGGTGAT GTGGCGGCCT TCCTCATCGG CAGGTTCCGG GGCTCGGACG GAACGGTCGA CTGGCCCTGC CGCGGTGAGG TGATCCTCGA CCTGCCCGCC CGCGCCGTCG CTCCCTTCGC GCACGACGGG CTGGTCGAGG AGGTGGGTCC CGAGCGCTGC CGCCTGGTCC TCGGCTCCTG GTCGTGGGTC GGCCTGGCCT CGGCCGTCGG CCGCTTCGAC GCCGACTTCG AGGTCGTCGG ACCGCCCGAG CTGGAGGACG CCTTCGCGCG GCTGGCCCGC CGTTACGCGG GCGGCGGCCG TCAGGCCCGC GCCCCCGAGC GCTGA
|
Protein sequence | MVMKTSARLL ALLSILQTRR DWSGQDLADR LDVSARTVRR DVDRLRELGY PITTFKGPDG GYRLDAGSRM PPLLFDDEQA VALAVALQAA TATGAGIGEA AARALNTVRQ VMPARLRQRI NAVRVAVVAP PSAPRARADG GLLTAISAAV HAREELRLDY APAFRSASRD EAAVGPRRVQ PHHLVTWAGH WYLLAWDVER EDWRTLRVDR IALRSPNGPR FTPREVPGGD VAAFLIGRFR GSDGTVDWPC RGEVILDLPA RAVAPFAHDG LVEEVGPERC RLVLGSWSWV GLASAVGRFD ADFEVVGPPE LEDAFARLAR RYAGGGRQAR APER
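
The sequences above are displayed in space-separated blocks of 10 characters, so the spaces must be removed before any analysis. The following is a minimal sketch of that cleanup and of a GC-fraction calculation like the one behind the GC content field. The helper names are hypothetical, and the sample input is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the display whitespace between blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-base blocks of the gene sequence shown in this record.
sample = clean_sequence("ATGGTGATGA AGACTTCCGC")
print(len(sample), gc_fraction(sample))
```

Passing the full gene sequence to the same two functions would give a value to compare against the reported GC content.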
|
| |