Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1798 |
Symbol | |
ID | 9245648 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2199160 |
End bp | 2200116 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | transcriptional regulator, LysR family |
Protein accession | YP_003679732 |
Protein GI | 297560758 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.103195 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.329714 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGCGC GGCAGCTGTC CTACTTCCTG GCCATCGTCG ACCACGGAGG ATTCGGCAGG GCCGCCGAGC ACCTGCACAT CGCCCAGCCC TCGTTGTCCC AGGCCATCGG GGGTCTGGAG CGGGAGCTGG GCGTGGACCT GTTCCACCGG GTGGGCCGGG GCGTGGTGCT GACCGACACC GGCACCCGCC TCATCGAACC GGCCCGCCAG GTGGTCCGGG ACCTGGAGGC GGTGCGCGAC ACCGCGCGTT CGGCCCGCGG GCTGCGGCGG GGGCGGGTGG ACCTGGTCTC GACGCCCTCC CCGGGGATCG AGCCGCTCAC CACGCTGATG GCCTCCTTCG CGCGGGAGTA CCCGCTGATG ACGGTCAACG TGGCGGGGGC GTTCACCCCC GAGGAGGTGG TCCAGCACGT GCGCTCGGGG GCGGCCGAGA TCGGGCTGCT GGGCTCGGCG GGCCACCCGC GCACCGCGGA CCTGAGGGTG CTGCCGGTGG AGGAGCAGCC CCTGGTGCTG CTGTCCCCGC CCGAGGAGGA GGGCGCCCCG CCGGAGCGGA CCGTGTCCGG CGGCACCGGT CCCGTCCCGG ACCGGATCGA CCGCGCGGAC CTGGGCGGTC TGCGGCTGAT CGCCTCCCAG CGCGGCAGCC TCATGCGCCA GATCGTGGAC GACATCCTGG CGGGCGGCAC CGACGCGCAC CTGGCCGCCG AGGTCGACCA CCGCACCTCG ATCCTGCCCC TGGTGCTGTC CGGGCTCGGC CACGCCGTCA TGCCCTCCTC CTGGAAGCCC ATCGCCACGC GCATGGGCGC CCGGGTCCGC CTCATCGAAC CGGTCTCCCG CCTGCGCGTC GTCATCGTCA GCCGCGCCTC GCGGATGACG CCCGCCGCGC AGGCCTTCCT CGGCGTCGCC GAGCACTACG CCAGGACCCG CCCGGAAGCG GAGCCGGGGG GTTCCGCGCA GGCATAG
|
Protein sequence | MDARQLSYFL AIVDHGGFGR AAEHLHIAQP SLSQAIGGLE RELGVDLFHR VGRGVVLTDT GTRLIEPARQ VVRDLEAVRD TARSARGLRR GRVDLVSTPS PGIEPLTTLM ASFAREYPLM TVNVAGAFTP EEVVQHVRSG AAEIGLLGSA GHPRTADLRV LPVEEQPLVL LSPPEEEGAP PERTVSGGTG PVPDRIDRAD LGGLRLIASQ RGSLMRQIVD DILAGGTDAH LAAEVDHRTS ILPLVLSGLG HAVMPSSWKP IATRMGARVR LIEPVSRLRV VIVSRASRMT PAAQAFLGVA EHYARTRPEA EPGGSAQA
|
| |