Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1921 |
Symbol | |
ID | 9245771 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2340849 |
End bp | 2341841 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | transcriptional regulator, LacI family |
Protein accession | YP_003679854 |
Protein GI | 297560880 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.295106 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAGTC TGGACGACGT GGCCAGGGAG GCCGGCGTGT CGGCGTCCAC CGTTTCGCGG GCCCTGTCGC GGCCGTCCAT GGTCTCCGAG CCCACCCTGG TCAGGGTCCG CGCCGCCGCG GAGAGACTGG GCTTTCGGGC CAACCCCGCC GCGCGGGCGC TGACCACGGG TCGCAGCGGT TTCTTCGCCC TGCTCGTCCC CGACCTGGAC AACCCCTTCT ACTCCACCAC CGCCACCGCC GCGCAGGAGC TGGCGGGCAG GAGCGGGCGC CGGGTGATCA TCGCCGTCAC CGGGGGCGAC GCGGCCCGCG AGGCCGAGGT GCTCGGCGAA CTGGAGAGCC AGGTGGACGG ATTCGCGCTC CAGTCACCGG TGGGCTCGGC CGCCGACCTG AAGGAGGCGC ACCGGCGCAA GCCGCTGGTG GTGATCAACC GCCGCGTGCC CGGACTGACC TCGTTCACGG TGGACACCCC CGGCGGTCTG GGCACCGTCT ACGACCGCCT GGTGGAGCTG GGGCACCGCG ACGTCGCCTA CCTGGCGGGG CCGCCCAGTT CGTGGATGGA CCGGCGCAGG CGCGAGGAGC TGGTGGAGCA CGCCGCCCGG TCCCGGCTGC GGGTGCGGGT CTTCGGGCCC GTGCGACCGG CCTTCGCGGA GGGGGCCGCG GCGGCGCGGG AGATCGTGGA CTCGGGCTGC ACGGCGATGC TGGTCTACAA CAGCCTGTTG CTGCTGGGAG CCATGTTCGA GTTCGGCCGC ATAGGGGTGC GGGTGCCCGA GGACATCAGC GTGGCCGCGG CCGACGACAT CGCCCTGGCC GACCTGCCGG GGCCGCCGAT CTCGGCGGTG CTGGCCCCGG CCGACGAGCT GGGCCGCGCG GCGGTGTCGG CGCTGATCGA GCTGGTCGAC GGGCCCGCGG GGACCCGCCC CCGGGGGCGC AGGCTGCCCA CCGAGGTGCG GATCACCGAC TCCCTCGCGC CGCCGCGCCC GGCACTCGGC TGA
|
Protein sequence | MASLDDVARE AGVSASTVSR ALSRPSMVSE PTLVRVRAAA ERLGFRANPA ARALTTGRSG FFALLVPDLD NPFYSTTATA AQELAGRSGR RVIIAVTGGD AAREAEVLGE LESQVDGFAL QSPVGSAADL KEAHRRKPLV VINRRVPGLT SFTVDTPGGL GTVYDRLVEL GHRDVAYLAG PPSSWMDRRR REELVEHAAR SRLRVRVFGP VRPAFAEGAA AAREIVDSGC TAMLVYNSLL LLGAMFEFGR IGVRVPEDIS VAAADDIALA DLPGPPISAV LAPADELGRA AVSALIELVD GPAGTRPRGR RLPTEVRITD SLAPPRPALG
|
| |