Gene Information |
Locus tag | Ndas_1914 |
Symbol | |
ID | 9245764 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 2334172 |
End bp | 2335182 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | transcriptional regulator, LacI family |
Protein accession | YP_003679847 |
Protein GI | 297560873 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
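The start and end coordinates, gene length, and protein length listed above are consistent with one another, and the arithmetic is easy to check. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of residues in a complete CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length = gene_length(2334172, 2335182)
print(length)                  # 1011
print(protein_length(length))  # 336
```

Both results match the reported Gene Length (1011 bp) and Protein Length (336 aa).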
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.04666 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.595127 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGAAC CGACCAAGCG CGCGACCCTG CGGGACGTCG CGGCGGGGGC GGGCGTCTCG GTCGCCCAGG CCAGCTTCGC CCTCAACGGC ACCGGCCGCG TGGCCGCCGC GACCGCCGAG CGGGTCCGCC GGATCGCCGC GGAACTCGAC TACCAGGCCG ACGGCCGGGC CCGGGCGCTG CGCACTGGTC GGCGCTCCGC CTACGGCGTG GTGATCCGCA ACATGCGCAA CCCCTTCTTC CTGGACGTGC TGCGCGGCAT GGAGACCGTC GCGCACCGCG AGGGGGCGCT CCTGCTGATC ATGAGCTCGG ACTACGACCA GGAGCGGGAG AGCGCCGCGC TGCGCAGGCT CGCCGCCGAG GCGGTCGCCG GGATCGCCAT CGCGCCCATC GGGCGGCGCG ACCGCCTCCT GGAGTGGATG GACCGCCACT CCCACGTTCC GGTGGTGGCC TTCAACTGCA CACCCGAACC CGACCGGGAG GGCACCGCCA GCCGCCTGTC CACGGTCGGC CCCGACGACG AGGAGGCCGT CGCCCGGGCC GTGGCGCACC TGGCGCAGCG GGGCCACCGC GAGGCCACGC TGCTGATGGC CCCCGAGCAC CTGGCCGCCG ACTGGGGGCG CGAGGAGGCC TTCCAACGCC ACTGCGCCGA GCACGGGGTG GCCGGTTCGG TGGCACGCGG ACCCCTGGAC TACGAGGCGG TGGCCCGCAG GTCCGCGGAG ATGATGGCGC GCCCCGGGCA CCGCGCCCTG GTCGTCAACT CCGACCACCT GAGCGCCGCC GTCTACGACG CCGCCCGCTC CCTGGGCCTG CGCGTGGGGC GCGACGTCAG CGTGGTGGGC CACGACGACC TGCCCACCTC GGCCCTGCTG GACCCCGGCC TGACCACGAT CGCCGTGGAG CGCGAGGTAC TGGGGGAGCG GATCATGAAC CTGCTCGTGG AGGGCCCAGG CGCCGCCGTG CGGCTGCCCG TGCGCCTGGT GGAACGGGGG TCGGTGGCGG TCCTGGAGTG A
|
Protein sequence | MPEPTKRATL RDVAAGAGVS VAQASFALNG TGRVAAATAE RVRRIAAELD YQADGRARAL RTGRRSAYGV VIRNMRNPFF LDVLRGMETV AHREGALLLI MSSDYDQERE SAALRRLAAE AVAGIAIAPI GRRDRLLEWM DRHSHVPVVA FNCTPEPDRE GTASRLSTVG PDDEEAVARA VAHLAQRGHR EATLLMAPEH LAADWGREEA FQRHCAEHGV AGSVARGPLD YEAVARRSAE MMARPGHRAL VVNSDHLSAA VYDAARSLGL RVGRDVSVVG HDDLPTSALL DPGLTTIAVE REVLGERIMN LLVEGPGAAV RLPVRLVERG SVAVLE
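The sequence fields are printed in space-separated blocks of ten characters. The following is a minimal sketch, assuming that layout, of how one might strip the spacing and recompute the GC percentage of the gene sequence for comparison with the reported GC content; `clean_sequence` and `gc_percent` are hypothetical helpers, not part of any database tool.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence (spacing ignored)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Small illustrative input in the same block style (not the full gene sequence).
example = "ATGC GGCC"
print(clean_sequence(example))  # ATGCGGCC
print(gc_percent(example))      # 75.0
```

To check the record itself, pass the full "Gene sequence" value to `gc_percent` and compare the rounded result with the GC content field.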