Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3985 |
Symbol | |
ID | 9247856 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4766758 |
End bp | 4767774 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | transcriptional regulator, LacI family |
Protein accession | YP_003681888 |
Protein GI | 297562914 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.610149 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGACA CCCCGTCCCG AGACGTCACG ATCTCCGCCA TCGCCGAGGA GGCGGGCGTC TCCGCGCCGA CGGTCTCGCG CGTGCTCAAC GGCCGGGGCG ACGTCGCCCC CGCCACGCGC GAGCGCGTGG AGAGCCTGAT CCGCGCCCAC GGCTACCGGC GCCGCGGAGG TCGGCCGCGG GAGCGCGTCG GCCTGCTCGA CCTCGTCTTC AACGACCTGG ACAGCCCGTG GGCCGTCGAG ATCATCCGCG GGGTGGAGGA CGCCGCGCAC GAGAGCGGTA CCGGCATCGT GGTGTCGGCC ATCCACCGCC GGGTCAGCTC CACCCGCCAG TGGCTGGAGA ACGTGCGCTC GCGCGCCACC GACGGGGCGA TCCTCGTGAC CACCGACCTG GACCCCGAGC TGCGCGAGGA ACTGCGGGAA CTCCACGTGC CCGCCGTGGT CGTCGACCCG GTCGGCGTCC CGGACCTGGA CACCCCCACG GTCGGCGTCA CCAACTGGGC CGGGGGCCTC AGCGCGACCG AGCACCTGAT CCACCTCGGC CACCGGCGCA TCGCCTTCGT CGCCGGGCGC CCCGAGCTGT GGTGCAGCCG GGCCCGGCTC GACGGCTACC GGGCGGGCCT GGAGACGGCG GGGCTCGCGG TCGACGACGA GCTGGTCGTG CCGGGGGAGT TCGGCTACGA GTCCGGCTTC CGGGCGGGGG AGCGGTTGTT CGACCTCGCC GATCCGCCCA CGGCCGTGTT CGCGGCCAGC GACCAGATGG CGCTGGGCGT CTACGAGGCG CTGCGCCGCC GCGGCCTGCG GGTGCCCGCC GACGTCAGCG TGGTCGGCTT CGACGACCTG CCCGAGGCGC GCTGGTCCTC GCCGTCCCTG ACCACCGTGC GCCAGCCGCT GTCGGACATG GGCAGGCTCG CGGTGCGCAC CGTGCACCGC CTGGTGCAGC GCGAGACCAT CGAGAGCCCG CGGGTCGAGC TGGCCACCGA GCTCGTCGTG CGCGACAGCA CCGCCCCGCC GCCGTGA
|
Protein sequence | MPDTPSRDVT ISAIAEEAGV SAPTVSRVLN GRGDVAPATR ERVESLIRAH GYRRRGGRPR ERVGLLDLVF NDLDSPWAVE IIRGVEDAAH ESGTGIVVSA IHRRVSSTRQ WLENVRSRAT DGAILVTTDL DPELREELRE LHVPAVVVDP VGVPDLDTPT VGVTNWAGGL SATEHLIHLG HRRIAFVAGR PELWCSRARL DGYRAGLETA GLAVDDELVV PGEFGYESGF RAGERLFDLA DPPTAVFAAS DQMALGVYEA LRRRGLRVPA DVSVVGFDDL PEARWSSPSL TTVRQPLSDM GRLAVRTVHR LVQRETIESP RVELATELVV RDSTAPPP
|
| |