Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0809 |
Symbol | |
ID | 9244654 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 998603 |
End bp | 999637 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | transcriptional regulator, LacI family |
Protein accession | YP_003678759 |
Protein GI | 297559785 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.119427 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCCAAGA CCGGTGCTGG TCGGAAGCGT CCGACCCTGG AGATGGTGGC GCAGCGCGCC GGGGTCGGAC GAGGGACCGT CTCGCGGGTG ATCAACGGGT CCGCGCAGGT GAGCCCCCGC ACCCGGGAGG CCGTGCACGC CGCGATCGCC GAACTCGGCT ACAGCCCGAA CCAGGCCGCG CGCACCCTGG TCACCAGGCG CACCGACACG ATCGCGCTGG TCGTCTCCGA GCCCCGGGAC CGGCTGTTCT CCGACCCCTT CTTCGCCGAC ATCATCCGCG GAGTGAGCTC GGTCCTGCAC GAGCGCGACC TGCAGCTCAT GCTCACCACG GCCCGGACCG AGGCCGAGCA CAAGCGCGTG GGCGACTACC TCAGCGGCTT CCACGTGGAC GGCGCGCTGC TGATCTCCCT GCACAGCGAC AATCCGCTCT CGGCCCGTCT GGACGAGGCC GGGGTGCCGG TCGTCCACGG CGGTCGCCCG CACTCGCCCG AACAGCCCGC GCCCTACTGC GTCGACATCG ACAACATCGG CGGGGCCCGG ATGGCCATCC GCCACCTCCT GGAGCGCGGA TGCCGACGGG TGGCCGCCAT CACCGGCCCC CTGGACATGA ACGCCGGTGT GGAGCGCCTG CGCGGCTACC GCGAGGTCAT GGCCGCCGCC GGACTGGAGG TGGACGACAG GCTCGTCGTG CAGGGCGACT TCAGCGTGGA GGGGGGAGCC GAGGCGATGG AGCGGCTCCT GGGCACCGGG CTGGAGCCCG ACGCGGTGTT CGCGGCCTCC GACATGATGG CGCTCGGCGG CCTGCGGGTG CTGCGCGCAC GCGGCCTGAG AGTTCCGGAG GACGTGGCCC TGGTGGGTTA CGACGACACC GTCATGGCCC AGCACAGCGA CCCGCCGCTG ACCACCATCC ACCAGCCCAC GGTGCAGATG GGGCAGGAGA TGGCGCGGCT GCTGGTGGAC GTGGCGATCC CCCGCACGAC GGAGGCCGAG ACCGTCATGC TCGGCACCCA CCTGGTCGTG CGCGAGTCCG GCTGA
|
Protein sequence | MAKTGAGRKR PTLEMVAQRA GVGRGTVSRV INGSAQVSPR TREAVHAAIA ELGYSPNQAA RTLVTRRTDT IALVVSEPRD RLFSDPFFAD IIRGVSSVLH ERDLQLMLTT ARTEAEHKRV GDYLSGFHVD GALLISLHSD NPLSARLDEA GVPVVHGGRP HSPEQPAPYC VDIDNIGGAR MAIRHLLERG CRRVAAITGP LDMNAGVERL RGYREVMAAA GLEVDDRLVV QGDFSVEGGA EAMERLLGTG LEPDAVFAAS DMMALGGLRV LRARGLRVPE DVALVGYDDT VMAQHSDPPL TTIHQPTVQM GQEMARLLVD VAIPRTTEAE TVMLGTHLVV RESG
|
| |