Gene Information |
Locus tag | Ndas_1087 |
Symbol | |
ID | 9244933 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1336321 |
End bp | 1337373 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | transcriptional regulator, LacI family |
Protein accession | YP_003679035 |
Protein GI | 297560061 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
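The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch illustrates. It assumes that Start bp and End bp are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq.

    Whitespace and any other characters are ignored, so the
    space-grouped sequence from the record can be passed directly.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc_count = sum(b in "GC" for b in bases)
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    length = gene_length(1336321, 1337373)
    print(length, protein_length(length))  # prints "1053 350"
```

With the values from this record, the sketch reproduces the listed Gene Length (1053 bp) and Protein Length (350 aa). The GC content field can be checked in the same way by passing the gene sequence to `gc_percent`.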
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.978331 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGCACGA GGCCGCGTAT CAAGGACGTC GCGCGTCAGG CCGGGGTCTC GGAGAAAACC GTTTCCAACG TTATTAACGA CCATCCGCAC GTCCGGCCCG CCACGCGCGC CGCCGTGGAG GCGGCCATCG ACGCGCTGGG CTACCGGGTC AACCTGGCCG GACGCCACCT GCGCCGCGGC CGGACCGGTG TGATCGCGCT GGTCGTGCCC GAACTCGACC TGGGCTACTT CGCCGAACTC GCCGACCTGG TCATCCGCGA GGCCGAGCGC CTCTCGCGCA CCGTCCTGGT CCACCAGAGC GAGGCCCGCC GCGAGCGCGA GGAGTCCGCG CTGGAGGGGT TCGGCGCCGA CTTCGTGGAC GGGGTCATCC TCAGCCCGCT CGCCATGGAC GACGCCGCCC TGCGCACGCA CCCCTCCCGG CTGCCGGTGG TGCTCCTGGG CGAGCTGCCC CGCACGGTCC GGCACGGCCA CGTCGCCATC GACAACGTGG CCGCCGCCCG GGAGGCGACC GAGCACCTGC TCGACGGCGG GCGCACCCGG ATCGCGGTGG TGGGCGGTCG GCCGCCGGGC CCCTCCGGCA CGGCCGAGCT GCGCACCCGC GGTTACCGCG AGGCCCTGGA GGCGCGGGGG AGGAGCTACG ACCCCGAGCT GGTGCGACCG GCCGGGCACT TCCACTGGAG GGACGGGGCG GAGCTGGCCG CCGAACTCGT CGCCGGACCC AGGCCGCCCG ACGCGCTGCT GTGCATGAAC GACCTGCTGG CGCTGGGGGC GATGCGCGCG CTGCACGACG CAGGGGTGCG GGTCCCCCGG GACGTGGCGG TGGTGGGGTT CGACGACATC GCCCCCGGGC GCTACTCCGT GCCGAGCCTG ACCACCGTCG CCCCGGACAA GCCGGGCCTG GCCAGGGAGG CGGTGCGGCT GCTGCTGGAG GAGGTGGAGG CCCGTCGCGG CGCTCCGGAC GCGGAGTCCG GGGCCGGGGC CGACCGGGCC TCGGCCAAGG TCGTCGTCGG CCACACCCTC CTGGTGAGGG AGAGCAGCGC CAGCGTGCTC TGA
|
Protein sequence | MGTRPRIKDV ARQAGVSEKT VSNVINDHPH VRPATRAAVE AAIDALGYRV NLAGRHLRRG RTGVIALVVP ELDLGYFAEL ADLVIREAER LSRTVLVHQS EARREREESA LEGFGADFVD GVILSPLAMD DAALRTHPSR LPVVLLGELP RTVRHGHVAI DNVAAAREAT EHLLDGGRTR IAVVGGRPPG PSGTAELRTR GYREALEARG RSYDPELVRP AGHFHWRDGA ELAAELVAGP RPPDALLCMN DLLALGAMRA LHDAGVRVPR DVAVVGFDDI APGRYSVPSL TTVAPDKPGL AREAVRLLLE EVEARRGAPD AESGAGADRA SAKVVVGHTL LVRESSASVL
|
| |
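The relationship between the gene sequence and the protein sequence above can be sketched as a codon-by-codon translation. This is a minimal illustration, not the pipeline used by the source database. It assumes that translation table 11 shares the standard codon assignments, and that a GTG, ATG or TTG in the first position is read as methionine (table 11 lists further initiators that this simplified set leaves out). The names `translate_cds` and `ASSUMED_START_CODONS` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a simplified set of initiator codons, each read as Met
# when it occurs as the first codon of the CDS.
ASSUMED_START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for index in range(0, len(dna), 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in ASSUMED_START_CODONS:
            residues.append("M")  # initiator codon read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First 21 bases of the gene sequence in this record.
    print(translate_cds("GTGGGCACGA GGCCGCGTAT C"))  # prints "MGTRPRI"
```

The first seven codons of the gene sequence give MGTRPRI, which matches the start of the listed protein sequence; the leading GTG is read as methionine rather than valine because it is the start codon.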