Gene Information |
Locus tag | Ndas_2029 |
Symbol | |
ID | 9245879 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2449082 |
End bp | 2450113 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | transcriptional regulator, LacI family |
Protein accession | YP_003679961 |
Protein GI | 297560987 |
COG category | |
COG ID | |
TIGRFAM ID | |
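The coordinate and length fields above agree with one another, which a little arithmetic confirms. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the convention the reported lengths imply) and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(2449082, 2450113)
print(length)                  # 1032
print(protein_length(length))  # 343
```

Both results match the Gene Length (1032 bp) and Protein Length (343 aa) fields.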
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.259814 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAGTCCG GTTACGTGAC GCTGGAACAG GTCGCCGAGC ACGCCGGGGT GTCCCTGGCC ACGGCGTCCA GGGTGATCAA CGGCAGCACA CGCCAGGTCA GTCAGCGCCT GCGCGACAAG GTGACCGCCA GCGCGCGCGA ACTCGGCTAC CTGGCCAACG CCTCGGCCCA GACCCTGGCC CGCAACAGCA GCGCCCTGGT CGGCCTGCTC GTCCACGACA TCTCCGACCC CTACTTCTCC TCGATCGCCG CCGGGGTCAC CCGCCACACC GAGGAACAGG GACTGGTCCT GGTGCTGGGC ACCACCAACC GCTCCGCGCA GAAGGAGGGC CGCATCCTGG CCACGCTGCG CGCGCACCGG GCCCGCGCGG TGGTGCTGGT CGGCTCGCGC TCCACCGACG AGGAGAGCAA CCGGCGCCTG TCCGAGGAGA TCCGCATGTT CCGCCGCCAG GGCGGGCGGG TGGCGTGCGT GTCCCAGGAG GGCCTGCCCG CGGACACGGT CACCCCCGAC AACCACACCG GGGCCTCCGA CCTGGCCCGC CGCCTCATCG CCCAGGGGCA CCGGGAGTTC GCCATCCTGG CCGGGCCCAC CGACCTCCAG ACCGCCCGCG AACGCCTGGA CGGGTTCCAC TCCGCGCTCT CGGGCGCGGG GCTGGAACTG GCCCACCACA ACGTGGTGCA CGGCGCCTTC ACCCGCGACG GCGGCTACGA GTCCACCCGC CGCCTGATGG CGGTGGGCAC CGACGCCACG TGTCTGTTCG CCGTCAACGA CGTCATGGCC ACGGGGGCGA TGGCCGCGCT GCGCGACCTG GGCCTGCGGG TGCCCACCGA CCTGTCCGTG GCCGGGTTCG ACGACATCCC CACCCTGCGC GACCTCACCC CCGCCCTGAC CACGGTGCGC CTGCCGCTGG AGGAGATGGG CGAACGCGCC GCTGTGCTCG CCCTGGACGG CGATCCCAGC GACCAGCCCC GCGTGGTCAC CGTGCGCGGC GAGGTCGTCG AGCGCGAGAG CACCGCGCCG CCCACCCGCT GA
|
Protein sequence | MESGYVTLEQ VAEHAGVSLA TASRVINGST RQVSQRLRDK VTASARELGY LANASAQTLA RNSSALVGLL VHDISDPYFS SIAAGVTRHT EEQGLVLVLG TTNRSAQKEG RILATLRAHR ARAVVLVGSR STDEESNRRL SEEIRMFRRQ GGRVACVSQE GLPADTVTPD NHTGASDLAR RLIAQGHREF AILAGPTDLQ TARERLDGFH SALSGAGLEL AHHNVVHGAF TRDGGYESTR RLMAVGTDAT CLFAVNDVMA TGAMAALRDL GLRVPTDLSV AGFDDIPTLR DLTPALTTVR LPLEEMGERA AVLALDGDPS DQPRVVTVRG EVVERESTAP PTR
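The relationship between the two sequences above can be illustrated with a short script. This is a sketch under stated assumptions, not the database's own pipeline: it assumes the gene sequence is already given in coding orientation (consistent with the GTG start and TGA stop shown), that translation table 11 applies as listed in the record, and that an initiation codon at position 1 is read as methionine. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
# Bases in NCBI order, with the amino acids of translation table 11 in that order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons of table 11; each is read as Met when it opens the CDS.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Drop the spaces that separate 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding-orientation CDS, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residues.append("M")  # alternative start codons still initiate with Met
            continue
        amino_acid = CODON_TABLE[codon]  # raises KeyError on ambiguous bases
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
prefix = "GTGGAGTCCG GTTACGTGAC GCTGGAACAG"
print(translate(prefix))  # MESGYVTLEQ
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison against the protein sequence and GC content fields of the record.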