Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1012 |
Symbol | |
ID | 9244858 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1237292 |
End bp | 1238308 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | transcriptional regulator, LacI family |
Protein accession | YP_003678961 |
Protein GI | 297559987 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.03718 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.932979 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGAGC CCGAACCGCC CAGGCGCCGA CCGACCATCA TGGAGGTCGC CCGCCTGGCG GGCGTCTCCC ACCAGACCGT CTCGCGCTAC CTGCGCTTCG AGGGCGGGCT CAGGGAGGCG ACCCGCGAGC GCGTCGACGC CGCGATCCGG GAGCTCAACT ACCGCCCCAA CCTCGTGGCC CGGTCGATGC GCACGCGCCG GACCGGGCGC CTGGCGATCC TCCTGCCCGG TGTGCCCGGC GCCAGCCCGA GCCGGTTGCT GGCCGGGGCC ATCGCGACCG CGCACGCCGA GGGGTTCGTC GTCGAGGCGG TGAGCGTGGA CGGCGGGGTC GGGGCCCGGA CCGGGCGCAT GCTCGAACTC GCCGGGTCGG GCCAGGTGGA GGGCGTCCTG TCCCTGGCCC CCGTCGCCCC CGACAGCCTC CGGGCCGCGG GCGAGGGCGC CTCCGTGGTC GTCTCCGCCG ACTACGACGA CGAGATGCGC GGTCTGGGCG AGATCGCCGA CGGGTCGGTC GTCGCCGACC TCGTCGAGGG GCTGGCCAAG GCCGGGCACC GGCGCTTCCT GCACGTGTCC GGCCCGCTCC AGTACGCCTC GGCGCGGGGG CGCAGACAGA CCTACCTGGA GGCGGTCGAA CGCCTGGGGC TGGAGTCCCA CGGCGTGTTC GACGGCGACT GGTCGGCCGA GTCCGGCGCC GAGGCCGTGC GCTCCCTGCC CGAGGACAGC GGGGTCAGCG CCGTCATCGC GGGCAACGAC GTGGTCGCGG CGGGCGCGGT CCGCGCGGCG ATGGAGCGCG GGTGGAGCGT GCCGGGCGAC CTGAGCGTGA CGGGGTGGGA CAACAACCCC GTGGGCGCCT ACCTGTCTCC CTCGCTCACG ACGGTCGACG TGGACCTCGA ACGGCTGGGC GTCAACGCCA TGCGCCGCCT GGTCGCGGCC GTGCGGGGCA CGGTGGCGGA GGTCGGACGG GAGCCGCTCA ACACGATCCT GTGGCGCGAG TCCACCGGCC CGGGCCCCTG GCGCTGA
|
Protein sequence | MAEPEPPRRR PTIMEVARLA GVSHQTVSRY LRFEGGLREA TRERVDAAIR ELNYRPNLVA RSMRTRRTGR LAILLPGVPG ASPSRLLAGA IATAHAEGFV VEAVSVDGGV GARTGRMLEL AGSGQVEGVL SLAPVAPDSL RAAGEGASVV VSADYDDEMR GLGEIADGSV VADLVEGLAK AGHRRFLHVS GPLQYASARG RRQTYLEAVE RLGLESHGVF DGDWSAESGA EAVRSLPEDS GVSAVIAGND VVAAGAVRAA MERGWSVPGD LSVTGWDNNP VGAYLSPSLT TVDVDLERLG VNAMRRLVAA VRGTVAEVGR EPLNTILWRE STGPGPWR
|
| |