Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0756 |
Symbol | |
ID | 9244598 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 927046 |
End bp | 928140 |
Gene Length | 1095 bp |
Protein Length | 364 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | periplasmic binding protein/LacI transcriptional regulator |
Protein accession | YP_003678707 |
Protein GI | 297559733 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0825121 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCCCC ATCCCCGACG CACACCCCTC CTGGCCGCCG CACTGTGCGG CACACTCCTG CTCACCGCCT GCGGAGGCGT CGGCGACGCC GCGCGGCCCG AGGGCGAGGC CGGCGTCGTC GGCATCGCCA TGCCCACCCA GTCCTCCGAG CGCTGGATCA ACGACGGCGA GAACATGGTC GCCGAGTTCG AGGCCCGCGG CTTCGGCACC GACCTCCAGT ACGGCGAGGA CGTCGTGGAG GACCAGGTCT CCCAGATCGA GAACATGATC ACCCGGGGCG CCGACGTCCT GGTCATCGCC TCCATCGACG GCGAGGCCCT CGGCGACGTG CTCGACATGG CCGCCTCCAG CGACATCCCC GTCATCGCCT ACGACCGCCT CATCCTCGGC AGCGAGCACG TCGACTACTA CGCCACCTTC GACAACTTCC AGGTGGGCGT CCTCCAGGGC GAGTACATCG TGCGAGCCCT CGACCTGGAG AACGAGGAGG GTCCCTTCAA CATCGAGCTG TTCGGCGGCT CGCCCAACGA CAACAACTCC TCCTACTTCC TCGACGGGGC GATGTCGGTG CTCCAGCCCC ACATCGACGA CGGCCGCCTC GTCGTCCGCA GCGGCCAGAC CTCCATGGAG CAGATCGCCA CCCAGGAGTG GTCCGGCGCC GTCGCCCAGG ACCGCATGGA CAACCTGCTC AGCGCCCACT ACTCCGAGGA GGAGGTGCAC GCGGTCCTGT CGCCCTACGA CGGCATGAGC CTCGGCGTGA TCGAGTCCCT GCGCGCCGTC GGCTACGGCA CCGAGGACCG GCCGCTGCCC GTCATCACCG GCCAGGACGC CGAGGCCGCC TCGGTCCGGT CCATCATCGC CGGGGAGCAG ACCCAGACCG TCTTCAAGGA CATCCGGACC CTGGCCACCC AGACCGTGGA CATGGTCGAG GCCCTGGTAC AGGGTGAGGA GGTCCCGGTC AACGACACCG AGAGCTACGA CAACGGGGTC AAGGTCGTCC CCTCCTACCT GCTCGACCCC GTCTCGGTGG ACGCCGACAA CTACCACGAG GTCCTGGTCG AGAGCGGCTA CTACGAGGAG TCCGAGCTCC AGTGA
|
Protein sequence | MTPHPRRTPL LAAALCGTLL LTACGGVGDA ARPEGEAGVV GIAMPTQSSE RWINDGENMV AEFEARGFGT DLQYGEDVVE DQVSQIENMI TRGADVLVIA SIDGEALGDV LDMAASSDIP VIAYDRLILG SEHVDYYATF DNFQVGVLQG EYIVRALDLE NEEGPFNIEL FGGSPNDNNS SYFLDGAMSV LQPHIDDGRL VVRSGQTSME QIATQEWSGA VAQDRMDNLL SAHYSEEEVH AVLSPYDGMS LGVIESLRAV GYGTEDRPLP VITGQDAEAA SVRSIIAGEQ TQTVFKDIRT LATQTVDMVE ALVQGEEVPV NDTESYDNGV KVVPSYLLDP VSVDADNYHE VLVESGYYEE SELQ
|
| |