Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1373 |
Symbol | |
ID | 9245223 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1687936 |
End bp | 1688934 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | periplasmic binding protein/LacI transcriptional regulator |
Protein accession | YP_003679311 |
Protein GI | 297560337 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.981661 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCAGAC GACCGCCGCG CCCCCCGGGC CGCGCGCTCG CCGCTCTCAC CGCCGCCCTG TGCGTGCTGC TGCTCGCCGG ATGCGGCGCC AGGCTCGGCG AGGACGGCCG CATCGGCGTC GTCTACATGG ACGCGCAGGG CTTCTACGCG GGAGTCCGCG AGGGCATGCA GAACTACGCC GACGACTCCG GACGCGAGCT CCAGCTCCTG GAGCTCAACG CCCGCGGAGA CGCCTCGGAG GAGAGCACCT TCGTGGACGT CGTCTCCTCG GCCGACGTCG ACGCCCTGGT GCTGTCCCCC GTCTCGGCCA CCGCCTCGGT GCCCGCCGTG CGCCTGGCGC ACGAGAGCGG GATCCCGGTG ATCTGCTACA ACACCTGCAT CGAGGACGAG GCCGCCAAGG AGTACGTGAC CTCCTACATC CTGGGCGACC CCCACGAGTT CGGGCGCCTG CTGGGTGACG CCGCCGCGGA CCACTTCGAG GACGAGGGAG TGGAGGACCC CCAGATCGGC GTCCTCAACT GCGAGTTCGT CGAGGTCTGC GTGCAGCGCC GCGAGGGCTT CGAGGAGGCG CTCTTCGCCC GCCTGCCCGA CGCCCGGATC GTCGCCAACC AGGAGGGCGC CACCATCGAC GAGGCCGTCA ACGTGGGCGA GCGCCTGCTC ACCGCCCACC CCGACCTCGA CGCCTTCTAC GGGGAGGCGG GCGGCGCCAC CATGGGCGCG GTCCGCGCCG TCACCAACCG CGGCCTGGCC GGGGAGGTCG TGGTCTTCGG CAGCGACATG TCCACCGACG CGGCCCGCGC GCTGTCCGAC CACCGCATCC TCAAGGCCAA CGTCGACATC TCCGGAATCG CGGTCGGCCT GCTGGCCGGG GAGACGGTCG AGCGCATCAT CGCGGGCGAG GCGCCCGAGG AGTTCGTCAC CGAGGCGCCC ATCGACCTCT ACACCACGCC CGAGGACGGC GAGGAGTGGC TGGAGGAGCA CCCGGACGGC GTCCCCTAG
|
Protein sequence | MGRRPPRPPG RALAALTAAL CVLLLAGCGA RLGEDGRIGV VYMDAQGFYA GVREGMQNYA DDSGRELQLL ELNARGDASE ESTFVDVVSS ADVDALVLSP VSATASVPAV RLAHESGIPV ICYNTCIEDE AAKEYVTSYI LGDPHEFGRL LGDAAADHFE DEGVEDPQIG VLNCEFVEVC VQRREGFEEA LFARLPDARI VANQEGATID EAVNVGERLL TAHPDLDAFY GEAGGATMGA VRAVTNRGLA GEVVVFGSDM STDAARALSD HRILKANVDI SGIAVGLLAG ETVERIIAGE APEEFVTEAP IDLYTTPEDG EEWLEEHPDG VP
|
| |