Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2051 |
Symbol | |
ID | 9245901 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2473020 |
End bp | 2474054 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | transcriptional regulator, AraC family |
Protein accession | YP_003679983 |
Protein GI | 297561009 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0974748 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCAAG GATCCTCGCA CCGCGTGGCC CTCGTCGTGG ACGAGGGCTC CAATCCCTTC GAGATGAGCG TGGCCACAGA GCTGTTCGGG CTGCGCCGCC CCGAGATCGA CCGCCCCTGG TACGACCTGA CGGTGTGCGC GGCCGTGCCC GAGGTGCGCG TGCACCGGGG CATGTTCACC CTCGGCGGCC TGGCCGGGCT GGAGGCGGTC GCGAAGGCCG ACACCGTCAT CGTGCCCAAC CGCCCCGACC CCGAGGTGCC GCCCTCCCCG GAGGTGGTGG AGGCCGTGCG CGCGGCGGAC CGGCGCGGCG CCCGTCTGGT GAGCTTCTGC AGCGGCGCGT TCACCCTGGC GGCGGCCGGG GTCCTGGACG GCCGCCGGGC CGCCACGCAC TGGCGGTTCG AGGAACTGTT CGCCCGGACC TACCCCCGCG TCCGGCTGGA GCGGGACGTG CTGTTCGTGG ACGAGGGCCG TGTGCTCACC GCGGCGGGCA GCGCCGCCGC CCTGGACCTG GGGTTGCACC TGATCCGCCG CGACCACGGG GAGGGGGTCG CCAACACCGT CAGCAGACGG CTGGTGTTCA CCGCCCACCG CGAGGGCGGG CAGCGCCAGT TCGTGCCCAG GCCCGTCCCC GAGGCAGCCG ACACCTCCCT GTCCCCGCTC CTGGACTGGG CGCGCGAGCG CCTGGACTCC CCTCTCACCG TCGCCGGCCT GGCCGCCCGG GCGTCGGTCA GCCCCGCGAC CCTGCACCGC CGTTTCCGGG CCGAGCTGGG CACGACCCCG CTGGCCTGGC TGCGCGCCGA GCGGGTGCTG CTGGCCTGCC GCCTGATCGA GGCGGGCGGG ATGGGGCTGG AGACGGTCGC CCGCGCCAGC GGCCTGGGCA GCACCGCCAA CCTGCGCGCG AGCGTGCGCG CCCACACCGG GGTCAGCCCC TCCGCCTACC GGGAACGCTT CGGGCCGAGG CAGCTGGCTG GGACTTCAGG CCCTCCGGTC CTTCCCTCAC CGGACGCCGA ACGCGTAGCC TCCGGGCATG TCTGA
|
Protein sequence | MPQGSSHRVA LVVDEGSNPF EMSVATELFG LRRPEIDRPW YDLTVCAAVP EVRVHRGMFT LGGLAGLEAV AKADTVIVPN RPDPEVPPSP EVVEAVRAAD RRGARLVSFC SGAFTLAAAG VLDGRRAATH WRFEELFART YPRVRLERDV LFVDEGRVLT AAGSAAALDL GLHLIRRDHG EGVANTVSRR LVFTAHREGG QRQFVPRPVP EAADTSLSPL LDWARERLDS PLTVAGLAAR ASVSPATLHR RFRAELGTTP LAWLRAERVL LACRLIEAGG MGLETVARAS GLGSTANLRA SVRAHTGVSP SAYRERFGPR QLAGTSGPPV LPSPDAERVA SGHV
|
| |