Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3544 |
Symbol | |
ID | 9247413 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4252806 |
End bp | 4253762 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | transcriptional regulator, AraC family |
Protein accession | YP_003681451 |
Protein GI | 297562477 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.545256 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCCCC TCGTGCACCT CCTGGACGGA CCACGCGCGC GCGGCGCCTT CACCCTGCGC ACCGTCATGA GCCCGCCCTG GGGCATCGAC ATCCGCGACG AGGCCGCGCT CACCCTGGTC GTGGTCACCT CCGGGCGCGC CTGGGTGGCG GCGAACGGAG AGGAGGCCGA GGCCGGTCCC GGCGACCTGG TGCTCGTGCG CGGCCCGGAG CCCTACGTGA TCGCCGACAG CCCGGGCCGG GAGCCCACCG TCACCGCTCT GCCGGGTGCG CGGTGCGTGG CCCGCGACGG CGCGGAGGTC CACGTGTCCA TGAGCCACGG CGTCGGCACC TGGGGGAACG ACCCCGACGG CACCGACACC ATGGTCGTGG GCGCCTACCT GGACGACAGC GAGGTCGGCC GCCTCGCCCT GGCCGCGCTG CCGTCGCTGG CGGTGCTCCC CGAGGGCCGG GTCGACCCCG CGCTGGTGGA ACTGCTCGCA CGCGAGATCA CCACCGGCCA CGTCGCCCAG ACCAGCCTCA ACGACCGGCT CCTGGACTGC CTGCTGGTGA TGGCGGTGCG CGCCTGGCTG GAGGACAACC CCGACAGCGC GCCCAACTGG CTCACCGCCC GCAGCGACCC CGTCGTCGCG CGGGCGCTGG AGCTGGTCCA CGAGCGCGTC GCCGACCCCT GGACGCTGGA GTCCCTGGCG GGGGAGTGCG CGGTCTCCCG CGCCACCCTG GCCGCGCGCT TCCAGCGGGC CGTGGGCACC CCTCCGATGA CCTACCTGCG CACCTGGCGG CTGACGGTGG CCTGCGACCT GCTCACCGCC AGCCCGGGGC TGGGCCTGGA GGCCGTGGCC GCACGGGTGG GCTACGGGAG CGCGTTCGCG TTCAGTTCCG CGTTCAAGAA CCACACGGGC GCGAGCCCGT CCGCGTACCG GGCCGCCCGG GCCGAACGCG TGCCGCAGGA GGCCTGA
|
Protein sequence | MDPLVHLLDG PRARGAFTLR TVMSPPWGID IRDEAALTLV VVTSGRAWVA ANGEEAEAGP GDLVLVRGPE PYVIADSPGR EPTVTALPGA RCVARDGAEV HVSMSHGVGT WGNDPDGTDT MVVGAYLDDS EVGRLALAAL PSLAVLPEGR VDPALVELLA REITTGHVAQ TSLNDRLLDC LLVMAVRAWL EDNPDSAPNW LTARSDPVVA RALELVHERV ADPWTLESLA GECAVSRATL AARFQRAVGT PPMTYLRTWR LTVACDLLTA SPGLGLEAVA ARVGYGSAFA FSSAFKNHTG ASPSAYRAAR AERVPQEA
|
| |