Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0512 |
Symbol | |
ID | 9244353 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 629995 |
End bp | 631200 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | transcriptional regulator, XRE family |
Protein accession | YP_003678465 |
Protein GI | 297559491 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCCCA AACCGTTCCG TCTGGTGCTC CGTGAACTCC GGTCCGCGCG TGGTTGGTCG CAAGCGCGGT TGGCGGAAGC GCTGTGTGAC GTGTCGGGAC GTCCAACGGT GACCCGACAC GATGTGTCGC GGTGGGAGCG GGGGAAGCGT GTCCCGCGTG CGTGGCTTCC CTACCTCGCG GAGGTTCTGG ACGTCTCCCG GGAGACGTTG GAACGGGCTA CGGTCACGGA GCCGGAGCCC TTTCCGGTTG CGGAAACGCT GGCGTCTCTC TTGCCTCCGG GGGAGGCTGT CGCGCCGCTA CAGGCCCGAG CAGGCCGGAG GGTGGGGCAG ACGACGGCGG ATGATCTGGC GACCCGTGCG CACGGTCTGC GGCTTGCCGA CGACGTTCTA GCCGGAGGCG ATTTGATCGG TCCGGCCTTC CGGGAACTGG ACGCGGCCGT TCGCGTCCTC CGGGAATCGA CGCACACGGA CGAGGTCCGG CGGGAACTAC TCCGGGCGGT TGGTGAACTC GCGCAGATAG CCGGATGGAT TGCCAGCGAC GCGGCGGACT CCCGGGCGGA GGGGGCCTAT CGGCTTGGGC TGGACGCGGC ACGGGAAGCC GGGGACGGCC CGTTGGCCGC GCAGCTTGCC GGGTCTCTCG GCTACCACTT GGTGAACAAC GGACGTGTTG CCGATGGGGC CGCGCTGTCG GTCGCGGCCG TGGCGGAGGC GGGACCGGAC GCTCCCGGGA AGACGCGGGC ACTGTTCCAT GACCGGGCCG CGTGGGCCCA TACACAAGCC GGGGACGCGC AAGCCGCTAT GCGGTCGTTG GGGGCCGCCC ACGAGGCACT AGCGGAGGAC AGCGGGGACA CTCCGGAGTG GGCTTACTGG GTGAACGAGG CGGAACTAGA GGTCATGGAT TCCCGCGTCT ACACGGAACT CCGCCGTCCC CTGCGCGCGG TTCCCCTGCT CTCTCGTGTC CTCCGTGAAT ACCCGGCCAC GTCCACAAGA GAGCGGGCTT TGTATGAATC GTGGCTTGCC GTGGCCTACG CGGACGCGAA CGAACCAGAA GAGGCCGCAC GCGTCGCGGC CCGCGTGATC GAACTATCCG GAGACGTGGC ATCCGCGCGG ACATCGGACC GGGTCCGTGT CGTGCTGTCG CGACTTGCCG ACTTCCCAGA CGTCCCGGAG GTCCGGGAAG TGCTGGACAG CGTCGGTCCG GCTTGA
|
Protein sequence | MAPKPFRLVL RELRSARGWS QARLAEALCD VSGRPTVTRH DVSRWERGKR VPRAWLPYLA EVLDVSRETL ERATVTEPEP FPVAETLASL LPPGEAVAPL QARAGRRVGQ TTADDLATRA HGLRLADDVL AGGDLIGPAF RELDAAVRVL RESTHTDEVR RELLRAVGEL AQIAGWIASD AADSRAEGAY RLGLDAAREA GDGPLAAQLA GSLGYHLVNN GRVADGAALS VAAVAEAGPD APGKTRALFH DRAAWAHTQA GDAQAAMRSL GAAHEALAED SGDTPEWAYW VNEAELEVMD SRVYTELRRP LRAVPLLSRV LREYPATSTR ERALYESWLA VAYADANEPE EAARVAARVI ELSGDVASAR TSDRVRVVLS RLADFPDVPE VREVLDSVGP A
|
| |