Gene Information |
Locus tag | Ndas_0475 |
Symbol | |
ID | 9244314 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 572257 |
End bp | 573240 |
Gene Length | 984 bp |
Protein Length | 327 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | transcriptional regulator, RpiR family |
Protein accession | YP_003678428 |
Protein GI | 297559454 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.227753 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGAAA TCCCGAAGCC TGCGGGCGGC CGGGCGGAGA GCGGACACGT CTCCGCGCCG TCCGTCCCCA CGACGGTCCT GCGCATCCGG TCGCTCCTGC CCTCCCTCGC GCCCGCGGAG CGACGGGTCG CGCAGCACGT CGTCGACGAC CCCGAGCGGG CCGCCGCCTC CTCCATCACC CAGCTCGCCA AGGACTGCGC CACGTCCGAG GCCACCGTGA TCCGGTTCTG CCGCACCATC GACTTCAGCG GCTACCGGGA GCTGCGCCTG GCCCTGGCCA CCGAGGCCGG TCAGGCGCGC GGCGCCCGCG GGGCGGCTCC GGAGCTGTCC AGCGACATCA ATCCCGACGA CACCCTCGTC ACGGTGGTGC AGAAGATCGC CTACACCGAC GCGCGCGCGG TGGAGGAGAC CGGCGCCGCC CTGGACGTGG AGGTGCTGCG CACGGTCATC GACACGATGG CGGGCGCGCG GCGCATCGAC GTGTACGGGG TGGGCGCGAG CGCGTTCGTC GGCGCCGACC TCCAGCAGAA GCTGCACCGG ATCGGGCTCA CCTCGTTCGC GTGGTCGGAC GCGCACGTGA TGCTCACCAG CGCGGCGCTG CTGGACGAGC GCGACGTGGC GATCGGCATC TCCCACAGCG GAACGACCAT AGACACGGTG CAGGCGCTCA CGGAGGCGGG GCGGCGCGGC GCCCGGACCG TCGCCGTCAC CAACTTCCCC CGCTCGCCGA TCGGGTTCGC CGACCACGTG CTGACCACCG CGGCCCGCGA GACCACGTTC CGCTCGGGGG CCACCGCGAG CAGGCTCGCG CAGCTGACCG TGGTGGACTG CCTGTTCGTG GGCCTGGCGC AGAGCCGCTA CACCGACAGC CGCACCGCGC TGGAGACGAC CTTCGAGGCG GTGCGGGGGC TGCGGATCAA CGACGACCGC AGACGCCGCA GGGGCGAGGC CCCCGGCGGG AGCACGGACG ACAACGACGG CTGA
Protein sequence | MAEIPKPAGG RAESGHVSAP SVPTTVLRIR SLLPSLAPAE RRVAQHVVDD PERAAASSIT QLAKDCATSE ATVIRFCRTI DFSGYRELRL ALATEAGQAR GARGAAPELS SDINPDDTLV TVVQKIAYTD ARAVEETGAA LDVEVLRTVI DTMAGARRID VYGVGASAFV GADLQQKLHR IGLTSFAWSD AHVMLTSAAL LDERDVAIGI SHSGTTIDTV QALTEAGRRG ARTVAVTNFP RSPIGFADHV LTTAARETTF RSGATASRLA QLTVVDCLFV GLAQSRYTDS RTALETTFEA VRGLRINDDR RRRRGEAPGG STDDNDG
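
The coordinate, length, and GC fields in this record are related by simple arithmetic, which the following minimal Python sketch illustrates. It assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon of the CDS is a stop codon. The function names are hypothetical helpers written for this illustration and are not part of any database tooling.

```python
# Coordinates copied from the record above.
START_BP = 572257
END_BP = 573240


def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def clean(seq):
    """Strip the spaces that split the displayed sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # prints "984 327"
```

Passing the Gene sequence field to `gc_percent` gives a value that can be compared with the GC content listed in the record. Because the displayed sequence already begins with ATG and ends with TGA, it appears to be shown in coding orientation even though the gene lies on the minus strand.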