Gene Information |
Locus tag | Ndas_4293 |
Symbol | |
ID | 9248167 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 5108835 |
End bp | 5110007 |
Gene Length | 1173 bp |
Protein Length | 390 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | protein of unknown function DUF418 |
Protein accession | YP_003682188 |
Protein GI | 297563214 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
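The coordinate and length fields above agree with each other, and the arithmetic can be sketched briefly. This sketch assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that the CDS length includes the stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for Ndas_4293.
length_bp = gene_length(5108835, 5110007)
print(length_bp)                  # 1173
print(protein_length(length_bp))  # 390
```

Both results match the Gene Length and Protein Length fields listed in the record.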
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCATA CGTCCACCCC GGAGCCCCGC ACGGCTCCGG TCCCCGACCA ACCGCTCCCG CTCGGCCGCC GCTCCCTGGC CCCGGACCTG GCCCGCGGCG TCATGCTGCT GTTCATCGCG CTCGCCCACA CGCACCTGTT CACCGTGTTC ATCGGCGGCC CGCAGGGCGC TCCCAACCCC GCCGACCAGC TCACCACCGC GGCCACGGTC ATGTTCGTCG ACCTGCGCAG CTACCCCATG TTCGCTGCCC TGTTCGGGTA CGGCCTGGCT CAGATCCACC GCCGCCGCGG CCAACAGGGG CAGGACTGGC CCCGGACCCG CGGCCTGCTG CGCCGACGCG GCCTGTGGAT GGTCGCCTTC GGTCTGGCGC ACACGGTGCT GCTCTTCCCC GCCGACATCC TGGCCGTCTA CGGCCTGGTG ACGCTGGCCC TGGTGGGCGT CCTGCGGTTG CGCGACCGGA CCCTGGTGAT CCTGGCGGCG GCCTGGCTGC CCGTGGCCGC GGCCGTGCAC GCCCTGATCG CCGCCGAGGA CACCGTGACA GGACAGGGCA TGCCGCTGAT GCCCGACGGG TTCGCGGACG AACTCCTCTT CCGGGTGACC CTGTTCTCGG TCCTGGCCGT GGTGATGTTC GCCAGCACGC TCGTCCCGTT CCTGATCGGT GTCCTGGCGG CCCGGCACCG GATCCTGGAG CGGCCCCACG AGCACCTGCG ACTGCTGCGC GCCACCGCGT TCGCGGGCAT CCCCCTGGCG GCGCTGGGCG GCCTCCCCCT GGCCCTGGAC AAGGCGGAGG TGTGGACCGG CGCCACGACC GGGGACCTCG TCGCCGCCAC GGCACTGCAC CAGGTGAGCG GCTACGCGGG CGCGCTGGGC TACGCCGCGC TCATCGCCCT GGTCGCGGTC CGCCTCACGG GCCGAGAGGG GCCGGTGACC GACGCCCTGG CCGCGCTCGG ACAGCGCTCC ATGACCTTCT ACCTGGCCCA GTCCATGGCC TGGGCCGTGC TGTTCTCCTC CTACACGCTC GACCTGCACA TGACGTCGCC CGCCGTCGGC GCGGGCGTCG CGGTGGCCGT GTGGCTGGCC ACGGTGCTGC TCGCCGACCT CATGCGCCGC CGGGGCGTGC GCGGACCCGC CGAGGTGGTG CTGCGCCGCC TCACCTACGG GCCGTCGCGC TGA
|
Protein sequence | MSHTSTPEPR TAPVPDQPLP LGRRSLAPDL ARGVMLLFIA LAHTHLFTVF IGGPQGAPNP ADQLTTAATV MFVDLRSYPM FAALFGYGLA QIHRRRGQQG QDWPRTRGLL RRRGLWMVAF GLAHTVLLFP ADILAVYGLV TLALVGVLRL RDRTLVILAA AWLPVAAAVH ALIAAEDTVT GQGMPLMPDG FADELLFRVT LFSVLAVVMF ASTLVPFLIG VLAARHRILE RPHEHLRLLR ATAFAGIPLA ALGGLPLALD KAEVWTGATT GDLVAATALH QVSGYAGALG YAALIALVAV RLTGREGPVT DALAALGQRS MTFYLAQSMA WAVLFSSYTL DLHMTSPAVG AGVAVAVWLA TVLLADLMRR RGVRGPAEVV LRRLTYGPSR
|
| |
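The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TGA, even though the gene lies on the minus strand) and uses the amino acid assignments of the standard genetic code, which translation table 11 shares; table 11 differs only in the set of permitted start codons. The helper name `translate` is hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; same assignments as table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in the record.
print(translate("ATGTCCCATA CGTCCACCCC GGAGCCCCGC"))  # MSHTSTPEPR
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to the same function should reproduce the complete protein under the stated assumptions.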