Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4935 |
Symbol | |
ID | 9248822 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | + |
Start bp | 72582 |
End bp | 73715 |
Gene Length | 1134 bp |
Protein Length | 377 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | NMT1/THI5 like domain protein |
Protein accession | YP_003682824 |
Protein GI | 297563851 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0962986 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACACCC GACGCGACAT CCTGCGCTAC ACCGCGGCCG CCGGGGCCGC CCTCCCCCTG CTCTCCGCAT GCGGACCCCC GGGGAGCGGC GCGGCCGGAG CCTCCCCCGT GATCCGCTAC CAGGGCTGGA CCGGAGACGT CCTCCTGCCC GAACTCGCCG AGGACCTGGG CTACCTGGAC GGCATCGGGC TGGAGTGGAT CGGCGACACC ACCAGCGGCC CCCAGGACAT CCAGGCCGCG GCCACCGGCA GCACCGATGT GGGCGGCGCC TTCAACGGGG CGATCGCCAA GCTGGCCGCC GCCGGGGCGC CCGTCACCGC CGTCCTGGCC TACTACGGGG CGGACGAGGA GACCCACAAC GGCTACTACG TCCTGGAGGA CAGCGACATC ACCGGGGCCC GCGACCTCGT CGGCAAGCGG GTCTCCATGA ACACCCTGGG CGCCCACCAC GAGTTCGTGG TCCGCGAGTG GCTGGCCAGG GAGGGGCTGA CCAACGAGGA GATCGCCCGG GTGGAGCTGA CGGTGGTCCC GCCGGTCAAC GCCGAGCAGA CCCTGCGCAA CGGGCAGGTG GAGGTCGCCA CGCTCGGCGA CCTGCTGCGC GAGGTCGCCC TGGAGCGCGG CGGCATCCGG CCCCTGTTCA CCGACCACGG CCTGTACGGC GCCTTCAGCT ACGGCTCCCT CGTGCTGCGC GACGACTTCA TCGAGGCACA CGAGGACACC GTCCAGGCCT TCGTCGGCGG GGTCGCCCGC GCGATCCGGT GGACGCAGAC CACCCCGCGC GAGGAGGTGG TGGACCGCTA CACCGACATC ATCGGGCGGC GTGGCCGCAA CGAGAGCGCC GAGGCCGTCC GGTACTGGCG CAGCACCGGC GTCGCCGGAC CCGGCGGCGT CATCGCCCCG GACGAGTTCC GGACCTGGAT CGACTGGCTG GTCCGCAACG GCGAACTCGA CGAGGGGGCG GTCGAGGCCG AGGAGCTGTT CACCAACGAC TACAACCCCT ACGCCAACGG GACCTACCCC GAGGACTCCG GCCCCGACGG CCGACCCCTC GCGGAGGGCG GTGCTCCCGG GGACGGCGCC TCCGGGGGAG CGGACGACAC ACGGCGCGCG GCCGGGGACG GGGGGAACCG ATGA
|
Protein sequence | MHTRRDILRY TAAAGAALPL LSACGPPGSG AAGASPVIRY QGWTGDVLLP ELAEDLGYLD GIGLEWIGDT TSGPQDIQAA ATGSTDVGGA FNGAIAKLAA AGAPVTAVLA YYGADEETHN GYYVLEDSDI TGARDLVGKR VSMNTLGAHH EFVVREWLAR EGLTNEEIAR VELTVVPPVN AEQTLRNGQV EVATLGDLLR EVALERGGIR PLFTDHGLYG AFSYGSLVLR DDFIEAHEDT VQAFVGGVAR AIRWTQTTPR EEVVDRYTDI IGRRGRNESA EAVRYWRSTG VAGPGGVIAP DEFRTWIDWL VRNGELDEGA VEAEELFTND YNPYANGTYP EDSGPDGRPL AEGGAPGDGA SGGADDTRRA AGDGGNR
|
| |