Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3529 |
Symbol | |
ID | 9247398 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4238110 |
End bp | 4239306 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | protein of unknown function DUF418 |
Protein accession | YP_003681436 |
Protein GI | 297562462 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.296308 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTCTTC CCCCCTCCCC GCCGCCTTCG ACCGGGGGCG TGGCACTCGA CCGGCGCTCC CTGGCGCCCG ACCTGGCACG CGGGGTGATG CTGCTGGTCA TCGCCGTCGT GCACGCGCAC ATCGTCTGGG TGATGGCCAC CGCGGGTCAG AGTCCCGCCT CGGGCGGCCT CCTCGACGCC GCCTCCACCG CCCTGCTGAT GCTGTTCGCC GAGGCGCGCG GGTACCCCAT GTTCGCGGCG CTGTTCGGCT ACGGGCTCGC CTGGATCCAC CTGCGCCGCA CCGCCGAGGG GCGCCCGGAG CCGTGGGTGC GCTCACTGGT GCGCCGACGC GGCCGGTGGA TGGTGCTGAT CGGGCTCCTG CACACGGTCC TGCTCTTCTT CGGCGACATC GTCGCGGTCT ACGGGCTCAT CGCCCTGATG TTCGCGGGTC TGCTGCACGC CGGTGACCGG CGCCTGCTCG CCCACGCCCT CACCTGGGCC TCCCTGGGCT CGCTGTTCTA CGCGGTCATG AACAGCCTCC CGCTCTCCGA CCCCGAGGCG GGCGGGGTCT TCACCCAGGA CCCGCTGGCC GACATGGTCA CACGCCTGTT CACCTGGCCG GTCATGACCC CGATGCTCGT GGCGAGTTCG GTGTTCCCCT TCCTCGTCGG AGTCTGGGCG GCCCGCCGCC GCCTGATGGA ACAGCCCGAG CTCCACCTCG GGCTGCTGCG CCGCGTCGCC TTCATCGGTA TCCCGGTGGC CGTGGTCGGC GGACTGCCGC AGGCCCTGGT GGCCACCGAG CTGTGGGTGC CCGAGTTCGT GCTCAAGGCC GGTGTGGGCT GGCTGCACGT TCTGACCGGC TACGCGGGCG GCTTCGGCTA CGCCGCGCTC ATCGCGCTGG TCGCGGTCCG CCTCGGTGAC CGCAGAGGGC CCCTCGTGCG GGCCCTGGTC GCCACCGGCC AGCGCTCCAT GACCTGCTAC CTGTTGCAGT CGGTGGGCTG GATCGTGCTG TTCGCGCCCT ACGCCCTCGA CCTTGGGCCA CAGCTGTCCG GGACCGCCGC CGTCCTGCTC GGCGTCGCCG TCTGGTTGGC CACCGTGGTC CTGGCCGACG TCATGCGTCG CGTGGGAGTC CGGGGTCCGG CCGAACGCCT CCTGCGCTGG GGAACCTACA GAACCGTGCG AGCGGAGCAG CCGGAGGCCG AACCAGTGCG GAGATGA
|
Protein sequence | MSLPPSPPPS TGGVALDRRS LAPDLARGVM LLVIAVVHAH IVWVMATAGQ SPASGGLLDA ASTALLMLFA EARGYPMFAA LFGYGLAWIH LRRTAEGRPE PWVRSLVRRR GRWMVLIGLL HTVLLFFGDI VAVYGLIALM FAGLLHAGDR RLLAHALTWA SLGSLFYAVM NSLPLSDPEA GGVFTQDPLA DMVTRLFTWP VMTPMLVASS VFPFLVGVWA ARRRLMEQPE LHLGLLRRVA FIGIPVAVVG GLPQALVATE LWVPEFVLKA GVGWLHVLTG YAGGFGYAAL IALVAVRLGD RRGPLVRALV ATGQRSMTCY LLQSVGWIVL FAPYALDLGP QLSGTAAVLL GVAVWLATVV LADVMRRVGV RGPAERLLRW GTYRTVRAEQ PEAEPVRR
|
| |