Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4916 |
Symbol | |
ID | 9248803 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | + |
Start bp | 49339 |
End bp | 50544 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | protein of unknown function DUF418 |
Protein accession | YP_003682805 |
Protein GI | 297563832 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.511749 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.649012 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAGCA CGACACCGCG AAGCACGGCC ACGGCCCGCC CGACCACCCG GCTGCCCCTG CTCGACGTCC TGCGCGGCGT CGCCATCATG GGCACCCTGG GCACCAACGT GTGGCTGTTC GCCGCACCGG GCGCCGAGGC GGGCATCATC TTCGTCCCGG ACTCGATCGG CCCGATGGCG GCCTTCCGCG AGGACCCGTC GGCGGCGACC CTGGCGCAGG GCCTGTTCGC CTTCGCCGCC AACGGCAAGT TCCTGGCCCT GCTCACCCTG CTCTTCGGTG TGGGGCTGGC CATCCAGTTC CGCTCCGCGG CCAGGCGCGG CGGGCGCTGG CCCGGCCCCT ACAAGTGGCG GGCGCTGTTC CTGTTCGCCG AGGGGCTCGT GCACTTCACG CTGGTCTTCG CCGCGGACGT GCTCATGGGC TACGCGGCGG CCTCGATCGT GGTGGCCTGG CTGCTCACCC GGTCGGAACG GGCGCAGAAC GCCGTGATGT GGACGTGCGC CGCCCTCCAC CTGGCCTTCG TCGGCCTGGT CACGGCCGCG CTCCTGTCCG AACCGGATCC GCAGGGCGCC GAGGTCCCGC CCGAGGCCGT GGAACTGTAC GCCGAAGGCG GTTACCTGGA GCAGGTCGCC TTCCGGCTCG ACAACGCGCT GCTGTTCCGG GCCGAACCGG TCGTCACCTT CGCCCTGATG GTGTTCATGT TCCTGCTGGG CGTACGGCTC CTCCGCGCCG GGGCCTTCGG CGGCGACGAC ACCGGCCGCC GGATCCGGGG CCGCCTGCTC GCCTGGGGGC TGGGGATCGG GCTGCCGGTC AACCTGGCCA CGTCCCTGGC CGGACCCGAC CTGTTCATGC TGGACCGCTA CGCCGCCGCG CCCGTCGTGG CGCTCGGCCT GGTCGGCGCG GTCGGCTGGG TCGTGGACCG GGCGAACCCC GCCGGACGGG GGATCACCGC CGTGTCCTCC CTGGGGCGGA TGGCGATGAG CGGCTACGTC GCGCAGAACG TCGTCTGCAT GCTGGTCTGC TACGGCTTCG GCCTGGGGCT GGCCGCACGC CTGGCCGACA CCGGCCCGTG GTGGGTGATG GGCCTGTGGG CGTCGGTGTG CGCCCTGCTC CTGACCGTCT CGACGCTCTG GCTGCGCCGC TTCCGGGCGG GGCCGCTGGA GGCGTTGCAG AAGGCCGTGC TGGCGCGGGT GCCCGAACGC CGCTGA
|
Protein sequence | MTSTTPRSTA TARPTTRLPL LDVLRGVAIM GTLGTNVWLF AAPGAEAGII FVPDSIGPMA AFREDPSAAT LAQGLFAFAA NGKFLALLTL LFGVGLAIQF RSAARRGGRW PGPYKWRALF LFAEGLVHFT LVFAADVLMG YAAASIVVAW LLTRSERAQN AVMWTCAALH LAFVGLVTAA LLSEPDPQGA EVPPEAVELY AEGGYLEQVA FRLDNALLFR AEPVVTFALM VFMFLLGVRL LRAGAFGGDD TGRRIRGRLL AWGLGIGLPV NLATSLAGPD LFMLDRYAAA PVVALGLVGA VGWVVDRANP AGRGITAVSS LGRMAMSGYV AQNVVCMLVC YGFGLGLAAR LADTGPWWVM GLWASVCALL LTVSTLWLRR FRAGPLEALQ KAVLARVPER R
|
| |