Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1941 |
Symbol | |
ID | 9245791 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2364733 |
End bp | 2366115 |
Gene Length | 1383 bp |
Protein Length | 460 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | protein of unknown function DUF21 |
Protein accession | YP_003679874 |
Protein GI | 297560900 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.164047 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0989418 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAGCA CCATCCTCAG TATCGCCTTC GGGATCGTGG TGGTCTTCCT GATCACCGTG GCGACGGGTT ACTTCGTCGC CCAGGAGTTC GGTTACATGG CCGTGGACCG CTCGCGACTG CGGGCGAAGG CGGCGGCCGG GGACGCCGGG GCCCAGAGGG CCCTGGGCAT CACCAACCGG ACGTCCTTCA TGCTCTCCGG TGCCCAGCTG GGCATCACCG TGACCGCCCT GCTGGTGGGT TACGTGGCCG AGCCCATGAT CGGCGAGGGC GTCGGTGAGC TGCTCGGCCT CGCCGGAATC CCCACGGGGA CGGGCGTGGC CATCGGCACC GTCCTGGCCC TGCTCTTCTC CACCGTCGTG CAGATGGTCT TCGGAGAGCT CTTCCCCAAG AACCTGGCCA TCGCCCGGCC CGAGCCGGTC GCGCGCTGGC TGGCGCTGTC CACGGGGATC TACCTGAAGA TCTTCGGCCC GGTCATCTGG CTGTTCGACC AGGCGGCGAT CCTGCTGCTC AAGGCGGTGC GGATCGAACC GGTCGAGGAC GTCCAGCACG CGGCGACCGC GCGCGACCTG GAGAGCATCA TCGCCGAGTC CAAGGCCAGC GGCGACCTGC CGCCGGAGCT GTCCACCCTG CTGGACCGCA CCCTGGACTT CCACGAGCGC ACCGCCGGAC ACGCGATGAT CCCGCGTCCG GAGGTGGCCA CGGTGGAGGA GGGCGACCCG GTCAGCCGGG TCGTGGAGCT GATGGCCTCC GACCACTCGC GCTTCCCGGT GCTGGGTGAC GGCGTGGACG ACATCGTCGG CGTCATCTGC CTGCGCGACG TGCTCGCGCT GGGCGACCGG GACCTGGCCA ACACCAAGGT CAGCGAGGTG GCGCGGCCCA CCGTGATGCT TCCCGCGTCG CTGCCGCTGC CCTCCGCGCT GAGCCAGCTG CGCGAGGCGG GCGAGGAGTT CGCCTGCGTG GTGGACGAGT ACGGCGGCCT GGCCGGGGTC ATCACGACCG AGGACCTGGC CGAGGAGCTG GTCGGCGAGA TCGCCGACGA GCACAGCCCC GCGGAGGAGT CCCCCTCCTA CCTGGAGGGC GAGGGCTCCT ACCTGGTCCC GGGCGCTCTG CACATCGACG AGGTCGAGCG GCTGCTCGGC CACGACCTGC CCGAGGGGGA CTACGAGACC CTGGGCGGTC TGGTGGTGCA CGAGCTGCAC CGGCTCCCCG AGGCCGGTGA CAGGGTCTCC ATCGCCCTGC CCCGACCGCC CAGCGCGCAC GACGAGGACC CCGACATGGG GCTGACCATG GTCGTCAGCG CGGTGCAGAG GCACGTTCCG CACACCGTGG AGCTGCGTCT GCACGAGGTG CAGGGCAACG AGTCGCAGGA GGCGTCGGCA TGA
|
Protein sequence | MMSTILSIAF GIVVVFLITV ATGYFVAQEF GYMAVDRSRL RAKAAAGDAG AQRALGITNR TSFMLSGAQL GITVTALLVG YVAEPMIGEG VGELLGLAGI PTGTGVAIGT VLALLFSTVV QMVFGELFPK NLAIARPEPV ARWLALSTGI YLKIFGPVIW LFDQAAILLL KAVRIEPVED VQHAATARDL ESIIAESKAS GDLPPELSTL LDRTLDFHER TAGHAMIPRP EVATVEEGDP VSRVVELMAS DHSRFPVLGD GVDDIVGVIC LRDVLALGDR DLANTKVSEV ARPTVMLPAS LPLPSALSQL REAGEEFACV VDEYGGLAGV ITTEDLAEEL VGEIADEHSP AEESPSYLEG EGSYLVPGAL HIDEVERLLG HDLPEGDYET LGGLVVHELH RLPEAGDRVS IALPRPPSAH DEDPDMGLTM VVSAVQRHVP HTVELRLHEV QGNESQEASA
|
| |