Gene Information |
Locus tag | Ndas_1712 |
Symbol | |
ID | 9245562 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 2083404 |
End bp | 2084594 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | protein of unknown function DUF664 |
Protein accession | YP_003679647 |
Protein GI | 297560673 |
COG category | |
COG ID | |
TIGRFAM ID | |
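The length fields above follow from the coordinates by simple arithmetic. As a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon (the function names are hypothetical, chosen only for illustration):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(2083404, 2084594)
print(length, protein_length_aa(length))  # prints "1191 396"
```

Under those assumptions the result agrees with the Gene Length and Protein Length rows of this record.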
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.833609 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGACAG ACAAGAACAC CTCGCCCAGC CGGGAGTTCT GGGAACCCCG CTACCGGGGC GGGGACCCCT CGACGCCGCC GCCCGGCCCC AACGCGGCCT TCGCCCGCCT CGCCGGGGAA CTGGCCCTGG TGCCACCGCC GGAACCGGAG CGCGGGGCGG ACGCGCGCCG CGCCCTCGAA CTCGCCTGCG GCCGGGGCGG GGACGCCCTG TGGCTGGCGG GCCGGGGATG GGACGTCACG GCCGTCGACG TCGCGGAACA CGCCCTGGCC GTGCTGGCCG AGCGGGCCCG CCGGGCCGGG GTAGGGGACC GCCTCACCAC GCGGCGCCAC GACCTGGCGC TGTCGGTGCC CGACGCCGGA CCGTGGGACC TGGTCTACGC GAACTACTTC CACACCCCGG TGGACATCGA CCGTGACGCC GTGCTGCGCC GGGTCTCGCG GTCGGTGGGC GGGGGCGGGC TGCTGGTCGT GATCGACCAC GCGTCCAGTG CGCCCTGGTC CTGGGAACAG CGCGACGACT TCCCCGCCCC CGAGGAGCTG TGGCGGTCGC TGGACCTGGG CGCGGACTGG ACCGGCCTCG TGTGCGAGCG GCGTTCGCGG CTGGCCCACG GCCCCGACGG CCGCAGCGCG CGGGTGAGCG ACAACGTGGT GGTGGCCCGG CGCCGTACGG GGGCGACGCC GAAGGGTTCG ACGAGGACGG CCGATGCACC CTCGCGCCGA CGCGACCAGC CCCCGCCCGG GACGGGGTCC GCGGAGAAGG AGGTCCTCAC GGGGTTCCTG GCCTACCTGC GCGAGAGCGT CCTCGCCAAG CTGGACGGCG CGCCGGAGCA GCACGTGCGC ACTCCGGGCG TCGCGTCCGG CACGAACCTG CTCGGACTGG TCAAGCACCT GGCCCACGTC GAGCGCGCCC TCTTCCTCGG GGAGGAGGTC GGCGACTGGC AGGCCACGTT CCACGCCGAC ACCGGTGAGA CCACGGCTGG CGTTCTGGAG GGATACCGCG CGGCCGTGGC CGCCGCCGAC CGTGCCATCG CCGACTGCGA CGACCTCGGC GGGCCCGCGC ACGCGGGGCG CTGGAACGGA CCGGCCCCGT CGATGCGGTG GGCGCTCGTG CACATGATCG AGGAGACCGG CCGCCACGCC GGGCACCTGG ACATCCTCCG CGAACTGGTG GACGGCCGGA CCGGTCGCTG A
|
Protein sequence | METDKNTSPS REFWEPRYRG GDPSTPPPGP NAAFARLAGE LALVPPPEPE RGADARRALE LACGRGGDAL WLAGRGWDVT AVDVAEHALA VLAERARRAG VGDRLTTRRH DLALSVPDAG PWDLVYANYF HTPVDIDRDA VLRRVSRSVG GGGLLVVIDH ASSAPWSWEQ RDDFPAPEEL WRSLDLGADW TGLVCERRSR LAHGPDGRSA RVSDNVVVAR RRTGATPKGS TRTADAPSRR RDQPPPGTGS AEKEVLTGFL AYLRESVLAK LDGAPEQHVR TPGVASGTNL LGLVKHLAHV ERALFLGEEV GDWQATFHAD TGETTAGVLE GYRAAVAAAD RAIADCDDLG GPAHAGRWNG PAPSMRWALV HMIEETGRHA GHLDILRELV DGRTGR
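The relationship between the two sequences and the GC content field can be checked with a short script. This is a sketch under stated assumptions, not the database's own method: the helper names are hypothetical, the codon assignments are those of the standard genetic code (which translation table 11 shares for elongation), and alternative start codons are not handled, which does not matter here because the gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence; the trailing partial codon is ignored.
print(translate("ATGGAGACAG ACAAGAACAC"))  # prints "METDKN"
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison with the Protein sequence and GC content rows.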