Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0098 |
Symbol | |
ID | 9243929 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 122232 |
End bp | 124265 |
Gene Length | 2034 bp |
Protein Length | 677 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | protein of unknown function DUF255 |
Protein accession | YP_003678055 |
Protein GI | 297559081 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAACC GCCTGAGCGA CGCGACCAGC CCGTACCTGT TGCAGCACGC CGACAACCCC GTGGAGTGGT GGCCCTGGGG TGAGGAGGCC CTCGCCGAGG CGCGTCGGCG CGACGTGCCC CTCCTCGTCT CCGTCGGCTA CGCCGCCTGC CACTGGTGCC ACGTGATGGC CCACGAGTCC TTCGAGGACG AGGCGACCGC GGCCCTGATG AACAGCCTGT TCGTCAACGT CAAGGTGGAC CGCGAGGAGC GCCCCGACGT CGACGCCGTG TACATGGAGG CGACCCAGGC CATGACCGGC CAGGGGGGCT GGCCCATGAC CGTGTTCGCC ACCCCCGACG GCGCCCCGTT CTACTGCGGC ACCTACTTCC CGCGCGAGCA CTTCCAGCGC CTGCTGCGGG GCGTGGCCGA CGCCTGGCGG GACCAGCGCA CCGAACTGGT CGGCCAGGGC GCGCGCGTGG TGGAGGCGCT GAGCGGCCCG CGCACCCTGG CCGCCGCGCC CCCGCCCTCC GCGGACCGGC TCGACCTGGC CGTCCGCGCG CTGGTGCGCG ACTACGACAG CGCCCACGGC GGTTTCGGCA CCGCGCCCAA GTTCCCGCCG TCGATGCTGC TCTCCTTCCT CACCGCCCAG GACGAGCGCA CCCGGCCCCT GCAGAGCGCG GACGAGTCCA CGCCCGCCTG GCTCATGGCC AGCGGCACCG CCCTGGCCAT GGCGCAGGGC GGCATGTACG ACCAGCTCGG CGGCGGTTTC GCCCGATACT CGGTGGACCG CGAGTGGACC GTGCCGCACT TCGAGAAGAT GCTGTACGAC AACGCCCTGC TGCTGCGCGC CTACGCCCGG ATGGGCCGCC GCCCCTCGGG TCCGGGGGTC TCCGACGCCG CCACCCACGC CCTGCTGCGC CGGGTCGCCG GGGAGACCGC CGACTGGATG CTGCGCGACC TGCGCACGCC CGAGGGCGGG TTCGCCTCGG CGCTGGACGC CGACAGCGAG GGCGAGGAGG GCACCTACTA CGTGTGGACG CCCGCCCAGC TGCGGGAGGT CCTGGGCGAG GAGGACGCCG CCTTCGCCGC CGAGGTGTTC GGCGTGACCG AGGAGGGCAC CTTCGAGCGC GGCGCCTCCG TGCTCCAGCT GCCCGCCCCG CCCGCCGACG CCTGGCGCTA CCAGCGGGTC CGTGAGGCCC TGCTGGCGGC CCGCGCCGAA CGGGTCGCCC CCGCGCGCGA CGACAAGGTG GTGGCCGCCT GGAACGGCCT GGCGGTCGCC GCGCTGGCCG AGGCCGGGGT GCTGCTGGAG CGGCCCGACC TGGTGGAGGC CGCCCGCGCG GCCGCTGACC TGCTGCTGCG CGTGCACCTG CGGGACGGGC GCCTGGTCCG CACCTCCCGG GACGGGCGCG CGGGCACCAG CGCCGGGGTG CTGGAGGACT ACGCCGACGT CGCCGAGGGG CTGCTCGTCC TGCACGGTGT GACCGGGGAG GCGCGCTACG CGCACGAGGC CGGGCGCCTG CTGGACACCG TCCTGGAGCG CTTCGGAGAC GGCTCCGGCG GGTTCTACGA CACCGCCGAC GACGCCGAGC GCCTCTTCAA CCGGCCCCAG GACCCCACCG ACAACGTCAC ACCGTCCGGC CGGTCGGCGG CGGCGTCCGC GCTGCTCTCC TACGCCGCGC TGACCGGATC CGAGCGCCAC CGCACGGCCG CTGAGGAGGC GCTGTCCCCG GTGGCGGTGC TGGCGGAGAA GGCCGCCCGG TTCGCCGGGT GGGGCCTGGC CACCGGCGAG GCCCTCCTGA CCGGGCCGCG CGCCGTGGCG GTGGTGGGCG ACCCCGACGA CCCGAGGACC GCGGAGCTGG TGCACGCCGC GCTGGTCTGG GCGCCGCTGG GCACCGTGCT CTCACGCGGC GACGGCCGCG ACGACGGAGG GGTGCCGCTG CTGCGCGACC GCGCGCCGGT GGGCGGGCGA CCGACCGCCT ACGTGTGCGA GGGCTTCGTC TGCAAGCTCC CGGTCACCTC GCCCGAGGAC CTGCGGGAGC AGCTGCTGGC CTGA
|
Protein sequence | MSNRLSDATS PYLLQHADNP VEWWPWGEEA LAEARRRDVP LLVSVGYAAC HWCHVMAHES FEDEATAALM NSLFVNVKVD REERPDVDAV YMEATQAMTG QGGWPMTVFA TPDGAPFYCG TYFPREHFQR LLRGVADAWR DQRTELVGQG ARVVEALSGP RTLAAAPPPS ADRLDLAVRA LVRDYDSAHG GFGTAPKFPP SMLLSFLTAQ DERTRPLQSA DESTPAWLMA SGTALAMAQG GMYDQLGGGF ARYSVDREWT VPHFEKMLYD NALLLRAYAR MGRRPSGPGV SDAATHALLR RVAGETADWM LRDLRTPEGG FASALDADSE GEEGTYYVWT PAQLREVLGE EDAAFAAEVF GVTEEGTFER GASVLQLPAP PADAWRYQRV REALLAARAE RVAPARDDKV VAAWNGLAVA ALAEAGVLLE RPDLVEAARA AADLLLRVHL RDGRLVRTSR DGRAGTSAGV LEDYADVAEG LLVLHGVTGE ARYAHEAGRL LDTVLERFGD GSGGFYDTAD DAERLFNRPQ DPTDNVTPSG RSAAASALLS YAALTGSERH RTAAEEALSP VAVLAEKAAR FAGWGLATGE ALLTGPRAVA VVGDPDDPRT AELVHAALVW APLGTVLSRG DGRDDGGVPL LRDRAPVGGR PTAYVCEGFV CKLPVTSPED LREQLLA
|
| |