Gene Information |
Locus tag | Ndas_2018 |
Symbol | |
ID | 9245868 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 2438549 |
End bp | 2439709 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | protein of unknown function DUF993 |
Protein accession | YP_003679950 |
Protein GI | 297560976 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
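The length fields above follow from the coordinates. A minimal sketch, assuming the start and end positions are 1-based and inclusive (which matches the reported 1161 bp) and that the CDS ends in a single stop codon that is not translated; the function names are hypothetical and not part of any database API:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(2438549, 2439709)
print(length)                     # 1161
print(protein_length_aa(length))  # 386
```

Both results agree with the Gene Length and Protein Length rows of the record.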
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.267016 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACAC TCCTGCTACC CGCCGCCGAC GGCACCGCCG CCCCCTACAC CCTGAGCGGC ACCCCCGTCG ACGCCTCCCC GCTTCCCCCC GCCCGCTCGC GTACGGCCTA CGCGGCGGCC CACGTGGTCG CCGACCCCCT GGCGGCCAAC GCCCCCGGCG CCCCGGCCCG CCTGGACTGG GAGGCCACCC TGGCCTTCCG CCACCACCTG TGGGACCAGG GCCTGGGCGT GGCCGACGCC ATGGACACCG CCCAGCGCGG CATGGGCCTG GACTGGGCGG CCACCGCCGA GCTGATCCGC AGGTCGGGTG CGGAGGCCGC CTCGCGCGGC GCCGCCCTGG CCTGCGGGGT GGGCACCGAC CAGCTCGACC TTTCGCAGGC CGATCTGGAA AGCGTTATCA CCGCGTACGG TGAGCAGCTC GACGTCGTCC AGGGCGCCGG AGCCACCCCC ATCCTCATGG CCTCGCGCGC CCTGGCCGCG GTCGCCTCGG GCCCCGACGA CTACGCCAAG GTCTACGCCG ACCTGCTGGG CCGGGCCGAC CAGCCCGTCA TCCTGCACTG GCTGGGCACC GCCTTCGACC CGGCCCTGGC CGGGTACTGG GGCTTCGCCG ACCCGGCCGA GGCGATCGAG CCGGTGGCGG CCCTGATCGC CGAGCACGCG GCGAAGGTGG ACGGCATCAA GGTCTCCCTG CTCGACGCCT CGCTGGAGGT GCGGCTGCGG CGGCTGCTGC CCGAGGGCGT GCGCCTGTAC ACCGGCGACG ACTTCAACTA CCCCGACCTC GTCCTGGGCG ACGAGCAGGG CCACTCCGAC GCCCTGCTGG GCGTCTTCGC CGCCATCGCC CCGGCCGCCG CCCGCGCGCT GGCCGCCCTG GACGAGGGCG ACACCGCCCG GTACCGGGCC CTGATGGACC CCACGGTCCC GCTGGCCCGG CACCTGTTCA CCGAGCCCAC CTTCTACTAC AAGACCGGCG TGGCCTTCCT GGCCTGGCTC AACGGCCACC AGAAGGGCTT CCACATGGTG GGCGGGCTGC ACAGCGCCCG CGACCTGCCC CACCTGGCCC AGGCGGTGCG CCTGGCCGAC GCCGCCGGGG CCCTGACCGA CCCCGACCTG GCCGCGGCAC GCATGCGGGC GCTCCTCCAG GTGTCAGGAG TGGACCAGTG A
|
Protein sequence | MSTLLLPAAD GTAAPYTLSG TPVDASPLPP ARSRTAYAAA HVVADPLAAN APGAPARLDW EATLAFRHHL WDQGLGVADA MDTAQRGMGL DWAATAELIR RSGAEAASRG AALACGVGTD QLDLSQADLE SVITAYGEQL DVVQGAGATP ILMASRALAA VASGPDDYAK VYADLLGRAD QPVILHWLGT AFDPALAGYW GFADPAEAIE PVAALIAEHA AKVDGIKVSL LDASLEVRLR RLLPEGVRLY TGDDFNYPDL VLGDEQGHSD ALLGVFAAIA PAAARALAAL DEGDTARYRA LMDPTVPLAR HLFTEPTFYY KTGVAFLAWL NGHQKGFHMV GGLHSARDLP HLAQAVRLAD AAGALTDPDL AAARMRALLQ VSGVDQ
|
| |
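The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. A minimal sketch of that cleanup and of a GC-fraction calculation follows; the helper names are hypothetical, and the GC calculation assumes only unambiguous A/C/G/T bases, as in this record:

```python
def join_blocks(blocked_sequence: str) -> str:
    """Remove all whitespace from a sequence printed in 10-character blocks."""
    return "".join(blocked_sequence.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (assumes A/C/G/T only)."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna.upper() if base in "GC")
    return gc_count / len(dna)


# The first two blocks of the gene sequence from the record, for illustration.
fragment = join_blocks("ATGAGCACAC TCCTGCTACC")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.55
```

Applying `gc_fraction` to the full joined gene sequence should give a value comparable to the GC content row, depending on how the database rounds it.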