Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1668 |
Symbol | |
ID | 9245518 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 2038455 |
End bp | 2039717 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | protein of unknown function DUF418 |
Protein accession | YP_003679603 |
Protein GI | 297560629 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.620816 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACCG ACCGCGCCGA AGGCCCGCCC TCCGTCCCCG GAGACCCGCC CGCAGCCGCC GCCGACACCC GGGCCCCGGC GGGTTCGACG CCGGTGGCCG AACGGGCCCT GGCCCCGGAC CTCGCCCGCG GCATGATGCT GCTGCTGATC GCGCTGGCCC ACGTGCCGTG GTTCCTGTAC CAGGCGCCCA CCGGCCTGGC CATGCTGCAC CCCGTCGACG GCAACCTGGC GGACAGGGCC GCGCAGTTCA TGACGATCGT CGTCGTGGAC GCCCGCACGC ACACGATGTT CGGCTTCCTC TTCGCCTACG GCATCGGGCA GATGTACCGC CGGCAGAGGG CGCGCGGCAC CGGTGAGAAG GAGGCCCGCG GGCTCCTGCG CAGGCGCCAC CTGTGGATGC TCGTCTTCGG CGCGGTCCAC GCCGCCCTGC TGTGGCAGGG CGACATCCTG GGCACCTACG GCCTGATCGG GCTCATCATG GTGCCGCTGT TCCTCAACCG CAGCGACCGC ACCCTCAAGA TCTGGCTGTC CGTCCTGCTG GCCCTGGGCG CCCTGGTCAC CGCCGTCTCG GCGGCCTCGG TCCTGCTGGC GCCGGACGCG GTGTCCACGG CCGCGGCGAC CGACATGCAA AGGGCCAGCA TCGCCGAGAC GAGCTACCTG CTCTCCGCCG TGTTCCGCCT CCCGGCCTGG TTCTTCGGGC TCTTCTCCGG CCTGTTCACC CTGGCCCTGC CGACGGTGTT CCTGATCGGC CTGCTCGCGG CGCGGCACCG GTTCCTGGAG GACCCGGCGC GACACCTGAC GCTGCTGCGC CGGGTCGCGG TCCTGGGCAT CGCCGTGGGG TGGGCCGCCG GAGCGGTGCT GGGCCTCCAG CACGTGGGCG TCCTGGACGC CACCCACATC TCGGCGGTCT CCTCGGTGCA CTTCTACACC GGGATCTTCA CCGGGGTGGG CTACGCCGCG CTCTTCGGGC TCCTCGCGCA CCGGCTCTCC GCCCGGGGGG CCCAGCGGTC CCTGCCGGTC CGGGCCCTGG TGTCCCTGGG GCGGCGCTCC CTGAGCGGCT ACCTGGCCCA GTCGGTGGCC TTCGCCCCGT TCCTGGCCGC CTGGGGCCTG GGCCTGGGCG TGCACCTGTC CAGCTGGTCG GCGGTGCTGG TGGCCGTGGG CACCTGGCTG CTGACCGTCG CGGCGGCGTT CCGGCTCGAC CGCGCGGGCA GGCGCGGACC GGCGGAGATC CTGCTGCGGA GGCTGACCTA CCGCAAGCCG TGA
|
Protein sequence | MTTDRAEGPP SVPGDPPAAA ADTRAPAGST PVAERALAPD LARGMMLLLI ALAHVPWFLY QAPTGLAMLH PVDGNLADRA AQFMTIVVVD ARTHTMFGFL FAYGIGQMYR RQRARGTGEK EARGLLRRRH LWMLVFGAVH AALLWQGDIL GTYGLIGLIM VPLFLNRSDR TLKIWLSVLL ALGALVTAVS AASVLLAPDA VSTAAATDMQ RASIAETSYL LSAVFRLPAW FFGLFSGLFT LALPTVFLIG LLAARHRFLE DPARHLTLLR RVAVLGIAVG WAAGAVLGLQ HVGVLDATHI SAVSSVHFYT GIFTGVGYAA LFGLLAHRLS ARGAQRSLPV RALVSLGRRS LSGYLAQSVA FAPFLAAWGL GLGVHLSSWS AVLVAVGTWL LTVAAAFRLD RAGRRGPAEI LLRRLTYRKP
|
| |