Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3289 |
Symbol | |
ID | 9247151 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 3924863 |
End bp | 3925948 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | protein of unknown function DUF201 |
Protein accession | YP_003681201 |
Protein GI | 297562227 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0509227 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCACAGC AGCCTGCGAG CAAACCAACC GTCGTGGTCA CGACAGCGGG GTCGGCGCCC ACGCCCGGGA CCATCCTCCA CCTGCGGGAA CAGGGTTTCC GCGTGGTGGC GACCGATGTG GACCCCGCCG CGCCGGGCCT GTACCTGGCC GACCGCGGGT ACCTGGTGCC GCCGGGCGAC AGCGAGGCCT TCCTGCCCAG GATGCGGACG CTGTGCGCGG ACGAGGGGGC CGTCGCGGTC ATCCCCCTGG TGGACGAGGA ACTCGTCAGG GTGGGCGAGC TCGCCAAGGA CGGCGTGGAG GTGCTGCTGC CCCGGCTGGA CTTCGTCACC ACCTGTCTGG ACAAGTACGT GCTGATGCGC GAGCTGGAGG ACGCCGGGAT CGGCGTGCCG CGCACCTGGC TGGCCTCGGA GTGGCCGAGC GGCGCCGCGG ATTCCGCGCC CGGCGGACTC ATCGTCAAAC CGCGCTGCGG ACGCGGGAGC CGCGGGGTGG TGGTCATCGA CTCCGTCCGC GACATGGCGC GGGTGGTGTC CGAGGGCGGC TACGCGGCGG ACGAGCTGAT CGTCCAGGAG CTGGTCGGCG GCCCCGAGTA CACGGTGTCC GTGGTGGTGT GGCGGGACGG CGGGGTCCAG GCCGTGGTGC CCAAGGAGGT CGTCCTCAAG CAGGGCGTGA CCAGGTACGC GGTGACCCGG CGCCACCGCG AGGTGGACCG CGCCTGCCGG GCGGTCCAGT CCGCTCTGCG CGCCGACGGC CCCTTCAACG TGCAGCTGTG CCTGGACGCG GACGGCAGGC CGAGGATCTT CGAGATCAAC CCGCGCTTCT CCTCCACGGC CTCGCTCACC GCGGCGGCCG GGATCGACGA GATCACGGGG CTGCTGCGGC AGGCCGTCGC GGACGGGCCC CGCCTGGAGG ACGACTGGCG CGAGGGGGTG GCGATGGTGC GGCGGTGGAC GGACGAGTTC GTCAGCGAGG CGGAGTTCAC CTCCCACGGC ATCTCGCCCG CCCCCGCGGT CGGCACGGAG CTGCCCCGGC AGCTGTCCGC GGGAGGCACG GGGCCGGTCC CGGTCGTTCC GGTGCGTGTG CGTTGA
|
Protein sequence | MPQQPASKPT VVVTTAGSAP TPGTILHLRE QGFRVVATDV DPAAPGLYLA DRGYLVPPGD SEAFLPRMRT LCADEGAVAV IPLVDEELVR VGELAKDGVE VLLPRLDFVT TCLDKYVLMR ELEDAGIGVP RTWLASEWPS GAADSAPGGL IVKPRCGRGS RGVVVIDSVR DMARVVSEGG YAADELIVQE LVGGPEYTVS VVVWRDGGVQ AVVPKEVVLK QGVTRYAVTR RHREVDRACR AVQSALRADG PFNVQLCLDA DGRPRIFEIN PRFSSTASLT AAAGIDEITG LLRQAVADGP RLEDDWREGV AMVRRWTDEF VSEAEFTSHG ISPAPAVGTE LPRQLSAGGT GPVPVVPVRV R
|
| |