Gene Information |
Locus tag | Ndas_3111 |
Symbol | |
ID | 9246967 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 3725762 |
End bp | 3727093 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | protein of unknown function DUF418 |
Protein accession | YP_003681026 |
Protein GI | 297562052 |
COG category | |
COG ID | |
TIGRFAM ID | |
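The length fields above are consistent with the coordinates. A minimal sketch of that arithmetic, assuming Start bp and End bp are 1-based inclusive positions and that the unspliced CDS ends in a stop codon (the function names are illustrative and not part of any database API):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the final codon is the stop, not a residue


length = gene_length_bp(3725762, 3727093)
print(length, protein_length_aa(length))  # 1332 443
```

Both values match the Gene Length and Protein Length rows of this record.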
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.414343 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGGGC ACACCGTTTC CCGCGGTCCG GTCGGCGCGG GGGAGAGAGC CCTCGCACCG GACCTGGCGC GCGGGTTCAT GCTGCTGTTC ATCGCGCTCG CCAACACGGC CTGGTACCTG TGGGCGCTCC CGTCCAGCGG GGGCGGCGTC CACCCCGAGC CGGGCGGCGT ACTGGACCGG ATCGCGCAGT TCGTCATCGT CACGGCGGTG GACATGCGCA GCTATCCGAT GTTCGCCTTC CTGTTCGGGT ACGGCATGGT GCAGCTGGCC CGGCGCCAGG AGGCGGCGGG CTCCTCCGCG CGGGAGGTGA ACGCGCTGCT GCGCCGCCGC AACCTGTGGC TGCTCGCGTT CGGTTTCGTC CACGCCCTGC TGCTGTGGAT GGGCGACGTC CTGGGCGCCT ACGGCCTGGC AGGGCTGCTC CTGGGCTGGC TGTTCCTGCG ACGCAGGGAC GCCACGCTGC TGGTGTGGTC GGGCGTGTTC ACCGGGCTGG CCGCCCTGCT GGCGGCGTTC AGCCTGCTGG GTCTGGCCTC CATGCCCGCG GAGGCCTCCT ACTCCTCCTC GGCCGCGTTC AGCACGGACC TGATGGCCGA CAACATCGGC GACACCTCGA TCCTGGGCGC CGCGCTGGCC CGCGTCCTGG ACTGGCCGCT GGTCACGCTG GGCCAGGGGC TGCTGGGCAT GGTCGTGCCC GCGGCGATCC TGCTCGGCTA CTGGGCCGCG CGCCGCCGGA TCCTGGAGGA GCCGGGCGGG CACCTGGGCC TGCTGCGCTG GACCGCGGCC CTGGGCATCG GCGTGGGCTG GCTGGGCGGG CTGCCCCTGG CCCTGACCCA GATCGGGGTC TGGGAGCTGT CCCCCGCGCA GGCGGCCATG CTCACCATGC CGCACATGGT GACCGGGCTG GCCTGCGGCC TGGGCTACGT GGCGCTGATC GCGCTGGCGG CGCACCGGAT CCAGGGGCGC GGTCGCGCAC CGGGCGCGGT GGTCGGCGCG CTGTCGGCGA CCGGCAAGCG CTCGCTGTCG GCCTACCTGG CCCAGTCGGT GCTGTGCGCG CCCCTCCTGG CGGCCTGGGG CCTGGGACTG GGCGGCGAAC TCGCCTCGTG GTCGATGGCC CTGTTCGCGG TGGGCGTGTG GCTGGTGACG GTGGCGGCGT CCTACGCCCT GGAGCGCGCG GGCAGGCGCG GACCGGCCGA GGTGCTGCTG CGCCGCCTGG CCTACCGCCG CCCGGTCGCG CGGGGCGCTC GGGAGTCGGA GGAGCCCGGC AGCGCGTCGG TTCGCACGGT CGGGCGCGGA CGGGCACGGG GACCGGCGGA CGGAGAGCGT CCCGCGCCCT GA
|
Protein sequence | MSGHTVSRGP VGAGERALAP DLARGFMLLF IALANTAWYL WALPSSGGGV HPEPGGVLDR IAQFVIVTAV DMRSYPMFAF LFGYGMVQLA RRQEAAGSSA REVNALLRRR NLWLLAFGFV HALLLWMGDV LGAYGLAGLL LGWLFLRRRD ATLLVWSGVF TGLAALLAAF SLLGLASMPA EASYSSSAAF STDLMADNIG DTSILGAALA RVLDWPLVTL GQGLLGMVVP AAILLGYWAA RRRILEEPGG HLGLLRWTAA LGIGVGWLGG LPLALTQIGV WELSPAQAAM LTMPHMVTGL ACGLGYVALI ALAAHRIQGR GRAPGAVVGA LSATGKRSLS AYLAQSVLCA PLLAAWGLGL GGELASWSMA LFAVGVWLVT VAASYALERA GRRGPAEVLL RRLAYRRPVA RGARESEEPG SASVRTVGRG RARGPADGER PAP
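The sequences above are printed in blocks of 10 characters, so the spaces need to be stripped before any analysis. A minimal sketch of how the GC content and the protein sequence could be recomputed from the gene sequence: the helper names are hypothetical, and the codon table is the standard one, whose codon-to-amino-acid assignments translation table 11 shares. Alternative start codons are not handled; this gene starts with ATG, so that does not matter here.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 uses the same assignments
# and differs only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block-of-10 spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
prefix = "ATGTCTGGGC ACACCGTTTC CCGCGGTCCG"
print(translate(prefix))  # MSGHTVSRGP
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.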