Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1567 |
Symbol | |
ID | 9245417 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 1917910 |
End bp | 1918986 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 79% |
IMG OID | |
Product | glycosyl transferase family 9 |
Protein accession | YP_003679502 |
Protein GI | 297560528 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.029243 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAACCGG TGGGAGAGAA CCCGGGCCCC GGCCACGGGG GCCCGGACGG GGGCCGGGTC CCCGGCACGG GAGGGCGGCC GACCCTGCTG GCGCTGCGGG CGCTGGGGCT GGGCGACTTC GCCACGGCCG TTCCCGCGCT GCGCGCGCTG GAGCGGGCGC TGCCGTCCTG GCGGCGCACC CTGGCGGGCC CCTCCTGGTA CCGGCACCTG GTCGCGCTGG CCGGGCTGGA CTGGGAGGTC CTGCCGACCG AGCCGCTGCG CGCGCCCGAC ACGCGCTCTC CGGACCTGGC GGTCAACCTG CACGGCCGGG GGCCGCAGAG CACCGCGGCC CTGGCGGCGC TGACGCCGGA GCGGCTGTGG ACGCACGGCC ACCCGTCCGC CCCGCAGTGG CCGGGCCCGG AGTGGCCCGA GGGGGTCCAC GACGCCGAGA TCTGGTGCCG CCTGCTGCTC GCGCACGGTG TGGCGGCCGA CCCGGACGAC CTGCGGTGGC CGGACCCGTC GCGGGGGCGG GGCGCGGTGG TCGAGGGGGA CACCGCGATC GTCCACCCGG GGGCCGCCTC CGGGTCGCGG CGCTGGCCCC CGGAGCGTTT CGCGCGCGTG GCGGGGGCCC TGGCGGCCTC GGGGCTGCGG GTGCTGGTGA CCGGCTCGCC GAACGAGACC GCGCTGGCCG AGCGGGTGGC CGAGGCCGCC GGGCTCGGCG GTCGGGCGGT GCTGGCCGGG CGCACCTCCC TGGACCTGCT GGCGCGGCTG GTCGGCGAGG CTCGGCTGGT GGTGTGCGGG GACACCGGGG TCGGTCACCT GGCCACGGCC TACGGCACGC CGTCGGTGCG CCTGTTCGGG CCGGTCTCCC CGCGGCTGTG GGGCCCGCGG GTGGACCGGG ACGTCCACGT GTGCCTGTGG GCGGGTCGCC TGGGCGATCC GCACGCCGCC GCACTGGACC CGGGACTGGA CGAGATCGGC GTGGAGGAGG TCGTCGCCGC GTGCCGGAGC GTGTGCGCCT CCGACCGCCC CACCGCCGAA CAGGCGGTCC CCGGCTCCCG AGCGCCGACC CCCTGCCCGA AAGCGTGGAC GATGTGA
|
Protein sequence | MEPVGENPGP GHGGPDGGRV PGTGGRPTLL ALRALGLGDF ATAVPALRAL ERALPSWRRT LAGPSWYRHL VALAGLDWEV LPTEPLRAPD TRSPDLAVNL HGRGPQSTAA LAALTPERLW THGHPSAPQW PGPEWPEGVH DAEIWCRLLL AHGVAADPDD LRWPDPSRGR GAVVEGDTAI VHPGAASGSR RWPPERFARV AGALAASGLR VLVTGSPNET ALAERVAEAA GLGGRAVLAG RTSLDLLARL VGEARLVVCG DTGVGHLATA YGTPSVRLFG PVSPRLWGPR VDRDVHVCLW AGRLGDPHAA ALDPGLDEIG VEEVVAACRS VCASDRPTAE QAVPGSRAPT PCPKAWTM
|
| |