Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3011 |
Symbol | |
ID | 9246864 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3594796 |
End bp | 3595776 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | protein of unknown function DUF199 |
Protein accession | YP_003680927 |
Protein GI | 297561953 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0646248 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGATGA CCGGCGTGGT GAAGGATGAG TTGAGCCGGC TGACCATTCT CAAGCCATGC TGCCGCAAAG CCGAGGTGTC CACCATCCTG CGGTTCACCG GGGGCCTGCA CCTGGTGGGC GGGCGCATCG TGATCGAGGC CGAACTCGAC ACGGGCGCGG CCGCGCGCCG TCTGCGCAAG GACATCTCCG AGGTCTTCGG CCACGAGTCC GAGGTGGTCG TGCTGTCCCC GAGCGGCCTG CGCAAGGGCA ACCGGTACGT GGTCCGGGTG ATCAAGGACG GCGAGTCGCT GGCCCGGCAG ACGGGGCTGG TGGACAACAA CGGCCGCCCG GTGCGCGGCC TGCCCCGGCA CGTGGTGGCC GGCGGAGCCT GCGACGCCGA GTCGGCCTGG CGGGGCGCGT TCATCGCGCA CGGCTCCCTG ACCGAGCCGG GCCGGTCGAT GTCGCTGGAG GTGACCTGCC CCGGACCGGA GGCCGCGCTG GCGCTGGTGG GCGCCGCACG CCGGCTCAAG GTGCACGCCA AGGCGCGCGA GGTGCGCGGC GTGGACCGGG TGGTGGTCCG CGACGGCGAC TCCATCGGCG CGCTGCTGAC CCTGCTGGGC GCGCACCAGA GCGTGCTCGC CTGGGAGGAG CGGCGGATGC GCCGGGAGGT GCGCGCCACC GCCAACCGGC TGGCCAACTT CGACGACGCG AACCTGCGGC GCAGCGCGCG GGCGGCGGTG GCCGCGGGCG CGCGCGTGGA GCGGGCCCTG GAGATCCTGG GCGAGGACGC GCCCAAGCAC CTGGTGGCCG CGGGGCAGCT GCGCCTGGCG CACAAGCAGG CCTCCCTGGA GGAGCTGGGA CAGCTGTCCG TCCCGCCGCT GACCAAGGAC GCCATCGCCG GACGTATCCG CAGGCTGCTG GCGATGGCCG ACAAGCGCGC GAGCGACCTG GGCATCGAGG GCACCGAGGC CAACCTGACC CCGGACATGC TGGTCCCCTG A
|
Protein sequence | MAMTGVVKDE LSRLTILKPC CRKAEVSTIL RFTGGLHLVG GRIVIEAELD TGAAARRLRK DISEVFGHES EVVVLSPSGL RKGNRYVVRV IKDGESLARQ TGLVDNNGRP VRGLPRHVVA GGACDAESAW RGAFIAHGSL TEPGRSMSLE VTCPGPEAAL ALVGAARRLK VHAKAREVRG VDRVVVRDGD SIGALLTLLG AHQSVLAWEE RRMRREVRAT ANRLANFDDA NLRRSARAAV AAGARVERAL EILGEDAPKH LVAAGQLRLA HKQASLEELG QLSVPPLTKD AIAGRIRRLL AMADKRASDL GIEGTEANLT PDMLVP
|
| |