Gene Information |
Locus tag | Ndas_0967 |
Symbol | |
ID | 9244812 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1186032 |
End bp | 1187027 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | GCN5-related N-acetyltransferase |
Protein accession | YP_003678917 |
Protein GI | 297559943 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
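The length fields above can be cross-checked from the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which is consistent with the reported gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1186032
end_bp = 1187027

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 996
print(protein_length)  # 331
```

Both results agree with the Gene Length (996 bp) and Protein Length (331 aa) fields.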
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.776214 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGGTT ATGTCGCACG GGTGCGGCAT GATGAGTCGC CCGGGGGCGA CGTCGGCCTG GACCCGTTGG TGAGTTCGGC CGCCCAGCGC TGGGCGGCCG TGGATTCGCT GTTGCCCGAA CCGGTCAACT TCGCTCCGGG CTCCTACCCC CTCCTGACCG TCAGCGACGA GGACGGCCGT GCCGTCGCCG CCGGAACCAT GCACTACACC TGGTACCAGC CGGGCGAGGT CGGCCGGATA TGGGGCGTGC CCGACCAGCA CTGGCTCACC CCCATCGTGG GCGGACCCGA CCCCGGCCGC GCCCTGGACT CGCTGCTGAC CAGCTGGCGC GACCAGCTGG AGGACCTGCC TACCGGCACC GGCTCGGAGT CGGCGGCCCT GGTGAGCTGG CCCGCCCGCG ATGTGTGCGG CATCATCCCG CTCCAGCGCC ACGGTCTGCA GCCCTACACC GTGCTGGCCG CCCGCCAGCG CCGCCGCGGC GTGCCCCCGT CGCTGCCGCC GCGCGACGTG ACCATCCGGC TGGCCGACCG CAGCGACCTG ACCCAGGTGG TCGGTCTGCT GATGGAGGAG CACCGCTACG AGCAGCACTT CGGCGGGGTG TTCCTGCAAC CCGACACCGC CGAGCAGACC CGTGAGGTGG CGGCCCGGGC GCTGAACCGC TCCCGGTCCT GGATCTGGCT GGCCGAGCGG CGCGGTCGCG CGGTGGGCCT GCTGTGGGTC TCCCCGCCCG AGCGTTCGCG CTGGGCCAAG TCGCTGGTCA ACGCGCGCCC GATCGCGCAC ATCGGCTACG GCGTGGTGAC CGCCGCGGAG CGCGGCAGCG GGATCGGTAC GGCGCTGGTG GGCCAGGCGC ACCAGGCGCT GGACAGCCAC GGAGTGGGCG TCTCCGTGCT CAACTACGCG GCGATGAACC CGCTGTCGGG GCCGTTCTGG CACCGCATGG GGTACCGCCC GGTGTGGACG ACCTGGGAGG TCCGCCCCGC GCTCGCCCTG CGCTGA
|
Protein sequence | MSGYVARVRH DESPGGDVGL DPLVSSAAQR WAAVDSLLPE PVNFAPGSYP LLTVSDEDGR AVAAGTMHYT WYQPGEVGRI WGVPDQHWLT PIVGGPDPGR ALDSLLTSWR DQLEDLPTGT GSESAALVSW PARDVCGIIP LQRHGLQPYT VLAARQRRRG VPPSLPPRDV TIRLADRSDL TQVVGLLMEE HRYEQHFGGV FLQPDTAEQT REVAARALNR SRSWIWLAER RGRAVGLLWV SPPERSRWAK SLVNARPIAH IGYGVVTAAE RGSGIGTALV GQAHQALDSH GVGVSVLNYA AMNPLSGPFW HRMGYRPVWT TWEVRPALAL R
|
| |
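The sequence fields are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The sketch below shows one way to strip the spacing, compute a GC fraction, and translate the gene sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one; translation table 11 uses the same amino-acid assignments and differs only in which start codons it permits, which this sketch does not model.

```python
# Build the codon table from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = "ATGAGCGGTT ATGTCGCACG GGTGCGGCAT"
print(translate(prefix))  # MSGYVARVRH
```

The translated prefix matches the first block of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` and `translate` should allow the GC content and protein sequence fields to be checked in the same way.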