Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2986 |
Symbol | |
ID | 9246839 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3567469 |
End bp | 3569094 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 80% |
IMG OID | |
Product | protein of unknown function DUF324 |
Protein accession | YP_003680902 |
Protein GI | 297561928 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0321062 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.494788 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGGCG AGCCAGCCCC GCCCGGGCTG CTGTGGGAGG TCACCCTGCG GCTGTGCCTG CTCTCCGACA CCCACGTCGG GGCGGCCCGG GCCAGGCCCC GCCACGCCGC CGGAAGCGAC GCGGACCTGC ACGTGGACCG CGACCCGGTC ACCGGCGCGC CCCGTCTGCG CGCCACCACC CTGGCCGGGC TGCTGCGCCA CGAACTCGCC GCCCGCACGG GCGACCCCGA CGACGTCCGC GCCCTCATGG GCTCGGCGGA GTCCGAGGCC GCGCCGGGCG GGGAGGGGGC CGCCGCGAGC GCGCTGGACG TGGACGACGC CCGCGCCGAA CTCCCCGAGG ACACAGCGGT AGCGGTCCGC ACCGGCATCC GGGTCGACCC GGCCGCGGGC ACCGTCCAGC CGGGCCGCAC GTGGCGGTGG GAGATCCTGC CCGCGGGTAC CGTGTTCACC GCCCACCTGC GCCTGCACGT GCCCGCACCG GCCGACGAGG CCCGCCTGCT CACCCTGCTG CTCCTGGCCT GCGACGGACT CTCCGGGCCC GGCGGGTCCG GCCCCGGCAT ACGCGTGGGC GGGCGCACCG GCCGCGGCTA CGGCGCCGTC CGCGCCACCC ACTGGTCGGC GCGCCGCCAC GACCTCACCG ACGAGCGCGG CTGGCTCGCC TACCACGCGC GCTCCTGGGC CCGGCGCTGG GAGGAGGGCG CCGACGCGCT CGCCGACGCC CCCGCCGACC TGGCCGCCGC CCTGACCGGG GCCCTGCGCG CCTGCGGGCG CGGGGCCACC GCCGCCCACG CGCTCGCCCG CGCGCACCGG CCCGACCGCC GCCACCGCGC CGAACTCCAC CTGACCCTGG CCGTCGCCGA GCAGCCCGAT CCCACCGCCC CGCCGCCGCG CGACCCCAGG CCCGGCCTGC TCATGGTCGG CGACGCCCCC GCCCCCGAAC GCCTGGGCGA GGCGGACCGG GCACACCGCC ACCGCCCCGC CGTCACCGAC CCGGACAAGG CCGCCGTCCA CCCCGCCCCG GTCCTGGGCG ACACCGCGCT GTTCGCCCTG TTCAAGAGGA TCGGCGGCCG ACTGGTCCGG GACGCGGCCG AACACCTGGG CGCCGGGCCG GACCGGTGGC GCGACTGGCA CGACCACTGG TGGGGCGCCG ACACCGACCG GCGCGGCCTC CCGCGCCCCG CCCGGATCCG GCTGCGCACC GTCCCCGTGC TCACCGGCGG AGCACCCCTG ACCGCCACCC GGCTGACCGT GGACTCCCTC TTCGGCGACG CCGTGGACGG CCGCCTGTTC ACCACCGACC TGCACTGCGG CGGCAGCGCC GAGGCGGTCC TGGACGTGCG CGAGCCCGAC GACGCCGTCC GCGGTCTGCT CGCCCTCCTC GTGCGGGAGC TGGCCACGGT ACCCTTCGAC ACCCTCGGCG CGGGCGCCGG AACCGGGAAC GGGCGCCTGA CCGCCACCCG CGCCCTCCTG ACCACCCACC CCCCGGGCGG AGGGCCACCG GACACGGTGG ACCTGCTCAC CGCCCTGTTC GCACCCGACA GCGCCGACGC GGCCACCGCC CGCGGCTGGC TGGCCGCCCT GCACGCCGCG CTCGCCCCCG CGCCCACCAC GGAGGAGCCC AGGTGA
|
Protein sequence | MSGEPAPPGL LWEVTLRLCL LSDTHVGAAR ARPRHAAGSD ADLHVDRDPV TGAPRLRATT LAGLLRHELA ARTGDPDDVR ALMGSAESEA APGGEGAAAS ALDVDDARAE LPEDTAVAVR TGIRVDPAAG TVQPGRTWRW EILPAGTVFT AHLRLHVPAP ADEARLLTLL LLACDGLSGP GGSGPGIRVG GRTGRGYGAV RATHWSARRH DLTDERGWLA YHARSWARRW EEGADALADA PADLAAALTG ALRACGRGAT AAHALARAHR PDRRHRAELH LTLAVAEQPD PTAPPPRDPR PGLLMVGDAP APERLGEADR AHRHRPAVTD PDKAAVHPAP VLGDTALFAL FKRIGGRLVR DAAEHLGAGP DRWRDWHDHW WGADTDRRGL PRPARIRLRT VPVLTGGAPL TATRLTVDSL FGDAVDGRLF TTDLHCGGSA EAVLDVREPD DAVRGLLALL VRELATVPFD TLGAGAGTGN GRLTATRALL TTHPPGGGPP DTVDLLTALF APDSADAATA RGWLAALHAA LAPAPTTEEP R
|
| |