Gene Information |
Locus tag | Ndas_1223 |
Symbol | |
ID | 9245073 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1523929 |
End bp | 1525287 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | type II secretion system protein E |
Protein accession | YP_003679168 |
Protein GI | 297560194 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
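The length fields in the table above follow from the coordinates. The following is a minimal Python sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and used only for illustration.

```python
# Values copied from the Gene Information table above.
START_BP = 1523929
END_BP = 1525287


def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the final codon
    is a stop codon that does not contribute an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len, gene_len // 3 - 1


print(cds_lengths(START_BP, END_BP))  # (1359, 452)
```

Under these assumptions the result agrees with the reported gene length of 1359 bp and protein length of 452 aa.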
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00198137 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGCAGCG GACACGACGC CGGATACGAC ATCACCGACG GCTACTTCCT GGACACCGAC CGGGCCCGCA CCGTGGCCGA CCGCGTGGAG CGGACCGTCA CCGAGGCCAC CCGGCGGCTC AGCGAGGAGA CCCGCGGCCA ACCCGAGGAC ACGCCCGAGG AGTACCGCGC CCGCGCCGAA CGCGTCATCG CCCAGATCCT CGACGAGGAC GCGAGCCGCG CCCTGTCCGA GGGCAGGCAG GTGCTCGACG CCACTACCGA GGCCTCCGTC GCCGCTGGCG CCCTGGCCCG GGTGTGCGGG CTCGGACCCC TCCAGCCGCT CCTGGACGAT CCCGGGATCG AGAACATCAA CATCAACGGC GTGCACGTGT GGGTGCGCCG CGCCGACGGA AGCCGCGAGC GGCACGACTC CCTCTTCGAC GACCCCGACG AGGTCGTCGC CCTGGTCCGG CGCCTGGCGT CGGAGTCCGC CACGGGGGAG CGCCGCTTCG ACCCGGGCGC GCCCATCCTC GACATGCAGC TGCCCGGCGG CGAGCGCCTC AACGCGGTCA TGGAGGTCGC CCGGCAGCCC TCGGTGTCCA TCCGCCGCCA CCGCTACGGC CGCACCACCC TCGAACAGCT CCACGGACTG GGCACGATCG ACGACACCCT GGTCGCCCTG CTGCGCGCGG GGGTGCGCTC CCGCCGCAAC ATGGTCATCA CCGGCGGCAC CGGCGCGGGC AAGACCACCC TGCTGCGGGC CCTGGCCGCC GAGATCCCCG TGGACGAACG CCTCGTCACC ATCGAGGACG TCTTCGAACT CGGGCTGGAC CGCGACCGCG ACGCCCACCC CGACTGCGTC GCCCTCCAGG CCCGACCGGC CAACGTCGAG GGCGTCGGCG AGATCACCAT CGCCGACCTG GTGCGCACGG CCCTGCGCAT GTCGCCGGAC CGGGTCATCG TCGGTGAGAC GCGCGGCCAC GAGACCGTCC CGCTGCTCAA CGCCATGAGC CAGGGCAACG ACGGCAGCCT CACCACCCTG CACGCCGCCA ACTCCGCGGG CGCCTTCACC AAGCTGGGCG CCTACGCCGC CCAGTCCGCC GAGCGCCTGC CCCTGGACGC CACCGCGTCC CTGGTGGCCG CCGCCGTGCA CCTGGTCGTG CACGTGTCGG CGCTGCCCAC GGGCGGGCGC ATGGTCACGA GCGTGCGCGA GGTGGTGGGG GCCGAGGGGC AGAACGTGGT CTCCAACGAG ATCTACCGGC GCGACCGCAA CGGGCCCCTG CCCGCGGCGC CGCCCAGCCC CAACACGCTC GACGCCCTGG CCGAGGCCGG GTTCGACCCG GCGATGCTGA GCCCGGACAC AGTGGGGTGG GCACGGTGA
|
Protein sequence | MSSGHDAGYD ITDGYFLDTD RARTVADRVE RTVTEATRRL SEETRGQPED TPEEYRARAE RVIAQILDED ASRALSEGRQ VLDATTEASV AAGALARVCG LGPLQPLLDD PGIENINING VHVWVRRADG SRERHDSLFD DPDEVVALVR RLASESATGE RRFDPGAPIL DMQLPGGERL NAVMEVARQP SVSIRRHRYG RTTLEQLHGL GTIDDTLVAL LRAGVRSRRN MVITGGTGAG KTTLLRALAA EIPVDERLVT IEDVFELGLD RDRDAHPDCV ALQARPANVE GVGEITIADL VRTALRMSPD RVIVGETRGH ETVPLLNAMS QGNDGSLTTL HAANSAGAFT KLGAYAAQSA ERLPLDATAS LVAAAVHLVV HVSALPTGGR MVTSVREVVG AEGQNVVSNE IYRRDRNGPL PAAPPSPNTL DALAEAGFDP AMLSPDTVGW AR
|
| |
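The gene and protein sequences above can be cross-checked against each other. The following is a minimal Python sketch, assuming the sequences are pasted in as strings with the 10-character spacing shown in this record. NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons. This gene starts with ATG, so no special handling of alternative starts is included. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (first, second, third base each cycling T, C, A, G).
# Assumption: amino acid assignments of translation table 11, which match
# the standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGAGCAGCG GACACGACGC CGGATACGAC"))  # MSSGHDAGYD
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_fraction` on the same string can be compared with the reported GC content.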