Gene Information |
Locus tag | Ndas_5318 |
Symbol | |
ID | 9249218 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | + |
Start bp | 483724 |
End bp | 484902 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | peptidase S1 and S6 chymotrypsin/Hap |
Protein accession | YP_003683204 |
Protein GI | 297564231 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
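The length fields in the table above follow from the coordinates by simple arithmetic. The sketch below reproduces that arithmetic, assuming the Start bp and End bp values are 1-based and inclusive (an assumption, though one consistent with the reported 1179 bp) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Sketch only: recompute the length fields of the record from its coordinates.
# Assumption: Start bp / End bp are 1-based and inclusive.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature spanning start_bp..end_bp, both ends included."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values copied from the Gene Information table.
START_BP, END_BP = 483724, 484902

gene_len = gene_length_bp(START_BP, END_BP)
protein_len = protein_length_aa(gene_len)
print(gene_len, protein_len)  # 1179 392
```

Both results agree with the Gene Length and Protein Length rows of the record.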
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.262877 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTCGACG TGATCCTGGT CGTCCTGTTG CTGCTGTTCG CGGTGACGGG GTACCGACAG GGTTTCATCG TCGGTGTCTT CAGCTTCGCG GGCTTCGTCG GAGGGGGCGT CCTCGCCGCC CTGACGGCCC CGGCACCCAT CCAGGCGTGG GTGGAGGACC CCGGCAGGCA GGCGCTGCTG GCGATCGCCG TGGTGTTCCT GTCCGCGGCG CTCGGCCAGT TCCTGCTCTC CTACCTGGGC ACCTTCGTCC GCAACAAGGT GACGTGGGAC TCGGCGCGGG TCCTGGACGC CATCGGCGGC GCCCTGATCA GCGGGCTCTC GGTGCTGCTC GTGGCCTGGT TCATCGGCAG CACGGTGGCC AACTCGGCGC TGCCGTTCGT CGCGGGCCAG GTCAGGGACT CGCGCATCCT CCACTCGGTG GACACGCTGA TGCCCGAGGC CGCCCACAGC GGGTTCTCCA CGTTCCGCCG GATCGTGGAC CAGAGCGCCT TCCCGCAGGT GTTCAGCGGC CTGGGCACCG GTGAGCTGGC CGAGGTGGCG CCGCCGGACC CCGACGTGCT CACCACCCCG GAGCTGATCG AGTCGAGCCG CAGCGTGGTG AAGGTGCTGG GCACCGCGCC CTCGTGCCAG CGCCGCGTGG AGGGGACCGG CTTCGCCTAC GCGGAGGACC GGATCATGAC CAACGCGCAC GTGGTCGCCG GGGTCACCGA CGACCTGCGG GTGGTCACCC GGGAGGGCTA CCAGCTCGAC GCCACGCTGG TGCTCTTCGA CGCCCAGCAG GACCTGGCCG TGCTGCACGT GCCGGGCCTG GACCTGGAAC CGCTGGAGTT CACCTACGAG GCCCCGCAGG GCGGTGACGC GGTCGTGGCG GGCTTCCCGC GCAACAGCGG CTTCACGGCC GTCCCGGCGC GCGTTCGCGC CCGCCAGACG GCGCAGGGGC CGGACTTCTA CCACTCCCAG CAGGTGAGCC GGGAGATCTA CCAGGTGCGC GCCGTGGTGC GCCCGGGCAA CTCCGGCGGC CCGCTGCTGT CGCCGGACGG CACCGTGTAC GGGGTGGTCT TCGCCGCCGC CACGAACGAG CCCGAGACGG GTTACGTGCT CACCGCCGAC GAGGTCGCGG AGAACGCCCA GAGCGGCCTG GAGAACGACG AGCAGGTCTC CTCCCAGGCC TGCGACTGA
|
Protein sequence | MLDVILVVLL LLFAVTGYRQ GFIVGVFSFA GFVGGGVLAA LTAPAPIQAW VEDPGRQALL AIAVVFLSAA LGQFLLSYLG TFVRNKVTWD SARVLDAIGG ALISGLSVLL VAWFIGSTVA NSALPFVAGQ VRDSRILHSV DTLMPEAAHS GFSTFRRIVD QSAFPQVFSG LGTGELAEVA PPDPDVLTTP ELIESSRSVV KVLGTAPSCQ RRVEGTGFAY AEDRIMTNAH VVAGVTDDLR VVTREGYQLD ATLVLFDAQQ DLAVLHVPGL DLEPLEFTYE APQGGDAVVA GFPRNSGFTA VPARVRARQT AQGPDFYHSQ QVSREIYQVR AVVRPGNSGG PLLSPDGTVY GVVFAAATNE PETGYVLTAD EVAENAQSGL ENDEQVSSQA CD
|
| |
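The gene sequence begins with GTG while the protein sequence begins with M, which is what one would expect if GTG is acting as an initiator codon under translation table 11. The sketch below shows one way to check the two sequences against each other and to compute a GC fraction from the space-separated 10-base blocks. It is a minimal illustration, not the database's own pipeline: the helper names are hypothetical, the amino acid string and start codon set are those listed for NCBI translation table 11 and should be verified against NCBI before serious use, and only short fragments copied from the record are used as input.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G) paired with the table 11 amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiator codons listed for table 11; each is read as Met at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate an unspliced CDS fragment, dropping a terminal stop codon."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if codons[0] in START_CODONS:
        residues[0] = "M"  # an initiator codon codes for Met at position 1
    if residues[-1] == "*":
        residues.pop()  # the stop codon is not part of the protein
    return "".join(residues)


# First 30 and last 9 bases of the gene sequence, copied from the record.
head = "GTGCTCGACG TGATCCTGGT CGTCCTGTTG"
tail = "TGCGACTGA"

print(translate_cds(head))  # MLDVILVVLL
print(translate_cds(tail))  # CD
print(gc_fraction(head))
```

The two translated fragments match the first ten and last two residues of the protein sequence. Passing the full gene sequence to the same functions would allow the whole protein sequence and the GC content field to be checked in the same way.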