Gene Information |
Locus tag | Ndas_1492 |
Symbol | |
ID | 9245342 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 1828999 |
End bp | 1830153 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | peptidase S1 and S6 chymotrypsin/Hap |
Protein accession | YP_003679428 |
Protein GI | 297560454 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
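The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(1828999, 1830153)
    print(length, "bp")
    print(protein_length(length), "aa")
```

Under those assumptions the record's values agree with each other: 1830153 - 1828999 + 1 = 1155 bp, and 1155 / 3 - 1 = 384 aa.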
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0362438 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACCCT CCAGCATCAT CTCCGCCGTG GGCACAGGAG CCCTGGCCTT CGGCATGGCA CTGGCCATGG CCCCCGGAGC CCTCGCGGCG CCGGCCCCCG TCCCCCAGAC CCCCGTCGCC GACGACAGCG CCGCCAGCAT GACCGAGGCG CTCAAGCGCG ACCTCGACCT CACATCGGCC GAGGCCGAGG AGCTGCTCTC GGCGCAGGAA GCCGCCATCG AGACCGACGC CGAGGCCACC GAGGCCGCGG GCGAGGCCTA CGGCGGTTCC CTGTTCGACA CCGAGACCCT CGAACTCACC GTGCTGGTCA CCGACGCCTC CGCCGTCGAG GCGGTCGAGG CCACCGGAGC CCAGGCCACC GTCGTCTCCC ACGGCACCGA GGGCCTGACC GAGGTCGTCG AGGACCTCAA CGGCGCCGAG GTTCCCGAGA GCGTCCTCGG CTGGTACCCG GACGTGGAGA GCGACACCGT CGTGGTCGAG GTGCTGGAGG GCTCCGACGC CGACGTCGCC GCCCTGCTCG CCGACGCCGG TGTGGACTCC TCCTCGGTCC GGGTGGAGGA GACCGAGGAG GCCCCGCAGG TCTACGCCGA CATCATCGGC GGCCTGGCCT ACTACATGGG CGGCCGCTGC TCCGTCGGCT TCGCCGCGAC CAACAGCGCC GGTCAGCCCG GTTTCGTCAC CGCCGGCCAC TGCGGCACCG TCGGCACCGG CGTGACCATC GGCAACGGCA CCGGCACCTT CCAGAACTCG GTCTTCCCCG GCAACGACGC CGCCTTCGTC CGCGGTACCT CCAACTTCAC CCTGACCAAC CTGGTCTCGC GCTACAACTC CGGCGGCTAC CAGTCGGTGA CCGGTACCAG CCAGGCCCCG GCCGGCTCGG CCGTGTGCCG CTCCGGCTCC ACCACCGGCT GGCACTGCGG CACCATCCAG GCCCGCAACC AGACCGTGCG CTACCCGCAG GGCACCGTCT ACTCGCTCAC CCGTACCAAC GTGTGCGCCG AGCCCGGTGA CTCCGGCGGT TCGTTCATCT CCGGCTCGCA GGCCCAGGGC GTCACCTCCG GCGGCTCCGG CAACTGCTCC GTCGGCGGCA CGACCTACTA CCAGGAGGTC ACCCCGATGA TCAACTCCTG GGGCGTCAGG ATCCGCACCA GCTGA
|
Protein sequence | MRPSSIISAV GTGALAFGMA LAMAPGALAA PAPVPQTPVA DDSAASMTEA LKRDLDLTSA EAEELLSAQE AAIETDAEAT EAAGEAYGGS LFDTETLELT VLVTDASAVE AVEATGAQAT VVSHGTEGLT EVVEDLNGAE VPESVLGWYP DVESDTVVVE VLEGSDADVA ALLADAGVDS SSVRVEETEE APQVYADIIG GLAYYMGGRC SVGFAATNSA GQPGFVTAGH CGTVGTGVTI GNGTGTFQNS VFPGNDAAFV RGTSNFTLTN LVSRYNSGGY QSVTGTSQAP AGSAVCRSGS TTGWHCGTIQ ARNQTVRYPQ GTVYSLTRTN VCAEPGDSGG SFISGSQAQG VTSGGSGNCS VGGTTYYQEV TPMINSWGVR IRTS
|
| |
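The sequences above are printed in blocks of ten characters, so the whitespace has to be removed before they are used. The sketch below is one way to do that and to run two simple sanity checks: GC percentage and whether the string looks like a complete CDS. All function names are hypothetical, and the start codon set is an assumed common subset for bacterial genes (translation table 11 permits others).

```python
# Assumed subset of start codons commonly seen in bacterial CDSs.
START_CODONS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def looks_like_complete_cds(raw: str) -> bool:
    """True if length is a multiple of 3 and the ends are start/stop codons."""
    seq = clean_sequence(raw)
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


if __name__ == "__main__":
    toy = "ATGAAACCCG GGTGA"  # toy example, not the record's sequence
    print(len(clean_sequence(toy)), gc_percent(toy), looks_like_complete_cds(toy))
```

Passing the full Gene sequence field to `gc_percent` gives a value that can be compared with the GC content reported in the record. Because the gene lies on the minus strand, the sequence shown is presumably already in coding orientation (it begins with ATG and ends with TGA), so no reverse complement is needed for these checks.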