Gene Information |
Locus tag | Ndas_1493 |
Symbol | |
ID | 9245343 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 1830963 |
End bp | 1832108 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | peptidase S1 and S6 chymotrypsin/Hap |
Protein accession | YP_003679429 |
Protein GI | 297560455 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
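The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the unspliced CDS ends in a single stop codon; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table.
length = gene_length(1830963, 1832108)
print(length)                  # 1146
print(protein_length(length))  # 381
```

Under these assumptions both results agree with the Gene Length (1146 bp) and Protein Length (381 aa) fields.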
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.03081 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACCCT CCCCCGCTAT CTCCGCTATC GGCACCGGCG CACTCGCGTT CGGTCTGGCG TTCTCCGTGA CGCCGGGCGC CAGTGCGGCG ACCGTACCGG CCGAGCCAGC GAGCGAGGCC CAGACGATGA TGGAAGCGCT GCAGAGAGAC CTCGGCCTCA CCCCGCTCGG GGCCGAGGAG CTGCTCTCGG CGCAGGAAGA GGCGATCGAG ACCGACGCCG AGGCCACCGA GGCCGCGGGA GCGTCCTACG GCGGCTCCCT GTTCGACACC GAGACCCTCC AGCTCACCGT GCTGGTGACC GACGCCTCGG CCGTCGAGGC GGTGGAGGCC ACCGGCGCCG AGGCCACCGT GGTCTCACAC GGCACCGAGG GCCTGGCCGA GGTGGTCGAC GCGCTCGACG AGACCGGCGG CCGGGAAGGG GTCGTCGGCT GGTACCCGGA TGTGGAGAGC GACACCGTCG TGGTCCAGGT CGCCGAGGGC GCCAGCGCCG ACGGCCTCAT CGAGGCCGCG GGCGTGGACC CCTCCGCCGT CCGGGTGGAG GAGACCAGCG AGACGCCGCG CCTGTACGCC GACATCGTCG GCGGCGAGGC GTACTACATG GGCGGCGGAC GCTGCTCGGT CGGGTTCGCC GTGACCGACG GTTCCGGCGC GGGCGGCTTC GTGACGGCGG GCCACTGCGG CACCGTCGGC ACCGGCGCCG AGAGCTCCGA CGGCAGCGGA TCCGGAACCT TCCAGGAGTC CGTCTTCCCG GGCAGCGACG GCGCCTTCGT CGCGGCCACC TCCAACTGGA ACGTGACCAA CCTGGTCAGC CGGTACGACT CCGGCAGCCC CCAGGCGGTG TCGGGTTCCA GCCAGGCCCC GGAGGGCTCG GCGGTGTGCC GCTCCGGCTC CACCACCGGC TGGCACTGCG GGACCATCGA GGCCCGCGGC CAGACGGTGA GCTACCCGCA GGGCACGGTC CAGGACCTGA CCCGGACGGA CGTGTGCGCC GAGCCCGGTG ACTCCGGCGG CTCGTTCATC GCCGGTTCGC AGGCCCAGGG CGTCACCTCC GGCGGCTCGG GCAACTGCAC TTCCGGCGGC ACGACCTACT ACCAGGAGGT CACTCCCCTG CTGAGCAGCT GGGGGCTGTC CCTGGTGACC GGGTAA
|
Protein sequence | MRPSPAISAI GTGALAFGLA FSVTPGASAA TVPAEPASEA QTMMEALQRD LGLTPLGAEE LLSAQEEAIE TDAEATEAAG ASYGGSLFDT ETLQLTVLVT DASAVEAVEA TGAEATVVSH GTEGLAEVVD ALDETGGREG VVGWYPDVES DTVVVQVAEG ASADGLIEAA GVDPSAVRVE ETSETPRLYA DIVGGEAYYM GGGRCSVGFA VTDGSGAGGF VTAGHCGTVG TGAESSDGSG SGTFQESVFP GSDGAFVAAT SNWNVTNLVS RYDSGSPQAV SGSSQAPEGS AVCRSGSTTG WHCGTIEARG QTVSYPQGTV QDLTRTDVCA EPGDSGGSFI AGSQAQGVTS GGSGNCTSGG TTYYQEVTPL LSSWGLSLVT G
|
| |
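The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a field might be cleaned and sanity-checked follows; the helper names are hypothetical, and the stop codons are the standard ones for translation table 11 (TAA, TAG, TGA).

```python
def strip_blocks(grouped: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str, stops=("TAA", "TAG", "TGA")) -> bool:
    """True if the length is a multiple of 3 and the final codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in stops


# First two blocks of the gene sequence, used only as a short demonstration.
prefix = strip_blocks("ATGCGACCCT CCCCCGCTAT")
print(prefix)               # ATGCGACCCTCCCCCGCTAT
print(gc_fraction(prefix))  # 0.65
```

Applied to the full Gene sequence field, `gc_fraction` can be compared with the reported GC content of 73%, and `ends_with_stop` with the terminal TAA codon.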