Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3441 |
Symbol | |
ID | 9247308 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4119126 |
End bp | 4122053 |
Gene Length | 2928 bp |
Protein Length | 975 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | peptidase S45 penicillin amidase |
Protein accession | YP_003681352 |
Protein GI | 297562378 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.830206 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTTCCCCT CGTTCAGAAG ACGGCTACGC ACCGGCGCCT CGGGTACCGC CCTGGCGGTG GCCGCGGCCA CGCTCGCCGC CCTGCCCGCC GCCCCCCAGG CCCTCGCCGA ACCGGAACCC GACTACTGCG CGCCCGGGGG GTGCAACGAC ATCCTTCCCC CCGGCCAGAA CGGCAGCGCC ACGCTCGTCG AGATCCTCGG CCACCAGGTC TTCGGCACCC GGCCGCGGCA CTCCTCCAGC CAGCTGGACA TGTACGACAA CCTCGTCCAC CACTACGACG GCCTCACCGA GGAGGGCCTG GACGACTTCT TCCTCGACGC CGGCTTCGGG GTCGACCCCG ACGACGTGGG GCGCCGGTAC CAGCCGCGCG ACGACGTCAC CATCACCCGC GACGCCGACC ACGGCATCCC GCACGTCGAG GGCACCACCC GCGAGGGCAC CATGTTCGGC GCCGGGTACG CCGCGGCCGA GGACCGCCTC TTCCTCATGG ACGTGCTGCG CCGGGTCGGC CGCGGCGAAC TCACCTCCTT CGCCGGAGGC GCCGAGGGCA ACCGCGGGCT GGAACAGGAC CTGTGGCGCA GCGCCCCCTA CACCGAGGAG GACAAGCAGG CGCAGATCGA CCGGGTCGCG GCCGGCGGAG AGCGCGGTGA ACAGGCCCTC GCCGACGTCG ACGCCTACCT GGAGGGCGTC AACGCCTACA TCGAGGAGGC CGACGACAAC CGCGACTTCC CCGGCGAGTA CGTCCTGACC GGGCACAAGG ACGCCATCAC CAACGCCGGG GAGATCGAGC CCTTCACCCC GACCGACGTC GTCGGCATCG CCGCCATGGT GGGCGGCATC TTCGGCGGCG GTGGCGGCGG CGAGGTCCAG GCGGCCGTCG TGCTCGCCAA CTTCCGGGAC CGCTACGGAG CCGAGGAGGG CGCCGAGCTT TACCGCGCCT GGCGGATGGA GAACGACCCC GAGGCCGTGG TCAGCGTCGG GGGCGAGTTC CCCTACGGCG GCACCCCCGA GAACCCGGTC GGCGAGGCCG TGCCCGACCC CGGCTCCGTG GAGCCCTACG ACATCGTCCA CGACGAGCAC GGTTCCGCGC TCAGCGGCGA GACCGCCGAG CAGCCCGCCC CCCTGGCGAG CACGGTGGAG GAGCCCGGGG AGGGAGCCGA GGCGCCCGCC GTCGAGGACC TGCCCGAGGA GCAGCGCGAC GAGCTGGAAC GGATGGTCGA GGACGGGGAC CTGTCCGCGG TGGAGGGCGT GTTCAACGAC GGCGTCCTGC CCGAGGGCTT CACCGACCCC CGCGGCATGT CCAACGCCCT GCTCGTCTCG GGCGAGCACA CCGCGAGCGG CAACCCCGTC GCGGTGATGG GCCCCCAGAC CGGCTACTTC TCCCCGCAGC TGCTCATGCT CCAGGAACTC CAGGGGCCGG GTGTCAGCGC GCGCGGCGCG TCCTTCGCGG GCGTGAGCTT CTACGTCCAG ATGGGGCGCG GCGTCGACTA CGCCTGGAGC GCCACCTCGG CGGGCCAGGA CGTCACCGAC ACCTTCGCCG TCGAACTGTG CGAGACCGAC GGGTCCGCGC CCACCACCGG GTCGCGCGCC TACGTGGACA CCGCCGGGGA CTGCGTGCCC TTCGAGGAGC TGAGCGTCAC CAACGAGTGG TCGCCCACCG TCGCCGACGG CACGCCGGCG GGCGGCTACA CGCTGACCTC CCTGCGTTCG GAGTTCGGCC TGGTGCACTC CTTCGCGACC GTGGACGGCG AGCCCGTCGC GTTCACGACG CGGCGCTCCA CCTACATGCG CGAGGTCGAC TCCATCATCG GCTTCCAGCG CTTCAACGAC CCCGGCGAGA TCACCTCGGC GCAGACCTTC ACGGACGCGG CCGAGGACAT CGGCTACGCC TTCAACTGGC ACTACGTCGA CGACGAGGAC ATCGCCTTCG TCAACTCCGG GGCCAACCCG GTCCGGACCG AGGGCACCAA CCCCAACCTG CCCGTGGACG CCTACTCCGA CGCGGGCTGG TCCGGCTGGG ACCCCGCCAC CAACGGCGCC GACTACCACC CGCAGGAGGA CCACCCGCAG GCGGTCAACC CCGACTACAT CGTCAACTGG AACAACAAGC CCGCCCAGGG CTACACCTCG GGCTGGGCCA CCGGACCGGT GCACCGCGGC GACCTCATCG ACGGCCGCGT CTCCGCGCTG GTCGCCGACG GGCACGAGTT CACCACCGCC AGCCTGACCC GGGTGATGAT GGAGGCCGGG GTCGCCGACC TGCGCGCCCA GGAGGTCCTC CCGCTCCTGC TGGAGGTGAT CGGTGCCGAG GCCGTGGACG ACCCCGAACT GGCCGCGGTC GTGGAGGAGC TGCGGCTCTG GCACGAGTCC GGATCCCTGC GCCGCGAGCC CTTCCGCGAC GCCGGGTACT ACTCCCACGC CGACGCCATC CGCACGCTGG ACGCCTGGTG GCCGCTGCTG GTCCGGGCCC AGTTCGAACC GGGCCTGGGC GAGGACCTCT ACACCCAGCT CACCAGGGCC GTGCAGACCG ACGAGTCGCC CTCGGGCTCC ATCGGGGGAG GCGAGCCCGG AAGCGTCAAC CAGGCGCAGC CCCACCGGGG ATCGGCGTTC CAGTACGGCT GGTGGTCCTA CGTGGACAAG GACCTGCGCA CGGTGCTCGG CCAGGAGGTG GAGGGCGGAC TGGGCGAGGC CTACTGCGGT GGCGGCGACC CCGACGCCTG CCGTACGGTG CTGCTGGACA CCCTGGCCCA GGCCGCGGAC ACCCCCGCGG CGGAGGTCTA CCCCGGCGAC GAGCACTGCG ACCCGGGCGA CCAGCTGTGC GCGGACACGG TGATCCACCA GGCGGTCGGC GGCATCAACA TGTGGCCCAT CGCGTGGCAG AACCGGCCCA CGTACCAGCT CGTCTACCAG TTCTCCGGCG GACGGTAG
|
Protein sequence | MFPSFRRRLR TGASGTALAV AAATLAALPA APQALAEPEP DYCAPGGCND ILPPGQNGSA TLVEILGHQV FGTRPRHSSS QLDMYDNLVH HYDGLTEEGL DDFFLDAGFG VDPDDVGRRY QPRDDVTITR DADHGIPHVE GTTREGTMFG AGYAAAEDRL FLMDVLRRVG RGELTSFAGG AEGNRGLEQD LWRSAPYTEE DKQAQIDRVA AGGERGEQAL ADVDAYLEGV NAYIEEADDN RDFPGEYVLT GHKDAITNAG EIEPFTPTDV VGIAAMVGGI FGGGGGGEVQ AAVVLANFRD RYGAEEGAEL YRAWRMENDP EAVVSVGGEF PYGGTPENPV GEAVPDPGSV EPYDIVHDEH GSALSGETAE QPAPLASTVE EPGEGAEAPA VEDLPEEQRD ELERMVEDGD LSAVEGVFND GVLPEGFTDP RGMSNALLVS GEHTASGNPV AVMGPQTGYF SPQLLMLQEL QGPGVSARGA SFAGVSFYVQ MGRGVDYAWS ATSAGQDVTD TFAVELCETD GSAPTTGSRA YVDTAGDCVP FEELSVTNEW SPTVADGTPA GGYTLTSLRS EFGLVHSFAT VDGEPVAFTT RRSTYMREVD SIIGFQRFND PGEITSAQTF TDAAEDIGYA FNWHYVDDED IAFVNSGANP VRTEGTNPNL PVDAYSDAGW SGWDPATNGA DYHPQEDHPQ AVNPDYIVNW NNKPAQGYTS GWATGPVHRG DLIDGRVSAL VADGHEFTTA SLTRVMMEAG VADLRAQEVL PLLLEVIGAE AVDDPELAAV VEELRLWHES GSLRREPFRD AGYYSHADAI RTLDAWWPLL VRAQFEPGLG EDLYTQLTRA VQTDESPSGS IGGGEPGSVN QAQPHRGSAF QYGWWSYVDK DLRTVLGQEV EGGLGEAYCG GGDPDACRTV LLDTLAQAAD TPAAEVYPGD EHCDPGDQLC ADTVIHQAVG GINMWPIAWQ NRPTYQLVYQ FSGGR
|
| |