Gene Ndas_3441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3441 
Symbol 
ID9247308 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4119126 
End bp4122053 
Gene Length2928 bp 
Protein Length975 aa 
Translation table11 
GC content72% 
IMG OID 
Productpeptidase S45 penicillin amidase 
Protein accessionYP_003681352 
Protein GI297562378 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.830206 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTTCCCCT CGTTCAGAAG ACGGCTACGC ACCGGCGCCT CGGGTACCGC CCTGGCGGTG 
GCCGCGGCCA CGCTCGCCGC CCTGCCCGCC GCCCCCCAGG CCCTCGCCGA ACCGGAACCC
GACTACTGCG CGCCCGGGGG GTGCAACGAC ATCCTTCCCC CCGGCCAGAA CGGCAGCGCC
ACGCTCGTCG AGATCCTCGG CCACCAGGTC TTCGGCACCC GGCCGCGGCA CTCCTCCAGC
CAGCTGGACA TGTACGACAA CCTCGTCCAC CACTACGACG GCCTCACCGA GGAGGGCCTG
GACGACTTCT TCCTCGACGC CGGCTTCGGG GTCGACCCCG ACGACGTGGG GCGCCGGTAC
CAGCCGCGCG ACGACGTCAC CATCACCCGC GACGCCGACC ACGGCATCCC GCACGTCGAG
GGCACCACCC GCGAGGGCAC CATGTTCGGC GCCGGGTACG CCGCGGCCGA GGACCGCCTC
TTCCTCATGG ACGTGCTGCG CCGGGTCGGC CGCGGCGAAC TCACCTCCTT CGCCGGAGGC
GCCGAGGGCA ACCGCGGGCT GGAACAGGAC CTGTGGCGCA GCGCCCCCTA CACCGAGGAG
GACAAGCAGG CGCAGATCGA CCGGGTCGCG GCCGGCGGAG AGCGCGGTGA ACAGGCCCTC
GCCGACGTCG ACGCCTACCT GGAGGGCGTC AACGCCTACA TCGAGGAGGC CGACGACAAC
CGCGACTTCC CCGGCGAGTA CGTCCTGACC GGGCACAAGG ACGCCATCAC CAACGCCGGG
GAGATCGAGC CCTTCACCCC GACCGACGTC GTCGGCATCG CCGCCATGGT GGGCGGCATC
TTCGGCGGCG GTGGCGGCGG CGAGGTCCAG GCGGCCGTCG TGCTCGCCAA CTTCCGGGAC
CGCTACGGAG CCGAGGAGGG CGCCGAGCTT TACCGCGCCT GGCGGATGGA GAACGACCCC
GAGGCCGTGG TCAGCGTCGG GGGCGAGTTC CCCTACGGCG GCACCCCCGA GAACCCGGTC
GGCGAGGCCG TGCCCGACCC CGGCTCCGTG GAGCCCTACG ACATCGTCCA CGACGAGCAC
GGTTCCGCGC TCAGCGGCGA GACCGCCGAG CAGCCCGCCC CCCTGGCGAG CACGGTGGAG
GAGCCCGGGG AGGGAGCCGA GGCGCCCGCC GTCGAGGACC TGCCCGAGGA GCAGCGCGAC
GAGCTGGAAC GGATGGTCGA GGACGGGGAC CTGTCCGCGG TGGAGGGCGT GTTCAACGAC
GGCGTCCTGC CCGAGGGCTT CACCGACCCC CGCGGCATGT CCAACGCCCT GCTCGTCTCG
GGCGAGCACA CCGCGAGCGG CAACCCCGTC GCGGTGATGG GCCCCCAGAC CGGCTACTTC
TCCCCGCAGC TGCTCATGCT CCAGGAACTC CAGGGGCCGG GTGTCAGCGC GCGCGGCGCG
TCCTTCGCGG GCGTGAGCTT CTACGTCCAG ATGGGGCGCG GCGTCGACTA CGCCTGGAGC
GCCACCTCGG CGGGCCAGGA CGTCACCGAC ACCTTCGCCG TCGAACTGTG CGAGACCGAC
GGGTCCGCGC CCACCACCGG GTCGCGCGCC TACGTGGACA CCGCCGGGGA CTGCGTGCCC
TTCGAGGAGC TGAGCGTCAC CAACGAGTGG TCGCCCACCG TCGCCGACGG CACGCCGGCG
GGCGGCTACA CGCTGACCTC CCTGCGTTCG GAGTTCGGCC TGGTGCACTC CTTCGCGACC
GTGGACGGCG AGCCCGTCGC GTTCACGACG CGGCGCTCCA CCTACATGCG CGAGGTCGAC
TCCATCATCG GCTTCCAGCG CTTCAACGAC CCCGGCGAGA TCACCTCGGC GCAGACCTTC
ACGGACGCGG CCGAGGACAT CGGCTACGCC TTCAACTGGC ACTACGTCGA CGACGAGGAC
ATCGCCTTCG TCAACTCCGG GGCCAACCCG GTCCGGACCG AGGGCACCAA CCCCAACCTG
CCCGTGGACG CCTACTCCGA CGCGGGCTGG TCCGGCTGGG ACCCCGCCAC CAACGGCGCC
GACTACCACC CGCAGGAGGA CCACCCGCAG GCGGTCAACC CCGACTACAT CGTCAACTGG
AACAACAAGC CCGCCCAGGG CTACACCTCG GGCTGGGCCA CCGGACCGGT GCACCGCGGC
GACCTCATCG ACGGCCGCGT CTCCGCGCTG GTCGCCGACG GGCACGAGTT CACCACCGCC
AGCCTGACCC GGGTGATGAT GGAGGCCGGG GTCGCCGACC TGCGCGCCCA GGAGGTCCTC
CCGCTCCTGC TGGAGGTGAT CGGTGCCGAG GCCGTGGACG ACCCCGAACT GGCCGCGGTC
GTGGAGGAGC TGCGGCTCTG GCACGAGTCC GGATCCCTGC GCCGCGAGCC CTTCCGCGAC
GCCGGGTACT ACTCCCACGC CGACGCCATC CGCACGCTGG ACGCCTGGTG GCCGCTGCTG
GTCCGGGCCC AGTTCGAACC GGGCCTGGGC GAGGACCTCT ACACCCAGCT CACCAGGGCC
GTGCAGACCG ACGAGTCGCC CTCGGGCTCC ATCGGGGGAG GCGAGCCCGG AAGCGTCAAC
CAGGCGCAGC CCCACCGGGG ATCGGCGTTC CAGTACGGCT GGTGGTCCTA CGTGGACAAG
GACCTGCGCA CGGTGCTCGG CCAGGAGGTG GAGGGCGGAC TGGGCGAGGC CTACTGCGGT
GGCGGCGACC CCGACGCCTG CCGTACGGTG CTGCTGGACA CCCTGGCCCA GGCCGCGGAC
ACCCCCGCGG CGGAGGTCTA CCCCGGCGAC GAGCACTGCG ACCCGGGCGA CCAGCTGTGC
GCGGACACGG TGATCCACCA GGCGGTCGGC GGCATCAACA TGTGGCCCAT CGCGTGGCAG
AACCGGCCCA CGTACCAGCT CGTCTACCAG TTCTCCGGCG GACGGTAG
 
Protein sequence
MFPSFRRRLR TGASGTALAV AAATLAALPA APQALAEPEP DYCAPGGCND ILPPGQNGSA 
TLVEILGHQV FGTRPRHSSS QLDMYDNLVH HYDGLTEEGL DDFFLDAGFG VDPDDVGRRY
QPRDDVTITR DADHGIPHVE GTTREGTMFG AGYAAAEDRL FLMDVLRRVG RGELTSFAGG
AEGNRGLEQD LWRSAPYTEE DKQAQIDRVA AGGERGEQAL ADVDAYLEGV NAYIEEADDN
RDFPGEYVLT GHKDAITNAG EIEPFTPTDV VGIAAMVGGI FGGGGGGEVQ AAVVLANFRD
RYGAEEGAEL YRAWRMENDP EAVVSVGGEF PYGGTPENPV GEAVPDPGSV EPYDIVHDEH
GSALSGETAE QPAPLASTVE EPGEGAEAPA VEDLPEEQRD ELERMVEDGD LSAVEGVFND
GVLPEGFTDP RGMSNALLVS GEHTASGNPV AVMGPQTGYF SPQLLMLQEL QGPGVSARGA
SFAGVSFYVQ MGRGVDYAWS ATSAGQDVTD TFAVELCETD GSAPTTGSRA YVDTAGDCVP
FEELSVTNEW SPTVADGTPA GGYTLTSLRS EFGLVHSFAT VDGEPVAFTT RRSTYMREVD
SIIGFQRFND PGEITSAQTF TDAAEDIGYA FNWHYVDDED IAFVNSGANP VRTEGTNPNL
PVDAYSDAGW SGWDPATNGA DYHPQEDHPQ AVNPDYIVNW NNKPAQGYTS GWATGPVHRG
DLIDGRVSAL VADGHEFTTA SLTRVMMEAG VADLRAQEVL PLLLEVIGAE AVDDPELAAV
VEELRLWHES GSLRREPFRD AGYYSHADAI RTLDAWWPLL VRAQFEPGLG EDLYTQLTRA
VQTDESPSGS IGGGEPGSVN QAQPHRGSAF QYGWWSYVDK DLRTVLGQEV EGGLGEAYCG
GGDPDACRTV LLDTLAQAAD TPAAEVYPGD EHCDPGDQLC ADTVIHQAVG GINMWPIAWQ
NRPTYQLVYQ FSGGR