Gene Ndas_4248 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4248 
Symbol 
ID9248122 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5062600 
End bp5065173 
Gene Length2574 bp 
Protein Length857 aa 
Translation table11 
GC content71% 
IMG OID 
Productpeptidase S45 penicillin amidase 
Protein accessionYP_003682143 
Protein GI297563169 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.409218 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.181167 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGTAA TGCGTAAGCG TGTCCTTGTT CGCGTACTTC TGGGCTTCCT CGCTGCGGTC 
GTGGTGCTGG CCCTGGTCGG CGCGCTCCTG GGGATATGGA CCGTCCGGAG GTCGTTCCCG
GAGACATCAG GAGAACTCCC GCTCCCGGGG CTGGAGTCCT CCGTCACCGT GCTGCGCGAC
GAGCACGGCG TCCCCCACGT CTACGCGGAC AACACCCACG ACCTGTTCAT GGCGCAGGGC
TTCACGCACG CCCAGGACCG CTTCTGGGAG ATGGACTTCC GCCGCCACGT CACGGCCGGG
CGCACCGCCG AACTGTTCGG CCCCGACCAG GTGGACACCG ACGTCTACCT GCGCACCATG
GGCTGGCGGC ACGTGGCCGA GCAGGAGTAC GAGCTGCTCG CCCCGGAGAC GCAGGCCCAC
CTCGACTCCT ACGCGGCCGG GGTCAACGCC TGGCTGGCCG AGAACGACGG CACCAAGGCC
AGCCTGGAGT ACGGGCTGCT CTCCGTGCTC AACGGCGGCC ACGAGATCGA GGAGTGGACG
CCGGTCGACA GCCTCGCCTG GCTCAAGGCC ATGGCCTGGG ACCTGGGCGG CAACATGACG
GAGGAGACCG AGCGCGCCCA GCTGCTCGAC GCCGGGATGA CCCGCGAGCA GATCGAGGAG
CTGTACCCGG CCTACCCCCA CGAGGAGCAC CTGCCCATCA CCGACACCGA CGAGCACGGT
CAGGCCGGGG CCGACGGGGA GTCCGGGGGT TCCGAGGACT CCGACGAGGC TGACCTGCGG
GAGGCCGACG AACGGCGCAC CCCCGAGCTG CCGGAGGAGG CCGTCACCGC GCTGACCGGG
GTGGCCGCCA CCGCGGCGTC ACTGCCCTCG ATGCTCGGTC CCAGCACCAG CCCCGACCTG
GGGTCCAACT CGTGGGTGGT CGGAGGCGAG CACACCGAGA GCGGGCTGCC CCTCCTCGCC
AACGACCCCC ACCTGGGCGC GTCGATGCCC TCCACCTGGT ACCAGATCGG CCTGCACTGC
ACCGAGCTCA CCGAGGCGTG CCCCTACGAT GTCAGCGGCT TCAGCTTCTC CGGCCTGCCC
GGTGTGATCA TCGGGCAGAA CGAGTCGATC GCGTGGGGGT TCACCAACCT CAACCCCGAC
GTCATGGACC TGTACGTGGA ACGGATCGAA GGCGACGGCT ACGTGGTCGA CGGCCGGACC
AGGCCGCTGG AGACCCGTGA GGAGACCGTC CGGGTCGCGG GCGGCGAGGA CGTCGACATC
GTCGTGCGCT CCACCCACCA CGGGCCGCTG CTGTCCGACA CCGCCGCGGG CGCGAACCTG
GAGGCGCTGG CGGAGGAACC GGAGCTGGAG GGCGCGGAGG AGGGCGAGTA CGCCGTCGCC
CTGCGGTGGA CCGCGCTCCA GCCGGGCAGG ACCGCGGACT CGATCTTCGC GCTGAACCGG
GCCCGGGACT GGGGCGATTT CCGTGAGGCG GCCAGCCGGT TCGAGGCCCC CGCGCAGAAC
CTCGTCTACG CGGACGAGGA GGGGAACATC GGCTACCAGG CCCCCGGCCT GGTGCCGGTG
CGCGGCGAGG GCGACGGCCG CTACCCGGCG CCCGGCTGGG ACTCCGCCTA CGACTGGGAG
GAGTACCTCC CCTTCGAGGA GCTGCCGAGC GTGTACAACC CCGAGTCGGG CGTCATCGTC
ACCGCCAACC AGTCGGTCGT GGACGCCGAC TACGAGCACA TGCTCACGAG TGACTGGGAC
TACGGGTACC GGTCCCAGCG GATCAACGAC CTGCTCACCG AGGCGATCGG CGAGGGGCCG
GTGACCGGTG AGGACATGTC GCGCATCCAC ATGGACTCCT TCCACGGCGG CGCGGTCGAG
GTCGTGCCGC ACCTGTTGGA GGCGGACGTC GACGGGGTCA CGGCCCAGGC CCAGGAACAG
CTGCGGGAGT GGGACCTGTT CACCGGGACC GACTCCGCCG GGGCCGCGTT CTACCAGGCC
ACCTGGCGCC ACCTGCTGCC GCTGCTGTTC GACGAACTGG AGCCGCTGAC CATGAGCGGC
AGCTCGCGCG GGATGTACGT GGTGGGCCGG CTGCTGGAGG ACCCCGATTC GGACTGGTGG
CGGGGCACCG AGGCCAGCGG GCGCGAGGAG GTGCTGGCCG CGGCGATGGA CGCCGCCGCC
CAGGAGCTGA CCGAGCTGCT GGGAGGGGAC CCCGCCGACT GGCGCTGGGG GGACCTGCAC
ACCCTGACCG CGACCCACGA GTCGTTCGGA ACGTCGGGCA TCGGTCCGGT CGAGTGGCTG
TTCAACCGGG GCCCGGTGGA GAGCTCGGGC GGGGCCAGCA TCGTCAACGC CACGGGCTGG
GACCCGACCG CGGGGTACGC GATCACGGCG GTGCCGTCGA TGCGGATGGT GGTGGACCTG
GCCGACCGCG ACGCGTCGAC CTGGGTCCAC CTGACCGGCA ACTCCGGACA CGCCTTCCAC
CCCAACTACG ACGACCAGCT GGAGCCGTGG AGCCGGGGTG AGACCCTGCC GTTCGCGGTC
ACCGAGGAGG CCGTGCGCGC GGCCGCCACG GACGAACTGG TCCTCAACCC GTAG
 
Protein sequence
MGVMRKRVLV RVLLGFLAAV VVLALVGALL GIWTVRRSFP ETSGELPLPG LESSVTVLRD 
EHGVPHVYAD NTHDLFMAQG FTHAQDRFWE MDFRRHVTAG RTAELFGPDQ VDTDVYLRTM
GWRHVAEQEY ELLAPETQAH LDSYAAGVNA WLAENDGTKA SLEYGLLSVL NGGHEIEEWT
PVDSLAWLKA MAWDLGGNMT EETERAQLLD AGMTREQIEE LYPAYPHEEH LPITDTDEHG
QAGADGESGG SEDSDEADLR EADERRTPEL PEEAVTALTG VAATAASLPS MLGPSTSPDL
GSNSWVVGGE HTESGLPLLA NDPHLGASMP STWYQIGLHC TELTEACPYD VSGFSFSGLP
GVIIGQNESI AWGFTNLNPD VMDLYVERIE GDGYVVDGRT RPLETREETV RVAGGEDVDI
VVRSTHHGPL LSDTAAGANL EALAEEPELE GAEEGEYAVA LRWTALQPGR TADSIFALNR
ARDWGDFREA ASRFEAPAQN LVYADEEGNI GYQAPGLVPV RGEGDGRYPA PGWDSAYDWE
EYLPFEELPS VYNPESGVIV TANQSVVDAD YEHMLTSDWD YGYRSQRIND LLTEAIGEGP
VTGEDMSRIH MDSFHGGAVE VVPHLLEADV DGVTAQAQEQ LREWDLFTGT DSAGAAFYQA
TWRHLLPLLF DELEPLTMSG SSRGMYVVGR LLEDPDSDWW RGTEASGREE VLAAAMDAAA
QELTELLGGD PADWRWGDLH TLTATHESFG TSGIGPVEWL FNRGPVESSG GASIVNATGW
DPTAGYAITA VPSMRMVVDL ADRDASTWVH LTGNSGHAFH PNYDDQLEPW SRGETLPFAV
TEEAVRAAAT DELVLNP