Gene Ndas_5338 details

Gene Information

Locus tag: Ndas_5338
Symbol:
ID: 9249241
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 513073
End bp: 514650
Gene Length: 1578 bp
Protein Length: 525 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: beta-lactamase
Protein accession: YP_003683224
Protein GI: 297564251
COG category:
COG ID:
TIGRFAM ID:
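
The coordinates and lengths listed above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length, and the variable names are hypothetical rather than part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 513073
end_bp = 514650
gene_length_bp = 1578
protein_length_aa = 525

# Assumption: start and end coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
codons = span_bp // 3
expected_protein_length = codons - 1

print(span_bp == gene_length_bp)                     # True
print(span_bp % 3 == 0)                              # True
print(expected_protein_length == protein_length_aa)  # True
```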


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.271651
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.211898
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTCCCG ACCCACACGC CCGTCCCCGC CCAGATCCAC TCCCGGGACG GGTTCCACAC 
CCGGGGCCAC GCGGCGGACA CCGCGTCCTC AGCGTCCTGG GAGCCGCCGC GCTGCTGCTC
CTGCACGCGG CCCCCGCCTC CGCCGAACCC CTGCGGACCC CGGCCCCCGC CGCGGTCGAC
GCCCTCGTGG AGGAGTACCG GGAGGCCACC GGCCTTCCCG GCGCGGCGGT GGTCCTGACC
CACGGCACGG AGGTCGTCCA CGCGCAGGGG TACGGCACCA CCCCCGGCGG GGAGCCCGTC
ACCGAGCGCA CACCCATGGC CGTGGCCTCG GTGAGCAAGT CCTTCACCGC GCTGGCGGTC
CTCCAGCTGG TCGAGGCGGG AGAGCTCGGC CTGGACGACC CGCTGCGGGA GCACCTGCCC
GAGTTCACGA TGGCCGACCC CCGCGCGGCG CGGATCACCG TCCGCCAGCT GCTCGACCAG
ACCTCCGGCA TGTCCGACTC CGAGTTCGCC TCCTACAGCC GGGGCCGGAG CACCCTCACC
CTGCGCCAGT CCGTCGAGGA CCTGCGCGAC GCGGGTCTGG CCGCCGAACC CGGCACGCGG
TGGGAGTACC ACAACCCCAA CTTCCAGGTC GCCGCCCGCC TGGTCGAGGT GGTCGGCGGG
CGGTCCTTCG CCGACCACCT GGACGAGCGG GTCCTCTCGC CGCTGGGGAT GGACGACACC
ACCACCGTCG ACACCGACCT CGACCTGCCG TCCGGCGCGC GCGGCCACAT CAGCGTGCTC
GGACTCCCGT TCCCGGCGGA CGAACCCCCC GGGTTCGGCA ACGGCTCGGG CGGCGTCATC
AGCACCGCCG CGGACATGGG CGCGTGGCTC GTCGCCCACA ACAACGGCGG TCGGGGCCCC
GGCGGGGAAC GGATCCTGTC CGAGGAGGGG ATCGAGGCCC TGCACACCCC CTCGCCGGTC
TCCGGGGGCT CCTACGCCCT GGGGTGGTCG ACGGACGAGA CCGGTTCGGG GGCGCCGGTG
GTCGAGCACA GCGGCAACCT GATGACCGCC ACCGCGCACC AGGCGCTGCT GCCCGAGAGC
GGTCACGGTG TGGCGGTCAT GGCCAACAGC GGGTCGGCGG GCAGCGGCGC CTCCGCCCTG
GCCGCCGCGC TCGTGGAGCT GATCGACTCC GGGCGTGCGC CCGTACCGCC GACGGGAACG
CTGGTGCTGG TCGCGGACGC CGTCCTGCTC GCCCTGGGCG CCGCCGCCGT ACCGCTCGCC
TGGCGCGGCG TCCGGCGCGC GGGTGACTGG GCCCGCGCAC GCGGGGGGCG CCCGTGGTGG
GCCGCGGCGG CGCGCCTGCT GCCCTGCACC GCGCCGCCGC TGCTGCTGGC GGTCCTGCAC
GAGGTGGTCG GGTGGCTCTA CCGGGGCAGG GACGTGGCGT GGTTCCAGGT GCCCTACCTC
TTCACGTCCT TCTGGGCGGC CCTCGCCCTA CTCGCCCTGG CCTGTGCGAC GGTCGCCTCG
GTCCGCGTCG TCCGTCTGGT GTCGGTGCGC CGTGGAACAG GCGGTACCGG CGGGAGAGCG
GAGCGCGCGT CGGGCTGA
 
Protein sequence
MTPDPHARPR PDPLPGRVPH PGPRGGHRVL SVLGAAALLL LHAAPASAEP LRTPAPAAVD 
ALVEEYREAT GLPGAAVVLT HGTEVVHAQG YGTTPGGEPV TERTPMAVAS VSKSFTALAV
LQLVEAGELG LDDPLREHLP EFTMADPRAA RITVRQLLDQ TSGMSDSEFA SYSRGRSTLT
LRQSVEDLRD AGLAAEPGTR WEYHNPNFQV AARLVEVVGG RSFADHLDER VLSPLGMDDT
TTVDTDLDLP SGARGHISVL GLPFPADEPP GFGNGSGGVI STAADMGAWL VAHNNGGRGP
GGERILSEEG IEALHTPSPV SGGSYALGWS TDETGSGAPV VEHSGNLMTA TAHQALLPES
GHGVAVMANS GSAGSGASAL AAALVELIDS GRAPVPPTGT LVLVADAVLL ALGAAAVPLA
WRGVRRAGDW ARARGGRPWW AAAARLLPCT APPLLLAVLH EVVGWLYRGR DVAWFQVPYL
FTSFWAALAL LALACATVAS VRVVRLVSVR RGTGGTGGRA ERASG
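
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the method used to build this record. It assumes that translation table 11 assigns amino acids to codons the same way the standard genetic code does (the two differ in permitted start codons), and the helper names `clean`, `translate`, and `gc_percent` are hypothetical. Only the first and last rows of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
# Assumption: table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGACTCCCG ACCCACACGC CCGTCCCCGC CCAGATCCAC TCCCGGGACG GGTTCCACAC"
last_row = "GAGCGCGCGT CGGGCTGA"

print(translate(first_row))  # MTPDPHARPRPDPLPGRVPH
print(translate(last_row))   # ERASG
print(round(gc_percent(first_row)))
```

The two translated fragments match the start and end of the protein sequence above. The last row can be translated on its own because the 26 rows before it hold 1560 bases, a multiple of three. Passing the complete gene sequence to `gc_percent` would give a value to compare with the GC content field.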