Gene Ndas_3504 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3504 
Symbol 
ID9247373 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4203234 
End bp4206728 
Gene Length3495 bp 
Protein Length1164 aa 
Translation table11 
GC content68% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003681411 
Protein GI297562437 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCTCCC CCTTGACCCG GGTAGCGGCG GTGACCACCG CTGCCACTCT GGCCCTGTCA 
CTCCTGACCG CGGTACCCGC CGCGGCCGAC GACGAGTCCA CATCGCCCAG CAGCGAAGGC
ACGCAGAGCG GGAACGAGGA GTTCGACGCG CTCGCGCTGG CGGAGCAGAC CGGGGAGCCG
GTGGAGATCC CGTCCCTGAC GGACGAGAAG ACCCAGCACT TCGCCAACCC CGACGGCACC
TTCACCGCCG AGGTCTCCGC GATCCCCGTT CGGGTTCGTT CCGGGGGTGG CTGGGTGGAC
GTGGACACCA CCCTGGTCGC CGGCGACGAC GGTCTCGTCC GACCCAGGGC GGCGGGCATG
GACATCGCCT TCTCCGGCGG CGGCGACGCC CCGATGGCCC GGATCGGCAT CGGCTCGAAC
AGCGTCGCGC TCGACTGGGT CGCGGACCTG CCCACGCCGA CCCTGGACGG CGACCAGGCC
ACCTACGCCG ACGTGCTGCC GGACGTGGAC CTGGTCCTGA CCGCCGGGGT GGAGGGCTTC
ACCCAGGTCC TGGTGGTGCA CACCCCCGAG GCCGCGGCCT CCCCGGAACT GGCCGAACTC
GAACTCCCCC TGAACGCCAC CGGGGTGAGC GTCGTCGCCG ACGAACACGG CAACATCGAG
GCCATCGGTG ACGAGGGCGG CCAGAGCGTC TTCACAGCCC AGGCCCCGGC GATGTGGGAC
TCCACCGGCC AGGACGAGGC TGCCGGTGAG AACCCGCTCC TGGCGCCGAC CACGGGAGCC
CGCACGGCTT TGGTCGAGAC CGAGATCGGC GTTGACTCCA TCCGTCTCAT TCCGGACCAG
GGGCTGCTCA CCGGGGAGGA CACCGAGTTC CCGCTCTACA TCGACCCCTC GGTCAGCGCC
AGCCGCCTCA ACTGGGCCTA TGTCGACAAG GCCTACCCGG ACCAGGCCAA CCACAACAAG
AAGATCGACA ATGTCGGTGT GGGCCGCTAC TCCGACCACA CCGGCACGTA CACCCGCCGC
GCGTTCTTCC AGTTCTCCGT TCCGGGACGC ATCAAGCAGG ACACCACCAC GATCCACTCG
GCCACGATGC GCGTCGAGGT GGAGTGGGCC TCCACCTGCA ACAGCAACTC CTACGTGGAG
CTGCACCGAG TGGACCGCTT CACGAACAGG ACGACCTGGA ACAACCAACC CACCGCGAGG
ACTCTGCTCG ACACACGGAA CATCAGGGAT GGCTGGGCCG CCTGCCCGTC CGACCGAGGT
ACGGAGTTCG ACGCCACCGA GGCCTACCAG TGGGGCGCGG ACAACGACGA GTCCTACGTC
TACCTGCGGC TGAAGGAACG CGACGAGACC TCGAACTCCG CCTGGCGGCG CATCGACATC
GACGGTAAGC CGCCCGTGCT GGTGATCAAC TACAACAACA CGCCCTCCCA GGTGGCCACC
TCCAGCATGT CCGACTCCCT GGGCGGAGTG TGCTCCACGG ACCAGGACAA CCCGCGACTG
ATCAACAACA CCACCGTCAC GTTCCGCGCC ACTGTCCGCG ACTACGACGC GCGCGCCGCC
TGGGGCGGCC AGAAGCTCAA GCTGCGTGTG GAGTGGCGGG TCAACGGCAC CGAACCGCGT
GAGTACGCCG ACTCCTCCTA CGCCGACGTG GGCTACTGGC CTGAGGGCTC CGAACGCGCG
GTGACCGCCA CCGGACTTCC CGAGGGCGAG GTGTTGGGAT ACCGGGCGAT CGCACACGAC
GAGACCGAGT GGGGCTACCC GTGGTCGGAC TGGTGCTGGA TCAAGATCGA CACATCCAAG
CCCGACACCG GTCCCATCGT GGCCTCCACC GACTACCCGG CCGACGACGC GCTGCACGGC
AGCGCCGGAC GCACCGGCGA CTTCACCTTC TCCAACAACG GCGTCGAACA GGCGGCTTCG
TACTACTACA GCCTCAATGA CACCTCGTGC ACGACCGAGG TGAAGCTCGC TGAGCCCGGG
GCCAGTGCCA CGATCCCGAT CACGCCCAGC CGTAGCGGGG CGAACATGAT CTACGCCAGC
ACTGTCGACG CCTACGGCAA CTCGTCTCCG TGCGAACTCG CGTACTACTT CCTGGTCGCG
CCCCCGAGCG CCCCGATCTC CCGCTTCACT CTCGACGAAG GCACAGGGGG AACCGCCGTC
GACGCGGCGG ACCCGGACAG GACCGCCACG GCCAGTGGTG ACACCGGTTG GACTCGAGGC
CGAGTGGGCG GTGTCCGACA GGGCCGCTAC CAGCTGGAAG GCACCAGCCT GACCACGGCG
GACGGGGGCC ACGCCCGTAC CGACGCACCG GTGGTCGACA CCAGTGGCGC GTTCTCCGTG
TCGGCGTGGG TGCGTCTGGA CGAGGCCACA GCCAACCACA CGGCCGTGTC CCAGGACGGT
GAACGGCACA GCGGCTTCTA CCTCGGCTAC AACCACACTG CCGAGGGCAA CTGGGTGTTC
AAACAGGCCC CCTTCGACGG AGACGAGACC AGTATCTCCC GGCGCGTCTA CTCGAACGAG
CCCGCTGAGA CGGGGGTGTG GACCCACCTG CTGGGAACCC ACGATCCAGA GACCGGTCAA
CTCGTCCTCT ACGTCGACGG AGTACGTCAG GGCGAGGCGG TCCAGGAGTC CCCGTGGAAC
GCGCAAGGCC CCCTGGTCAT CGGAGGTGCC AGGTACCGGG GTCAGTTCTA CGACGCCTGG
CCCGGAGCCA TCGACGACGT ACGGGTCTGG GACCGTGTGG TCACCGACGA GGTCACGGGG
GAGGACACCG AGGCGCGTTC AGAGGTCTGG ACCCTGGCCA ACCGCCCCAT CGCGCTGGAA
GGCCGCTGGC AACTGGACGA GACCGACGGC ACCCTCGCCG CGGACTCCTC CGACCACGGT
CTTGGCGCAA CCCTGCACGG CGACCCCCTG ACCGCGTGGA ACAAGGCCCT CAACGACGTC
ACCTTCGCTC CTGGTGTGAG CCTGAACCCA GCCGTCCCCG AACGCATCAC GACCGAAGGC
CCGGCTATCC GTACCGACCG CAGTTACAGC GTCGCGGCAT GGGTCCGCCT GGACGAGGTC
GGGCACAACT CCACGGCGGT CTCCCAGGAC GGTGTTCATC ACAGCGGCTT CTACCTGGGC
TACCAGTACA CAGCAGATTC CCACCAGTGG ATGCTGAAGA ATCCGCCTTC GGACACCGCG
GGAGCTTCGG GTTGGCACCG GGCGAGATCC GATCAGCACG CCGAGTTCGG ACGTTGGACC
CACCTGGTCG GAACCTACGA CCACACCTCG CGGACCACCA CCTTCTACCT CGACGGTGTC
GAGCAGGGCA CGGCCGAGGT GCCTGACGCC TGGCACGCCA ACGGTCCGGT CGTCATCGGC
GGCGGACGCT TCGAGCAGCA GCTCTCCGAT GCCTGGGCCG GAGACATCAG CGACGTCTTC
CTGTACCAGG GCGTGTTCGA CGAACACGAG ATCCTCGCAG TCAGGGAAGG AATCGCGCCG
GTCCCCAGGC TCTAA
 
Protein sequence
MLSPLTRVAA VTTAATLALS LLTAVPAAAD DESTSPSSEG TQSGNEEFDA LALAEQTGEP 
VEIPSLTDEK TQHFANPDGT FTAEVSAIPV RVRSGGGWVD VDTTLVAGDD GLVRPRAAGM
DIAFSGGGDA PMARIGIGSN SVALDWVADL PTPTLDGDQA TYADVLPDVD LVLTAGVEGF
TQVLVVHTPE AAASPELAEL ELPLNATGVS VVADEHGNIE AIGDEGGQSV FTAQAPAMWD
STGQDEAAGE NPLLAPTTGA RTALVETEIG VDSIRLIPDQ GLLTGEDTEF PLYIDPSVSA
SRLNWAYVDK AYPDQANHNK KIDNVGVGRY SDHTGTYTRR AFFQFSVPGR IKQDTTTIHS
ATMRVEVEWA STCNSNSYVE LHRVDRFTNR TTWNNQPTAR TLLDTRNIRD GWAACPSDRG
TEFDATEAYQ WGADNDESYV YLRLKERDET SNSAWRRIDI DGKPPVLVIN YNNTPSQVAT
SSMSDSLGGV CSTDQDNPRL INNTTVTFRA TVRDYDARAA WGGQKLKLRV EWRVNGTEPR
EYADSSYADV GYWPEGSERA VTATGLPEGE VLGYRAIAHD ETEWGYPWSD WCWIKIDTSK
PDTGPIVAST DYPADDALHG SAGRTGDFTF SNNGVEQAAS YYYSLNDTSC TTEVKLAEPG
ASATIPITPS RSGANMIYAS TVDAYGNSSP CELAYYFLVA PPSAPISRFT LDEGTGGTAV
DAADPDRTAT ASGDTGWTRG RVGGVRQGRY QLEGTSLTTA DGGHARTDAP VVDTSGAFSV
SAWVRLDEAT ANHTAVSQDG ERHSGFYLGY NHTAEGNWVF KQAPFDGDET SISRRVYSNE
PAETGVWTHL LGTHDPETGQ LVLYVDGVRQ GEAVQESPWN AQGPLVIGGA RYRGQFYDAW
PGAIDDVRVW DRVVTDEVTG EDTEARSEVW TLANRPIALE GRWQLDETDG TLAADSSDHG
LGATLHGDPL TAWNKALNDV TFAPGVSLNP AVPERITTEG PAIRTDRSYS VAAWVRLDEV
GHNSTAVSQD GVHHSGFYLG YQYTADSHQW MLKNPPSDTA GASGWHRARS DQHAEFGRWT
HLVGTYDHTS RTTTFYLDGV EQGTAEVPDA WHANGPVVIG GGRFEQQLSD AWAGDISDVF
LYQGVFDEHE ILAVREGIAP VPRL