Gene Ndas_0959 details


Gene Information

Locus tag: Ndas_0959
Symbol:
ID: 9244804
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1175007
End bp: 1177793
Gene Length: 2787 bp
Protein Length: 928 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: DNA polymerase I
Protein accession: YP_003678909
Protein GI: 297559935
COG category:
COG ID:
TIGRFAM ID:
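
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1175007
end_bp = 1177793

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")      # 2787 bp
print(protein_length, "aa")   # 928 aa
```

Both results agree with the Gene Length and Protein Length fields of the record.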


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.75162
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTGACCA AGCGAGAGAC CCCCGACCAG ACAACCCGGA GCGGCGACGG CGGCAGGCGC 
CCCCGGCTCC TCCTCCTGGA CGGCCACTCG ATGGCCTTCC GCGCGTTCTT CGCGCTCCCG
GTGGACAAGT TCGGCACGAG TACCGGGCAG TCGACCAACG CCGTGTACGG CTTCGCGTCG
ATGCTGGTCA AGCTGCTGCG CGACGAGGAG CCCACCCACG TCGCGGTGGC CTGGGACCTG
TCCGGCCCCA CCTTCCGGCA CGAGGAGTAC GCCGAGTACA AGGACGGCCG CTCCGACACC
CCGCCGGAGT TCCCCTCCCA GGTGCCGCTC ACCCAGGACC TGATGCGGCT GATCGGCGTC
GCCAACCTGT CCGCCCCGGG GTTCGAGGCC GACGACGTCA TCGCCACCCT GGCCCACCAG
GGCGGCGAGG CCGGGATGGA GGTGCTCATC GCCTCCGGCG ACCGCGACGC CTTCCAGCTG
GTCACCGACT CCTGCACGGT CCTGTACCCG GGCAAGAGCC TGTCGGACCT GCGGCGGATG
GACCCCGCGG CGGTCGAGGA CAAGTACGGG GTCACCCCCG AGCGCTACCG CGACCTGGCC
GCGCTGGTGG GGGAGAAGGC CGACAACCTG CCCGGTGTGC CGGGCGTGGG CCCCAAGACC
GCCGCCAAGT GGATCACCAA GTACGGGTCC CTGGACGAGC TGGTCGCCCA CGCCGACGAG
CTCACCGGCA AGGCGGGGCA GAGCTTCCGC GACCACCTGG ACGACGTCCT GCGCAACCAG
CGGCTCAACC GGTTGGCCAC GGACGTGGAG CTGGACGCGG AGGTCACCGC CCTGGCCCTG
GGCGAGGCCG ACCGCACCGG GATCGACGCG CTCTTCGACA ACCTGGAGTT CGCCTCCAAC
CTGCGCGAGC GCCTCTACGC CGTCGTCCGC CTGCCCGAGG ACGGCTCCGG GGAGACGGGG
ACCGAGGCGG TCGAGGGCTT CCGCGTCGAA CTGACCGTGG CCGGGACCGG CGAGCTGGCC
CCCTGGCTGG CCGCGCACGC CGCCCCCGAC GACGCGGCCT CCGACACCCT CACGCCCACC
GCCCCCGCCG GACTGGCCCT GGACGGCGCC TGGGGACAGG GCACCGGCCG GGTGGACGCG
CTGGCGATCA GCGTGCCCTC CGGTGACGCC GTGTTCGTCG ACCCCACCGC CCTGGACCCC
GCCGACACCG AGGCGCTGGC GGACTTCCTG GCCGACCCCC GCCGCCCCAA GGCCGTGCAC
GAGTACAAGG GCGCCCTGCT GGCCCTGGGC GCGCACGGGT GGGAGCTGGG CGGGGTGGTC
AGCGACACCG CCCTGGCCGC CTACCTGGTC CAGCCGGGGC AGCGCCGCTT CGACCTGGCC
GACCTGTGCC GCAAGTACCT GGGCCGGGAG CTGGAGGAGG ACTCCTCCGG CGACCAGCTC
ACCCTGGACC TGGGCGGCGA GGACGGGGCG GGCTCGGGCC GCCAGCACCT GCTGGCCGTG
CGCGCGAGCG CCACCCGCGA CCTGGCCGGG GTGCTCTCCG CCGAACTCGA CAAGCGCGGC
GGCACCCACC TGCTGCACGA GGTCGAGCTG CCGCTGGTGG ACGTGCTGGC GCGCCTGGAA
CGCGCGGGGA TCGCCGCCGA CCGCCCCTAC CTGGAGGAGC TCCAGGGCGA GTTCGCCGCG
GCGGGCCGGG TCGCGGTGGA GCGCGCGCAC GAGATCGTGG GCCGCGAGTT CAACCTCGGC
TCGCCCAAGC AGCTCCAGCA GGTGCTGTTC GAGGACCTGG GCCTGCCCAG GACCAAGAAG
ATCAAGACGG GCTACACCAC CGACGCCGAC GCCCTGGCGT GGCTGGCCTC CCAGACCGAC
AACGAGCTGC CCGCGGTCCT TCTCCACCAC CGCGACCAGA CCAAGCTGCG CACCACGGTC
GAGGGGCTGA TCAAGACCGT CGCCGACGAC GGCCGCATCC ACACCACCTT CAACCAGACG
GTGGCGGCGA CCGGGCGCCT GAGCTCCACC GACCCCAACC TCCAGAACAT CCCCGTGCGC
ACCGACGTGG GGCGGCGCAT CCGCCGCGCG TTCGTCGTGG GGGAGGGCTA CGAGGAGCTG
CTCACCGCCG ACTACAGCCA GATCGAGCTG CGCATCATGG CGCACCTGTC GGGCGAGCAG
GCGCTGATCG ACGCCTTCAA CAGCGGCTAC GACTTCCACG CGCAGATGGC CGCGCAGATC
TTCGACGTCG AGGTCGAGCA GGTGGACGGC GAGGCGCGGT CCAGGATCAA GGCCATGAGC
TACGGCCTGG CCTACGGGCT GAGCGCCTAC GGCCTCTCCC AGCAGCTGGG GATCACGCCG
GAGGAGTCCA AGCGCCTCAT GGAGGACTAC TTCGCCGAGT TCGGCGGGGT GCGCGACTAC
CTCAACGCCA TGGTCGAGGA GGCCCGCCGG GTCGGCTACA CCGAGACCAT CCTGGGGCGG
CGCCGCTACC TGCCCGACCT GACCAGCGAC AACCGCCAGC GCCGGGAGAT GGCCGAGCGG
ATGGCGCTCA ACGCGCCGAT CCAGGGGTCG GCCGCCGACA TCATCAAGGT GGCCATGCTC
CGGGTGGACG CGGCCCTCAC CGAGGGCGGG CTCACCTCGC GGGTCCTGCT CCAGGTGCAC
GACGAACTCG TGGTGGAGGT CGCTCCCGGC GAGCGCGCGG AGGTCGAAAA CATCGTGGCG
CGCGAGATGA GCTCCGCCTA TGATCTGCGT GTGCCGCTGG CCGTCTCGGT CGGCAGCGGC
CAGAACTGGC ACGACGCCGC GCACTGA
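
The GC content reported in the record can in principle be recomputed from the gene sequence. The sketch below shows one way to do so; `gc_content` is a hypothetical helper, not part of the database, and it is applied here only to the first row of the sequence, so its result need not equal the 72% reported for the whole gene.

```python
def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(dna.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence above (60 bases), spaces kept as displayed.
first_row = "GTGGTGACCA AGCGAGAGAC CCCCGACCAG ACAACCCGGA GCGGCGACGG CGGCAGGCGC"
print(f"{gc_content(first_row):.1f}%")
```

Passing the full 2787 bp sequence instead of `first_row` would give the gene-wide value.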
 
Protein sequence
MVTKRETPDQ TTRSGDGGRR PRLLLLDGHS MAFRAFFALP VDKFGTSTGQ STNAVYGFAS 
MLVKLLRDEE PTHVAVAWDL SGPTFRHEEY AEYKDGRSDT PPEFPSQVPL TQDLMRLIGV
ANLSAPGFEA DDVIATLAHQ GGEAGMEVLI ASGDRDAFQL VTDSCTVLYP GKSLSDLRRM
DPAAVEDKYG VTPERYRDLA ALVGEKADNL PGVPGVGPKT AAKWITKYGS LDELVAHADE
LTGKAGQSFR DHLDDVLRNQ RLNRLATDVE LDAEVTALAL GEADRTGIDA LFDNLEFASN
LRERLYAVVR LPEDGSGETG TEAVEGFRVE LTVAGTGELA PWLAAHAAPD DAASDTLTPT
APAGLALDGA WGQGTGRVDA LAISVPSGDA VFVDPTALDP ADTEALADFL ADPRRPKAVH
EYKGALLALG AHGWELGGVV SDTALAAYLV QPGQRRFDLA DLCRKYLGRE LEEDSSGDQL
TLDLGGEDGA GSGRQHLLAV RASATRDLAG VLSAELDKRG GTHLLHEVEL PLVDVLARLE
RAGIAADRPY LEELQGEFAA AGRVAVERAH EIVGREFNLG SPKQLQQVLF EDLGLPRTKK
IKTGYTTDAD ALAWLASQTD NELPAVLLHH RDQTKLRTTV EGLIKTVADD GRIHTTFNQT
VAATGRLSST DPNLQNIPVR TDVGRRIRRA FVVGEGYEEL LTADYSQIEL RIMAHLSGEQ
ALIDAFNSGY DFHAQMAAQI FDVEVEQVDG EARSRIKAMS YGLAYGLSAY GLSQQLGITP
EESKRLMEDY FAEFGGVRDY LNAMVEEARR VGYTETILGR RRYLPDLTSD NRQRREMAER
MALNAPIQGS AADIIKVAML RVDAALTEGG LTSRVLLQVH DELVVEVAPG ERAEVENIVA
REMSSAYDLR VPLAVSVGSG QNWHDAAH
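
The gene begins with GTG rather than ATG. Under translation table 11, GTG is a permitted start codon, and an initiator codon is read as methionine, which is why the protein starts with M. The sketch below illustrates this for the first ten codons only; the codon table is deliberately partial (just the codons in this excerpt) and `translate_prefix` is a hypothetical helper.

```python
# Partial codon table: only the codons that occur in the excerpt below.
CODONS = {
    "GTG": "V", "ACC": "T", "AAG": "K", "CGA": "R",
    "GAG": "E", "CCC": "P", "GAC": "D", "CAG": "Q",
}


def translate_prefix(dna: str) -> str:
    """Translate an excerpt that starts at the gene's first codon."""
    bases = "".join(dna.split()).upper()
    usable = len(bases) - len(bases) % 3
    codons = [bases[i:i + 3] for i in range(0, usable, 3)]
    if not codons:
        return ""
    # Assumption: the first codon is the initiator and is read as Met (M),
    # even though GTG encodes Val (V) at internal positions.
    return "M" + "".join(CODONS[codon] for codon in codons[1:])


first_30_bases = "GTGGTGACCA AGCGAGAGAC CCCCGACCAG"
print(translate_prefix(first_30_bases))  # MVTKRETPDQ
```

The output matches the first block of the protein sequence above. Note that the second codon is also GTG but is translated as V, since only the initiator position is read as M.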