Gene Ndas_4953 details


Gene Information

Locus tag: Ndas_4953
Symbol:
ID: 9248841
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 92677
End bp: 95517
Gene Length: 2841 bp
Protein Length: 946 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: DNA topoisomerase I
Protein accession: YP_003682841
Protein GI: 297563868
COG category:
COG ID:
TIGRFAM ID:
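
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Start bp, End bp, Gene Length and Protein Length fields.
start_bp = 92677
end_bp = 95517
gene_length_bp = 2841
protein_length_aa = 946


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both comparisons hold for this record: 95517 - 92677 + 1 = 2841,
# and 2841 / 3 - 1 = 946.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(coding_codons(gene_length_bp) == protein_length_aa)
```

Under those assumptions the reported gene length and protein length agree with the coordinates.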


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.47647
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCACCCA CGAAGGGCAG CGCCGGCAAG AACGGCTCGA AGGACAGCGG ACGACAGGGG 
GCGGGCACCG CCCTCGTCAT CGTCGAGTCA CCGGCCAAGG CGAAGACCAT CGCCGGTTAT
CTGGGGCGCG GCTACGTCGT GGAGTCCAGC ATCGGCCACA TCCGCGACAT GCCCAACAAG
GCCGCCGAGA TCCCCGCCAA GTACAAGGGG CAGGCGTGGG CCCGGCTCGG GGTGGACGTC
GACGGCGACT TCGAGCCGCT CTACGTGGTC AACACCGACA AGAAGGCGCA CGTCAAGAAG
CTCAAGGAGC TCATGGCCGA CGCCGACGAA CTCCTCCTCG CGACAGATGA GGACCGCGAG
GGCGAGGCGA TCGCCTGGCA CCTGCTGGAG GAGCTCAAGC CCCGCATCCC CGTCCGCCGC
ATGGTGTTCA ACGAGATCAC CAAGGACGCC ATCCAGCGCG CGGCCCACAA CACCCGCGAC
CTGAACACCC GCCTGGTCAA CGCACAGGAG ACCCGGCGCA TCCTGGACCG CCTCTACGGC
TACGAGGTCT CCCCCGTGCT GTGGAAGAAG GTCATGCCCA AGCTCTCGGC GGGCCGCGTG
CAGTCCGTGG CCACCCGCCT GGTGGTCGAG CGCGAACGCG AGCGCATGGC GTTCACCTCC
GCCGAGTACT GGGACCTCAA GGCCCTCTTC GACGCCACCG AGGCCGGTGT CGCCGAAGGC
CCCGCCACCT TCCCCGCCGC CCTCGTCTCC GTGGACGGCA CCCGCGTCGC CGCGGGCCGC
GACTTCACCC CGCAGGGCAC GCTGCGCTCG GACCGCCCCG TCCGCCAGCT GGACGAGGCC
GCGGCGCGCG GCCTCGCCGA GCGGCTGTCC GGCGCCGGGT TCTCGGTCTC CTCGGTGGAG
CGCAAGCCCT ACCGCCGCTC GCCCTACGCG CCGTTCCGCA CCACCACCCT CCAGCAGGAG
GCGTCCCGCA AGCTCGGCCT GTCGGCCAAG CAGACCATGC AGGTCGCCCA GCGGCTGTAC
GAGAACGGCT ACATCACCTA CATGCGCACC GACAGCACCA CGCTGTCGGA CAGCGCGGTC
AAGGCCGCCC GCTCCCAGGT GCGCAGCCTC TACGGCGGCG ACTACCTGCC CGACAAGCCG
CGCGTGTACG CCAAGAAGGT CAAGAACGCC CAGGAGGCGC ACGAGGCGAT CCGCCCGTCC
GGCGACACCT TCCGCACGCC CGCCCAGACC GGCCTGTCCG GCCCCGAGTT CCGGCTCTAC
GAGCTGGTCT GGAAGCGCAC CGTCGCCTCC CAGATGAAGG ACGCGGTCGG CGAGTCCGTC
ACCGTGCGCA TCGAGGGCAC CTCCTCCGAC GGCGAGGTCG CCGAGTTCAG CGCGACCGGC
AAGGTCATCA CCTTCCACGG GTTCCTCAAG GCCTACGTGG AGGGCTCCGA CGACCCGGCC
GCCGACCTGG ACGACCGCGA GCGCCGCCTG CCCGCGATGT CGGAGGGCGA CCCGCTCAAG
GCCCAGAACC TGGAGGCCGA GGGGCATAGC ACCCGGCCGC CCGCGCGCTA CACCGAGGCC
ACCCTGGTCA AGGAGCTGGA GGAGCGCGAG ATCGGCCGCC CCTCCACCTA CGCCTCGATC
ATCGGCACCA TCCTGGACCG CGGCTACGTG TTCAAGAAGG GCACGGCGCT GGTGCCGTCC
TTCCTGGCCT TCGCCGTGGT GCAGTTGCTG GAGCGCCACT TCGGCAACCT GGTGGACTAC
GAGTTCACCG CGCGCCTGGA GGACGTGCTC GACGACATCG CCCGGGGGGA GGCCGAGAGC
CTGCCCTGGC TGCGCCGCTT CTACTTCGGC GGCGAGGACA CCGAGGGCCA GCGGGAGACC
GGGCTCAAGG AGCTGGTCAA CGACAACCTC TCCGACATCG ACCCCAAGGA GATCAGCTCC
CTGCCGCTGC CCGGCACCGA CATCGTGCTG CGGGTGGGGC GCTACGGCCC CTACCTGGAC
CGGGACGGCG TGCGGGTGAA CGTGCCCGAG GACCTGGCCC CGGACGAGTT GACCGTGGAC
AAGGCCGAGG AGCTGTTCGC CCAGCCCAGC GGCGACCGGG AGCTGGGCAC CGACCCCGAG
ACCGGCCGCG TCATCGTGGC CAAGACCGGG CGGTTCGGCC CCTACGTCAC CGAGGTGATC
GAGGAGCCGC AGGAGGCCGA GGAGGGCAAG GCCAAGACCA GGGCCAAGGC CAAGGTCAAG
CCGCGCACCA GCTCGCTGCT CAAGTCGATG AGCCTGGACA CCGTCACCCT GGAGGACGCG
CTGCGCCTGC TGTCGCTGCC GCGCGTGGTC GGCCAGATCG ACGGCGAGGA CGTCACCGCG
CAGAACGGCC GGTTCGGCCC CTACATCAAG AAGGGCACCG ACAGCCGCTC CCTGGAGACC
GAGGAGCAGA TGTTCACGGT CACACTGGAG CAGGCCGCGG CGCTGTTCGC CCAGCCCAAG
CAGCGCGGGC GCCGCGCCGC CGCCCCGCCG CTGCGCGAGC TGGGGGAGGA CCCGGCCTCG
GGCGCCAAGA TGGTGATCAA GGACGGCCGG TTCGGCCCCT ACGTCACCGA CGGGGAGGTC
AACGCCTCCC TGCGCAAGGG CGACGAGGTG GAGTCGATCA CCGACCAGCG GGCCGCCGAG
CTGCTGGCCG AGCGCCGCGC CAAGGCGCCC GCCAAGAAGC CCGCGGCCAA GAAGCCCGCC
GCGAAGAAGG CGCCCGCCAA GAAGACCTCC ACGACGGCCA GGAAGACGGG CACGGCGGCC
AAGAAGGCGT CCACCTCCAC GACGGCCAGG AAGTCACCGG CCAAGGGGAC CGGCGCCTCC
GACCCCGCCC CGGAGGAGTA G
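
The gene sequence is printed in space-separated blocks of ten bases, so it needs cleaning before any computation. As a minimal sketch, the helpers below (hypothetical names `clean_sequence` and `gc_fraction`) strip the whitespace and compute the GC fraction. The example uses only the first line of the sequence, so its result is not expected to match the whole-gene GC content reported in the record; paste the full sequence to check that figure.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence only (60 bases), copied from the record.
first_line = (
    "GTGCCACCCA CGAAGGGCAG CGCCGGCAAG AACGGCTCGA AGGACAGCGG ACGACAGGGG"
)
seq = clean_sequence(first_line)

print(len(seq))                    # number of bases after cleaning
print(f"{gc_fraction(seq):.0%}")   # GC content of this fragment
```

The sequence opens with GTG, which translation table 11 allows as an alternative start codon read as methionine at initiation; this is consistent with the protein sequence beginning with M.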
 
Protein sequence
MPPTKGSAGK NGSKDSGRQG AGTALVIVES PAKAKTIAGY LGRGYVVESS IGHIRDMPNK 
AAEIPAKYKG QAWARLGVDV DGDFEPLYVV NTDKKAHVKK LKELMADADE LLLATDEDRE
GEAIAWHLLE ELKPRIPVRR MVFNEITKDA IQRAAHNTRD LNTRLVNAQE TRRILDRLYG
YEVSPVLWKK VMPKLSAGRV QSVATRLVVE RERERMAFTS AEYWDLKALF DATEAGVAEG
PATFPAALVS VDGTRVAAGR DFTPQGTLRS DRPVRQLDEA AARGLAERLS GAGFSVSSVE
RKPYRRSPYA PFRTTTLQQE ASRKLGLSAK QTMQVAQRLY ENGYITYMRT DSTTLSDSAV
KAARSQVRSL YGGDYLPDKP RVYAKKVKNA QEAHEAIRPS GDTFRTPAQT GLSGPEFRLY
ELVWKRTVAS QMKDAVGESV TVRIEGTSSD GEVAEFSATG KVITFHGFLK AYVEGSDDPA
ADLDDRERRL PAMSEGDPLK AQNLEAEGHS TRPPARYTEA TLVKELEERE IGRPSTYASI
IGTILDRGYV FKKGTALVPS FLAFAVVQLL ERHFGNLVDY EFTARLEDVL DDIARGEAES
LPWLRRFYFG GEDTEGQRET GLKELVNDNL SDIDPKEISS LPLPGTDIVL RVGRYGPYLD
RDGVRVNVPE DLAPDELTVD KAEELFAQPS GDRELGTDPE TGRVIVAKTG RFGPYVTEVI
EEPQEAEEGK AKTRAKAKVK PRTSSLLKSM SLDTVTLEDA LRLLSLPRVV GQIDGEDVTA
QNGRFGPYIK KGTDSRSLET EEQMFTVTLE QAAALFAQPK QRGRRAAAPP LRELGEDPAS
GAKMVIKDGR FGPYVTDGEV NASLRKGDEV ESITDQRAAE LLAERRAKAP AKKPAAKKPA
AKKAPAKKTS TTARKTGTAA KKASTSTTAR KSPAKGTGAS DPAPEE