Gene Ndas_3303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3303 
Symbol 
ID9247165 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3944042 
End bp3946528 
Gene Length2487 bp 
Protein Length828 aa 
Translation table11 
GC content71% 
IMG OID 
ProductDNA topoisomerase (ATP-hydrolyzing) 
Protein accessionYP_003681215 
Protein GI297562241 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.698383 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCGTA CTACTTCCCG TCCCCCCGAG GATCTCGCCG AGAAGATCAT CGACATCGAC 
GTCTCCGAGG AGATGCGCGG CAGCTTCCTG GAGTACGCCT ACTCGGTCAT CTACCAACGC
GCGCTACCGG ACGCGCGCGA CGGTATGAAA CCGGTGCAGC GGCGCATCCT CTACCAGATG
AACGAGATGG GGCTGCGGCA CGACCGCGGC CACGTCAAGT GCGCCCGCGT CGTCGGGGAC
GTGATGGGCC GCCTCCACCC CCACGGCGAC ACCGCCATCT ACGACGCGCT CGTGCGCCTG
AGCCAGCCCT TCGCCATGCG CGTGCCCCTG GTGGACGGCC ACGGCAACTT CGGCTCCCTG
GGCGGCGACG ACGCGCCGGC CGCCATGCGC TACACCGAGG CGCGCCTGGA CCGCGCCGCC
GAGCAGCTGG TCGCCTCCAT CGACGAGAAC GTCGTCGACT TCCGGCCCAA CTACGACGGC
CAGGAGACCG AGCCCGAGGT CCTGCCCGCC GCCTTCCCGA ACCTGCTGGT CAACGGCGCC
TCCGGCATCG CGGTGGGCAT GGCCACCAAC ATGGCCCCGC ACAACCTGGG CGAGGTCGTC
GCGGCCGCCC GCCACCTGAT CGCCCACCCC GAGGCCACCC TGGAGGAGCT CACCGAGTTC
GTCCCGGGCC CGGACCTGCC CACCGGCGGC ACCATCGTCG GCCTGGACGG CATCCGCGAC
GCCTACCGCA GCGGCCGGGG CACGTTCAAG ACCCGCGCCA CCGTCTCCAT CGAGAAGGTG
TCCGCGCGCC GCACCGGCCT GGTGGTCACC GAGCTGCCCT ACAGCGTGGG CCCGGAGAAG
GTCATCAGCC GGATCAAGGA GCTGGTGCAG TCCAAGAAGC TCCAGGGCAT CTCGGACCTG
AAGGACCTGA CCGACCGCAC GCAGGGCCTG CGCCTGGTCA TCGAGCTCAA GAACGGCTTC
AACCCCGAGG CGGTCCTGGA GGAGCTGTAC CGGCTCACGC CGATGGAGGA GTCCTTCGGC
ATCAACAACG TGGCCCTCGT GGACGGCCAG CCCCAGACGC TGGGACTGCG CGAACTCCTC
GAGGTCTACG TCAACCACCG CCTCGACGTG GTGCGCCGCC GCAGCGAGTT CCGCCGAGCC
AAGCGCCAGG AGCGCCTGCA CCTGGTGGAC GGCCTGCTCG TGGCCCTGCT GGACATCGAC
CGGGTCATCT CCCTGATCCG CGAGGCCGAG GACACCGCCG CCGCCCGCGA GTCGCTCATG
GGCGCCTACG ACCTGTCGGA GATCCAGGCC CGCTACATCC TGGAGACCCC GCTGCGGCGC
CTGACCCGCT TCGACCGGAT GGAGCTGGAG ACCGAGCGGG ACAAGCTCAA CAAGGAGGTC
GACGCCCTCA CCGCGGTTCT GGAGTCGGAC AAGAAGCTGC GCCGCCTGGT CTCCAAGGAG
ATGGGCGACG TCGCCAAGGA GTTCGCCACC CCGCGTCGCA CGCAGCTGCG CGAGAGCGAC
GGCGTGGCCC GCAGCGCGGT GATCCCGCTG GAGATGGCCG ACCAGCCCTG CCACGTGCTG
ATGGGCGTGC ACGGCGCGAT CGGCCGCAGC AAGGGACCCT CGGTGCCGCG CGTGTACGAG
GGCCTGCGCA CCCGGCACGA CGCGGTCCAC GTGTCCGTCC CCACCACCTC GCGCTCCGCC
GTGGGCCTGG TCACCGACCT GGGCCGGATG ATCCGCGTGC CGGTGGTGGA GCTGCCGGAG
CTGCCCGACG AGGGCGGGGA CAGCCCGCCG CTGGCCTCGG GGCTGGACGT GTCCGAGTTC
GTCCAGCTCG ACGGGGACGA ACGGGTGGTG TCGGTGGTGT CCATGGACGC CTCGGGCCCC
GGCTTCGCCC TGGGCACGCG CGGCGGTGTG GTCAAGCGGG TCTCCCCCGA CTACCCGCCC
AACAAGGACG ACTTCGAGGT CGTGGCGCTC AAGGACGACG ACCGCGTCGT GGGCGTCACC
CAGCTGTCCA CGGGCGAGGA GGACCTGGCC TTCATCACCT CCGAGGCCCA GCTGCTGCGC
TTCTCCGCGA GCGCCGTGCG GCCCCAGGGG CGGCCCGCGG GCGGTGTGGC GGGCGTGCGC
CTGTCGGAGG GGGCGCGGGT GCTGTGGTTC GGCGCCGTGC CCAGTCCGCA GGACTCGGTG
GTGGTCACCG TCTCGGGCTC CTCCGGCGCC CTGGAGGGCA CCCAGGCCGG TTCGGCCAAG
GTCACGCCCT TCGAGGCCTA CCCGGTCAAG GGCCGCGCCA CCGGCGGCGT GCGCTGCCAC
CGCTACCTCA AGGGCGAGGA CACCCTGCTG TTCGGATGGA TCGGCCGTTC CCCCGCCCGC
GCCGCGCGCG CGGACGGCAA GCCGGTCCGC CTGCCCGACC CGATCGACCG TCGGGACGGG
TCCGGTGACG CGCTGCCCCG CGCGATCGTT ACTGTAGGTA GCGGACAGAC CGACTCCGGG
ATCACCGTTC CCGGAGCGAA AGCCTGA
 
Protein sequence
MARTTSRPPE DLAEKIIDID VSEEMRGSFL EYAYSVIYQR ALPDARDGMK PVQRRILYQM 
NEMGLRHDRG HVKCARVVGD VMGRLHPHGD TAIYDALVRL SQPFAMRVPL VDGHGNFGSL
GGDDAPAAMR YTEARLDRAA EQLVASIDEN VVDFRPNYDG QETEPEVLPA AFPNLLVNGA
SGIAVGMATN MAPHNLGEVV AAARHLIAHP EATLEELTEF VPGPDLPTGG TIVGLDGIRD
AYRSGRGTFK TRATVSIEKV SARRTGLVVT ELPYSVGPEK VISRIKELVQ SKKLQGISDL
KDLTDRTQGL RLVIELKNGF NPEAVLEELY RLTPMEESFG INNVALVDGQ PQTLGLRELL
EVYVNHRLDV VRRRSEFRRA KRQERLHLVD GLLVALLDID RVISLIREAE DTAAARESLM
GAYDLSEIQA RYILETPLRR LTRFDRMELE TERDKLNKEV DALTAVLESD KKLRRLVSKE
MGDVAKEFAT PRRTQLRESD GVARSAVIPL EMADQPCHVL MGVHGAIGRS KGPSVPRVYE
GLRTRHDAVH VSVPTTSRSA VGLVTDLGRM IRVPVVELPE LPDEGGDSPP LASGLDVSEF
VQLDGDERVV SVVSMDASGP GFALGTRGGV VKRVSPDYPP NKDDFEVVAL KDDDRVVGVT
QLSTGEEDLA FITSEAQLLR FSASAVRPQG RPAGGVAGVR LSEGARVLWF GAVPSPQDSV
VVTVSGSSGA LEGTQAGSAK VTPFEAYPVK GRATGGVRCH RYLKGEDTLL FGWIGRSPAR
AARADGKPVR LPDPIDRRDG SGDALPRAIV TVGSGQTDSG ITVPGAKA