Gene Ndas_3307 details

Gene Information

Locus tag: Ndas_3307
Symbol:
ID: 9247169
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 3948983
End bp: 3951088
Gene Length: 2106 bp
Protein Length: 701 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: DNA topoisomerase (ATP-hydrolyzing)
Protein accession: YP_003681219
Protein GI: 297562245
COG category:
COG ID:
TIGRFAM ID:
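
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3948983
end_bp = 3951088
gene_length_bp = 2106
protein_length_aa = 701

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A complete CDS should contain a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is the stop codon and is not translated.
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

With the values listed in this record, all three checks agree: 2106 bp corresponds to 702 codons, that is, 701 amino acids plus one stop codon.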


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGCCT TGACCGCAGT CGTCCACGAC CCCGAGGATT ACTCCGCCCG ACACCTGTCG 
GTCCTCGAAG GTCTTGAGGC GGTGCGCAAA CGGCCGGGCA TGTACATCGG GTCCACCGAC
AGCCGCGGCC TCACCCACTG CATGTGGGAG ATCATCGACA ACTCCGTGGA CGAGGCCCTG
GGCGGCTTCT GCACCCGGAT CGACGTCACA CTGCACTCCG ACGGGTCGGT GTCGGTGAGC
GACGACGGAC GCGGAATCCC CGTGGACGCC GAGCCCAAGT CCGGCCTGTC CGGTGTGGAA
CTGGTCATGA CCAAGCTCCA CGCGGGCGGC AAGTTCGGCT CCGGCTCCTA CACCGCCTCC
GGCGGCCTGC ACGGCGTCGG CGCCTCGGTG GTCAACGCCC TGTCGTCCCG CGTGGACGTC
GAGGTCGACC GCCAGGGCCG CACCCACGCG ATCAGCTTCC GCCGGGGCAT CCCCGGCCGC
TTCGCCGGTT CCGGCCCCGA CGCCGAGTTC ACGCCGGTCT CGGGCCTGGA GCAGGTCCGC
AAGGTGGCCC AGAAGATCAC CGGCACCCGC ATCCGGTTCT GGCCGGACAT GCAGATCTTC
CTCAAGGACG CCGACATCGC CCGCGAGGCG CTGCTGGACC GGGCCCGCCA GACCGCGTTC
CTGGTGCCGG GCCTGACCCT CTCCGTGCGC GACGAGCGCG TGGAGGGGGA GCCCGCCCAC
GAGGAGGAGT TCCGCTTCGA CGGCGGCATC GGCGAGTTCT GCACCTTCCT GGCCCCCGAC
GAGCCGGTCA GCGACGTGCT GCGCATCCAG GGCAGCGGCA ACTTCACCGA GACCGTCCCG
GTCCTGGACG ACGCGGGGCA CATGGTGCCC GCCGACGTCG AGCGCCGCCT GGACGTGGAC
GTCGCCCTGC GCTGGGGGAC GGGCTACGAC ACCACCGTGC GCTCCTTCGT CAACGTCATC
GCCACCCCCA AGGGCGGCAC CCACATCAAC GGCTTCGAAC GGGCCCTGGT GCGGGTGATC
AACGAGCAGC TGCGCACCAC CAAGCTGCTG AGGAACAGCG ACGACCCCGT CACCAAGGAC
GACACCCAGG AGGGGCTCAC CGCGGTCGTC ACCGTCCGCC TGCCCGAGCC CCAGTTCGAG
GGCCAGACCA AGGAGATCCT CGGCACCTCC GCGGCGACCA GGATCGTCTC CCAGGTGGTC
GGCAAGGAGC TGCGCGAGTT CCTGACCTCC ACCAAGAAGG TGGAGAAGGC CAGGGCGCGC
AGCGTCCTGG AGAAGGTGGT CGGGGCCGCC AAGGCCCGCC TGGCCGCGCG CCAGCAGCGC
GAGACCCAGC GGCGCAAGAC CGCCCTGGAG AACTCGGCGC TGCCCGCCAA GCTGGTGGAC
TGCCGCAGCG AGGGCCTGGA GCACAGCGAG CTGTTCATCG TCGAGGGGGA CTCGGCGCTG
GGCACCGCCA AGCTGGCCCG CGACTCGGAG TTCCAGGCGC TGCTGCCGAT CCGGGGCAAG
ATCCTCAACG TGCAGAAGTC CTCGGTCGCC GACATGCTCA AGAACGCCGA GTGCGCGGCC
ATCCTCCAGG TGGTCGGCGC CGGTTCGGGG CGCACCTTCG ACATCGACGC GGCCCGCTAC
GGCCGGATCA TCCTCATGGC CGACGCCGAC GTGGACGGCG CGCACATCCG CTGCCTGCTG
CTCACGCTCA TCTACCGCTA CATGCGGCCG ATGCTGGAGG CCGGACGGGT GTACGCGGCC
GTGCCGCCGC TGCACCGCAT CGAGCTGACC AACACCCGGC GCAAGCGCGG CAGCAAGCCC
GAGGACCGCT ACGTCTACAC CTACTCCGAC GCCGAGCTCT CGCGCACGCT GCTGGACCTG
GAGAAGCGCA AGATCTCCTG GAAGGAGCCG GTGCAGCGCT ACAAGGGCCT GGGTGAGATG
GACGCCGACC AGCTGGCCGA GACCACCATG GACCCGCGTT ACCGCACCCT GCGCCGGATC
CGTGTGGAGC AGGCCGAGGA GGCCTCGTCC GTGTTCAACC TGCTGATGGG CAACGAGGTC
GCCCCGCGCC GCCGCTTCAT CCAGCAGGGC GCCCAGGAGC TGGACGTGGC CCGCATCGAC
ACCTGA
 
Protein sequence
MTALTAVVHD PEDYSARHLS VLEGLEAVRK RPGMYIGSTD SRGLTHCMWE IIDNSVDEAL 
GGFCTRIDVT LHSDGSVSVS DDGRGIPVDA EPKSGLSGVE LVMTKLHAGG KFGSGSYTAS
GGLHGVGASV VNALSSRVDV EVDRQGRTHA ISFRRGIPGR FAGSGPDAEF TPVSGLEQVR
KVAQKITGTR IRFWPDMQIF LKDADIAREA LLDRARQTAF LVPGLTLSVR DERVEGEPAH
EEEFRFDGGI GEFCTFLAPD EPVSDVLRIQ GSGNFTETVP VLDDAGHMVP ADVERRLDVD
VALRWGTGYD TTVRSFVNVI ATPKGGTHIN GFERALVRVI NEQLRTTKLL RNSDDPVTKD
DTQEGLTAVV TVRLPEPQFE GQTKEILGTS AATRIVSQVV GKELREFLTS TKKVEKARAR
SVLEKVVGAA KARLAARQQR ETQRRKTALE NSALPAKLVD CRSEGLEHSE LFIVEGDSAL
GTAKLARDSE FQALLPIRGK ILNVQKSSVA DMLKNAECAA ILQVVGAGSG RTFDIDAARY
GRIILMADAD VDGAHIRCLL LTLIYRYMRP MLEAGRVYAA VPPLHRIELT NTRRKRGSKP
EDRYVYTYSD AELSRTLLDL EKRKISWKEP VQRYKGLGEM DADQLAETTM DPRYRTLRRI
RVEQAEEASS VFNLLMGNEV APRRRFIQQG AQELDVARID T
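
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the codon assignments of translation table 11, which match the standard genetic code, and it treats an alternative initiation codon such as GTG as methionine when it is the first codon. The helper names (`clean`, `translate`, `gc_content`) and the `ALT_STARTS` set are hypothetical and chosen for illustration. `ALT_STARTS` lists only a few commonly used initiators, not the full set defined for table 11. Only the first line of the gene sequence is used as sample input.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the initiation codons used in bacteria; an
# initiation codon is translated as M regardless of its usual meaning.
ALT_STARTS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in ALT_STARTS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence from this record.
first_line = "GTGACCGCCT TGACCGCAGT CGTCCACGAC CCCGAGGATT ACTCCGCCCG ACACCTGTCG"
print(translate(first_line))  # MTALTAVVHDPEDYSARHLS
```

The output corresponds to the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` would give a value that can be compared with the GC content field of the record.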