Gene Ndas_3822 details


Gene Information

Locus tag: Ndas_3822
Symbol:
ID: 9247693
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4585882
End bp: 4589064
Gene Length: 3183 bp
Protein Length: 1060 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: UvrD/REP helicase
Protein accession: YP_003681725
Protein GI: 297562751
COG category:
COG ID:
TIGRFAM ID:
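
The coordinate and length fields above agree with each other, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, which is consistent with the listed gene length but is not stated in the record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4585882, 4589064)
print(length)                  # 3183
print(protein_length(length))  # 1060
```

The 3183 bp span holds 1061 codons. Subtracting the terminal stop codon gives the 1060 aa listed as the protein length.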


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0180671
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00809927
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGCGGCGCG CCCACCAGGT GCAGCCGCCG CCGGTGCTGG ACGAGAACCA GCGGCGCGTG 
GTCGAGCACG AGGGCGGGCC CCTGCTGGTG CTCGCCGGTC CCGGGACAGG CAAGACCACC
ACGATCGTCG AGTCGGTCGT GGACCGCATC GACCACCGGG GCACGGACCC CTCGCGCGTG
CTGGTGCTCA CCTTCAGCCG CAAGGCCGCC CAGGAGCTGC GCGAGCGCAT CACCGCCCGG
CTGCGCCGCA CCACCCGCGA ACCGCTGGCC CTGACCTTCC ACAGCTACGC CTACGCCCTG
ATCCGGCGCG AGTTCCAGCG CATGGGCGAC CTGCCGCCGC GCCTGCTCTC GGGCCCCGAG
CAGCTCATGG AGGTCCGCGA ACTCCTGCGG GGCGAGGCCC TGGACGGCGC CGCGGACTGG
CCCGAGCGCC TGCGCCCCGC CCTGGAGACC CGCGGGTTCG CCGAGGAGCT GCGCGACTTC
CTCATGCGCG CCCAGGAACG CGGCATGGGC CCGGACGAGC TGGCCGCGCT CGGCCGCGAC
CGCGACCGCG ACGACTGGGT GGCCGCGGCC GGGTTCCTGG ACCGCTACAC CGGCCGGTTC
GACATCGCCC CCGTGCCCAC GCTCAACTAC GCCGAACTCG TGCGCGTCGC CGCCAACCTG
CTCTCCGACC CCGGGGTCCG CGAGCGCGAG CGCGCCGCCC ACGAGGCGGT GTTCGTCGAC
GAGTACCAGG ACACCGACCC CGCCCAGGAG GAACTCCTGC GCGCCCTGGC CGGGGACGGC
CGCGACCTGG TCGCGGTCGG CGACCCCGAC CAGTCCATCT ACGCCTTCCG CGGGGCCGAG
GTGCGGGGCA TCCTCGACTT CCCGCGCCGC TTCCCGACCG CGCGGCGCAC CGAGGCGCCC
GTGGTCGCGC TGCGCACCTG CCGCCGCAGC GGCCGCGCCC TGCTGTCGGC CTCGCGCGGC
CTGACCCGGC GCCTGCCCGC GGTCGCCTCC GCCCGCGGCC ACGTCAACGA GCACCGCGAC
CTGACCCCCG CCGAGGGCGT GCCCGACGGC GAGGTCCGCG TGCTGCTGGC CGACAGCGCC
GCCCAGGAGG CCGCGGTCAT CGCCGACACC CTGCGCCGCG CCCACCTGGT GGACGGCGTC
CCGTGGTCGG ACATGGCCGT GCTGGTGCGC TCCTCCACCC GCCAGCTGCC GGTGCTGCGC
CGCGCCCTGA CCGCCGCCTC GGTCCCGGTC GCGGTCGGCG CCGACGACCT GCCCGTGGCC
GCCGAGCCCA TCGTGCGCCC CATGCTCGCG CTGATCCGCT ACGGGCTGGC CCCGGCCGAG
CTGGACGAGG ACGCCGCGCG CGAACTGCTC ACCAGCCCCT TCGGCGAGGC CGACACGGTA
CGGCTGCGCA GGCTCGTGCG CGCGCTGCGC CGCCTGGACC TGGACCGCGC CCACGACGGC
GGCGCCCACG GCGGAGCCCC GGAAGGGGAC GCCTCGCGGG AAGGCGCCCG GGAGGAGAGG
AGCGGCGCCT ACCGGCCCTC CGCGCAGCTG CTCGTGGACG CCCTGCGCGA CCCGGCCGAG
CTCACCCTCG TCGACCCCGA GATCGCCGCG CCCGCCACCC GTGTCGCCCA GGCGCTGCGG
ACCGTCCGCG ACCTGAACGC CAGGGGGGCC GACGCCGAAC AGGTCCTGTG GGAGCTGTGG
CGCGACAGCG GCCTGGCCGA CCGCCTGCTG CGCGCCAGCC TGGCCGGGGG CCGCCGGGGC
GCCGCCGCCG ACCGCGACCT GGACGCCGTG GTCGCCCTGT TCGAGAGCGC CGCCCGCTAC
TGCGACCGGC TGCCCCCCGG CAGTCCCGCG GGGTTCCTGG AGGACCTCGC CGCCCAGGAG
ATCCCCGGCG ACAGCCTCGC CGAGCGCGCG CCCGAGGGCG AGGCGGTGCG CATCCTCACC
GCCCACCGCT CCAAGGGCCT GGAGTGGGGC CTGGTCGTGG TCGCCGGAGC CCAGGAGGGC
GACTGGCCCG ACCTGCGCCT GCGCGGTTCC CTGCTCGGCG TGGAGGAACT CGTCGACACC
GACGTGCACT CCGCCGACTC CTCCGGAGCT GCCCTGGCCT CCAAACTCCT GGACGAGGAG
CGCCGCCTGT TCTACGTGGC CCTCACCCGC GCCCGCCGCA CCCTGGTGGT GACCGCCGTG
GGCGGGGAGG ACACCGACGA GCGGCCCTCG CGCTTCCTGA ACGAGATGGG CGTGGGCGAG
CCCGAGCCGG TCAGCACCGG CCTGCGCTGG CTGTCCCTGC CCTCGCTCGT CGCCGACCTG
CGCTCGGTGG TCACCGACCC CGGCGCCCCC GAACCGCTGC GCGAGGCCGC CGCCACCCAC
CTGGCCCGCC TGGCCGAGGA GGGCGTGCGC GGCGCCGACC CCGCGGAGTG GTACGCGCTC
ACCCCCTTCA CCGACGACCG GCCCCTGGCC GAGGAGGAGG ACACCATCCG CGTCTCGCCC
TCCCAGGTGG AGAGCTTCAC CAACTGCCAG CTGCGCTGGC TCCTCGAACG CGCCGCCGGG
GCCTCCTCCG GCGACTCGGC CTCGGCCCTG GGCACCGTCG TGCACGCGGT CGCCGTCCTG
GTCGCCCAGG GCAGCACCCC CGACGAGATC AGCGGGCGCA TGGACGAGAT CTGGGCCGAC
CTGGACTTCG GCGGCCCCTG GCAGGCCAGG GCGCAGCGCG ACCGCGCCGA CACGATGGTC
CGCAAGCTCG TGGACTGGGA GGCCGCCAAC GACCGCGAAC TCGTCGTCAC CGAGGAGGGC
TTCCGGGTGG ACGTGGGCGG CATCGAGATC ACCGGCCGCG TGGACCGCCT CGAACGCGAC
GACCAGGGCA GGGCCGTGGT CGTGGACATC AAGACCGGGA AGAACAAGGC CGACGACCTG
GCCCGCCACC CCCAGCTCGG CGTCTACCAG ATGGCCGTGC TCAAGGGCGC CTTCGCCAAG
CTCGGCCTCA CCGAGCCGGG CGGCGCCGCG CTCGTGCAGG TCGGCGAGAA GATCCAGAGG
GCCCGGGAAC AACCCCAGCC GCCCCTGAGC GAGGACCCCG ACCCCGGATG GGCCGGGACG
CTGGTGCGCG AGGTCGCCAC CGGCATGGGC GGATCCCGAT TCACCGCGAC ACGCAACAAG
GGATGCAGGT CGTGCGCCGT GCGCGCCTGC TGCCCGGTTC AGGACGAGGG CAGACATGTC
TGA
 
Protein sequence
MRRAHQVQPP PVLDENQRRV VEHEGGPLLV LAGPGTGKTT TIVESVVDRI DHRGTDPSRV 
LVLTFSRKAA QELRERITAR LRRTTREPLA LTFHSYAYAL IRREFQRMGD LPPRLLSGPE
QLMEVRELLR GEALDGAADW PERLRPALET RGFAEELRDF LMRAQERGMG PDELAALGRD
RDRDDWVAAA GFLDRYTGRF DIAPVPTLNY AELVRVAANL LSDPGVRERE RAAHEAVFVD
EYQDTDPAQE ELLRALAGDG RDLVAVGDPD QSIYAFRGAE VRGILDFPRR FPTARRTEAP
VVALRTCRRS GRALLSASRG LTRRLPAVAS ARGHVNEHRD LTPAEGVPDG EVRVLLADSA
AQEAAVIADT LRRAHLVDGV PWSDMAVLVR SSTRQLPVLR RALTAASVPV AVGADDLPVA
AEPIVRPMLA LIRYGLAPAE LDEDAARELL TSPFGEADTV RLRRLVRALR RLDLDRAHDG
GAHGGAPEGD ASREGAREER SGAYRPSAQL LVDALRDPAE LTLVDPEIAA PATRVAQALR
TVRDLNARGA DAEQVLWELW RDSGLADRLL RASLAGGRRG AAADRDLDAV VALFESAARY
CDRLPPGSPA GFLEDLAAQE IPGDSLAERA PEGEAVRILT AHRSKGLEWG LVVVAGAQEG
DWPDLRLRGS LLGVEELVDT DVHSADSSGA ALASKLLDEE RRLFYVALTR ARRTLVVTAV
GGEDTDERPS RFLNEMGVGE PEPVSTGLRW LSLPSLVADL RSVVTDPGAP EPLREAAATH
LARLAEEGVR GADPAEWYAL TPFTDDRPLA EEEDTIRVSP SQVESFTNCQ LRWLLERAAG
ASSGDSASAL GTVVHAVAVL VAQGSTPDEI SGRMDEIWAD LDFGGPWQAR AQRDRADTMV
RKLVDWEAAN DRELVVTEEG FRVDVGGIEI TGRVDRLERD DQGRAVVVDI KTGKNKADDL
ARHPQLGVYQ MAVLKGAFAK LGLTEPGGAA LVQVGEKIQR AREQPQPPLS EDPDPGWAGT
LVREVATGMG GSRFTATRNK GCRSCAVRAC CPVQDEGRHV
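
The protein sequence corresponds to the gene sequence translated with translation table 11, which shares the standard codon assignments but accepts alternative start codons such as GTG. The sketch below illustrates this on the first 30 nucleotides of the gene sequence. It is a minimal example and not the tool that produced this record: the function name is hypothetical, and the codon string and start-codon set are written out from the NCBI table 11 definition as an assumption to be checked against that reference.

```python
# Codon order follows the NCBI convention: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Start codons assumed for table 11 (bacterial, archaeal and plant plastid code).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            # An initiating codon is read as methionine regardless of its usual meaning.
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


fragment = "GTGCGGCGCG CCCACCAGGT GCAGCCGCCG"  # first 30 nt of the gene sequence
print(translate(fragment))  # MRRAHQVQPP
```

The output matches the first ten residues of the protein sequence. The leading GTG is read as M because it is the initiating codon, while the later in-frame GTG is read as V.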