Gene Ndas_0839 details

Gene Information

Locus tag: Ndas_0839
Symbol:
ID: 9244684
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1030176
End bp: 1031324
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: Holliday junction DNA helicase RuvB
Protein accession: YP_003678789
Protein GI: 297559815
COG category:
COG ID:
TIGRFAM ID:
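
The coordinate and length fields in this record are related by simple arithmetic. The sketch below is a consistency check, not part of the database record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length counts the stop codon; both assumptions agree with the values listed here. The function names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1030176
END_BP = 1031324


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count of a complete CDS: number of codons minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1149, matching "Gene Length"
print(protein_length(length_bp))  # 382, matching "Protein Length"
```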


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.153919
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.116315
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTACGACT TCCACGAGGA AGACCCCCGG GGCTCGGCCC CCGGCGGGCC CCCGCCCTAC 
GAGCGCGACG CGGTCTCGCC CGACGCGGGC ACGGACGAGC GCCAGATCGA GGGCGCCCTG
CGCCCCCGCG CCCTGGACGA GTTCGTCGGC CAGGAACGCG TGCGCGAACA GCTCTCCCTG
GTGCTGCACA GCGCCAAGCG GCGCAACCGG GCGCCCGACC ACATCCTCAT GTCCGGCGGC
CCGGGCCTGG GCAAGACCAC CCTGGCCATG ATCATCGCCG CCGAGATGGG GGCCCCGCTG
CGCATCACCT CGGGCCCGGC CATCGAGCGC TCCGGGGACC TGGCCGCGGT GCTCTCCACC
CTCCAGGAGG GCGAGGTGCT CTTCCTGGAC GAGATCCACC GCATGGCCCG CCCCGCCGAG
GAGATGCTCT ACGTCGCGAT GGAGGACTTC CGGGTCGACG TCGTGGTCGG CAAGGGCCCC
GGCGCCACCG CCATCCCGCT GGACATCGCG CCGTTCACCC TGGTCGGGGC CACCACCCGT
GCGGGCATGC TGCCCGCGCC CCTGCGCGAC CGCTTCGGAT TCACCGCGCA CATGGACTTC
TACACCCCCC AGGAACTGGA GCTGATCCTC CAGCGCTCGG CGGGTCTGCT CGGCGCGCCC
CTGGACGCGG ACGCGGCCGT GGAGATCGCC GGGCGCTCGC GCGGCACCCC CCGGATCGCC
AACCGGCTGC TGCGCCGGGT GCGCGACTAC GCCGAGGTGC GCGGGAACGG GCGGCTGTCG
CTGGACACCG CCCGCGCCGC CCTCGACCTC TACGAGGTGG ACGAACTGGG CATGGACCGG
CTGGACCGCG CCATCCTCGA CGTGCTCATG AGGAGGTTCC GCGGCGGCCC GGTCGGCCTG
TCCACGCTGG CGGTGTCGGT GGGGGAGGAG GCCGAGACGG TGGAGACCGT CGCCGAGCCC
TTCCTGGTCC GCTCCGGCTT CCTGGCCCGC ACCCCGCGGG GCCGGGTGGC CACCCCGCAG
GCCTGGGCGC ACATGGGGCT CACCCCGCCG CCGGACGCGG CCTTCGGCGC GGCGGCGGCC
AACGGCGGCG GCGCCGGTAA CCCCGCCCCC GCGGGCAACG CGGGTCACAA CGGTGCGGCG
AGTCCCTGA
 
Protein sequence
MYDFHEEDPR GSAPGGPPPY ERDAVSPDAG TDERQIEGAL RPRALDEFVG QERVREQLSL 
VLHSAKRRNR APDHILMSGG PGLGKTTLAM IIAAEMGAPL RITSGPAIER SGDLAAVLST
LQEGEVLFLD EIHRMARPAE EMLYVAMEDF RVDVVVGKGP GATAIPLDIA PFTLVGATTR
AGMLPAPLRD RFGFTAHMDF YTPQELELIL QRSAGLLGAP LDADAAVEIA GRSRGTPRIA
NRLLRRVRDY AEVRGNGRLS LDTARAALDL YEVDELGMDR LDRAILDVLM RRFRGGPVGL
STLAVSVGEE AETVETVAEP FLVRSGFLAR TPRGRVATPQ AWAHMGLTPP PDAAFGAAAA
NGGGAGNPAP AGNAGHNGAA SP
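
The protein sequence corresponds to the gene sequence read under translation table 11. The sketch below illustrates this on the first and last codons of the gene sequence. It is a minimal, self-contained translator rather than the database's own tooling. It assumes the codon assignments of NCBI table 11 (identical to the standard code for elongation) and that table's set of alternative initiation codons, any of which is read as M in the first position; this is why the leading GTG yields M rather than V. The function and constant names are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons assumed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(cds: str) -> str:
    """Translate a CDS, ignoring whitespace, up to the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break  # stop codon ends the protein
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # an initiation codon is read as methionine
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("GTGTACGACT TCCACGAGGA AGACCCCCGG"))  # MYDFHEEDPR
# Final 9 bases of the gene sequence, ending in the TGA stop codon.
print(translate("AGTCCCTGA"))  # SP
```

Passing the full gene sequence to `translate` in the same way would allow a comparison against the protein sequence listed above.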