Gene Ndas_5235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5235 
Symbol 
ID9249128 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp388004 
End bp390349 
Gene Length2346 bp 
Protein Length781 aa 
Translation table11 
GC content70% 
IMG OID 
ProductATP-dependent DNA helicase PcrA 
Protein accessionYP_003683121 
Protein GI297564148 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00141788 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGTCCTCCC AAGAACAGCT TCTCGAAGGT CTGAACGGTC CCCAGCGCGA CGCCGTCACC 
CACAGCGGCT CCCCGCTCCT GATCGTGGCG GGTGCCGGGT CCGGCAAGAC CCGGGTCCTC
ACCCACCGCA TCGCCCACCT CATGGCCGCG CGCGGTGTCC GCCCGGGTGA GATCCTCGCC
ATCACCTTCA CCAACAAGGC CGCCGCCGAG ATGCGCGAGC GCATCCAGGC GCTGCTGGGC
GTGCGCGCCG CCAACAGCAT GTGGACCATG ACCTTCCACT CCGCGTGCGT GCGCATCCTG
CGCAGGGAGG CAGCCAGGCT CGGCTACCCG AGCAGCTTCA CCATCTACGA CTCCGCCGAC
TCCGCGCGCC TGATGCAGCT GGTGTGCAAG GAGATGGACC TGGACCCCAA GCGGTTCCCG
CCCAAGTCCT TCTCCGCCCA GGTCTCCAAC CTCAAGAACG AGCTGGTCGA CTACGACACG
TTCGCCGGAC AGGCCCAGAC CGAGCAGGAG AAGAAGCTCG CCGAGGCCTA CCAGCTCTAC
CAGCGCCGCC TGCACGAGGC GGGCGCGATG GACTTCGACG ACCTGATCAT GGTCACCGTC
AACCTGTTCC AGATGTTCCC GGACGTCGCC GAGTACTACC GGCGCCGCTT CCGGCACGTC
ATGGTCGACG AGTACCAGGA CACCAACCAC GCCCAGTACG TGTTCATCCG CGAACTGGTC
GGCGTGGCCG AGGGCTCCGA CACCAGCGTG GTGCCGCCCG CTGAGCTGTG CGTGGTCGGC
GACGCCGACC AGTCCATCTA CGCGTTCCGC GGCGCCACCA TCCGCAACAT CCTGGAGTTC
GAGCGCGACT TCCCCGACGC GCGCACCATC CTCCTGGAGC AGAACTACCG CTCCACCCAG
ACCATCCTGT CCGCGGCCAA CGCGGTCATC GACCGCAACG AGGGCCGCCC GGCCAAGAAC
CTGTGGTCGG AGCAGGGCGA CGGACCGGCC ATCGTCGGCT ACGTCGCCGA CAACGAGCAC
GACGAGGCCG CCTTCGTGGT CGGCGAGATC GACAAGCTCA CCGACGACGG AACCCTCACC
CCGAGCCAGG TCGCGGTGTT CTACCGGACC AACGCCCAGT CCCGCGTGTT CGAGGACGTG
TTCATCCGCA CCGGGCTGCC CTACAAGATC GTCGGCGGCG TGCGCTTCTA CGAGCGCAAG
GAGATCCGCG ACGTCCTCGC CTACCTGCGG GTCCTGGCCA ATCCCGAGGA CACCGTCAGC
CTGCGGCGCA TCCTCAACGT GCCCAAGCGG GGGATCGGCG CCCGCGCGGA GGAGTCGATC
GAGCTGTTCG CCGCCCGCGA GCGCATCTCC TTCTCCCGGG CGCTGCGCCG GGTGGAGGAG
ATCCCCGGGA TGGCCGCCCG CTCGGTCAAG GCGGTGCTCA ACTTCACCGC CCTGCTGGAG
GAGCTGGAGC AGACCGTGCC CGAGGGCACG CCCGCGGAGA TCGTCGAGGC GGTGCTGAGC
AAGACCGGGT ACCTGTCCGA ACTGGCCGAG TCCAAGGACC TCCAGGACGA GAGCCGGGTG
GAGAACCTGG AGGAGTTCGT CGACGTCGCC CGCGAGTTCG AGCACACCTT CGCCGCCCTC
CTGGAGGAGG AGCCCACGGA GGACGGGGAG GAGGCCGCCG GGGCCGTCGA TCCGGGGGCG
CCGACCCTGG TCGACTTCCT GGAGCGGATC TCCCTGGTCG CCGACACCGA CCAGATCCCC
GACGAGGACG ACGAGGGCGG CGTGGTCACG CTGATGACCC TGCACGCGGC CAAGGGGCTG
GAGTTCCCCG CGGTCTTCCT CACCGGGATG GAGGACGGGG TGTTCCCGCA CACCCGCACG
CTCGGCGACA AGACGCAGCT GGAGGAGGAG CGCCGTCTGG CCTACGTGGG CCTGACCCGC
GCGCAGCGCC TGCTGTACGT CAGCCGCGCC GCCGTGCGCA GCGCCTGGGG GACCCCCTCC
TACAACCCCG CCTCCCGCTT CCTGGACGAG ATCCCCTCGT CCCTGGTCGA CTGGCGCCGC
GCCGAGTCCA CCCTGGCCGC CCCGCCCAGC CGCAGCATCG GCGGCCGGGG CTCCGGGGGC
TTCGGCGGCG GGGGCGGTTT CAGCGGCACC TTCGGCGGCG GCTCACGGTC GCGCGGGGGA
GCGAAGGCGG CCAAGGAGGC GCCCGCGCTC AGTGTGGGGG ACCTGGTCAA CCACGACTCC
TTCGGCATGG GCCGGGTGCA GCTGGTGGAG GGGACCGGGG ACAGGACCAA GGCCCGCATC
GACTTCGGCG CGGACATCGG CGAGAAGGAC TTCCTGGTCA AGTACGCGCC GATCGAGAAG
CTCTGA
 
Protein sequence
MSSQEQLLEG LNGPQRDAVT HSGSPLLIVA GAGSGKTRVL THRIAHLMAA RGVRPGEILA 
ITFTNKAAAE MRERIQALLG VRAANSMWTM TFHSACVRIL RREAARLGYP SSFTIYDSAD
SARLMQLVCK EMDLDPKRFP PKSFSAQVSN LKNELVDYDT FAGQAQTEQE KKLAEAYQLY
QRRLHEAGAM DFDDLIMVTV NLFQMFPDVA EYYRRRFRHV MVDEYQDTNH AQYVFIRELV
GVAEGSDTSV VPPAELCVVG DADQSIYAFR GATIRNILEF ERDFPDARTI LLEQNYRSTQ
TILSAANAVI DRNEGRPAKN LWSEQGDGPA IVGYVADNEH DEAAFVVGEI DKLTDDGTLT
PSQVAVFYRT NAQSRVFEDV FIRTGLPYKI VGGVRFYERK EIRDVLAYLR VLANPEDTVS
LRRILNVPKR GIGARAEESI ELFAARERIS FSRALRRVEE IPGMAARSVK AVLNFTALLE
ELEQTVPEGT PAEIVEAVLS KTGYLSELAE SKDLQDESRV ENLEEFVDVA REFEHTFAAL
LEEEPTEDGE EAAGAVDPGA PTLVDFLERI SLVADTDQIP DEDDEGGVVT LMTLHAAKGL
EFPAVFLTGM EDGVFPHTRT LGDKTQLEEE RRLAYVGLTR AQRLLYVSRA AVRSAWGTPS
YNPASRFLDE IPSSLVDWRR AESTLAAPPS RSIGGRGSGG FGGGGGFSGT FGGGSRSRGG
AKAAKEAPAL SVGDLVNHDS FGMGRVQLVE GTGDRTKARI DFGADIGEKD FLVKYAPIEK
L