Gene Ndas_3661 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3661 
Symbol 
ID9247530 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4392461 
End bp4395568 
Gene Length3108 bp 
Protein Length1035 aa 
Translation table11 
GC content79% 
IMG OID 
ProductSuperfamily I DNA and RNA helicase and helicase subunits 
Protein accessionYP_003681565 
Protein GI297562591 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.517036 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGGTCAC TGCGGGACGA CGAGGACGAG GTCGGGCACG CGGGCGTGCC GGGGGCGCCG 
CCGGGAGTCA CCGGCCCCCT CCCCGGCGCG CTCGGCGGCC CGTCGGGAGT CGGCACGGCC
TCGCTGCGCT GGAAGCCCCC GATCGACCGT TCCGAGGTGA GCTCCCTGCT GGCCTACCTG
CGCGCCTGCA TGCGCCGCGA GGCGGTGCAC ACCCACGTCG TCCCGGTCGG CGACCTGGGC
TCCGACAGCT GCGCGTGCCT GCCCCCGGGC CCGGAGGTGC TCTTCAGCGG GGCCGGGGAG
ACGCTTGCGC TGCGCCCGGA GTCCGTCCGC GTCCTGCGCG CGGCCAACGA CCTCGGCCAG
TCCGCGCGCT ACGGCTACCC GCTCGTGGTC CTGGGGGAGG GCGGGGACCG GGCGGCCCTG
CCCCTGCTCA CCGTCGACGT GCGCGTCGTG GACGAGACCG CGGACACCCC CGCCGGGGCC
GGGGACACCC TGGTCCGCGC GGTCGGCCCG CCCGACGTCA ACCCCGCGCT CCTGGAACGC
CTCGGCCTCA CCGACCCCGA GGACCTGTTC GAACTCCGGA CCCGGCTGCG CTCCGGGGCG
CCCGACCCCC TGCGGCGCCC CGCGGCCGTC GCCGACCTCG CCGCCAGGAT CCGCCTCCTG
CTCGCCGGCC TGGAGATCGA GCGGGTGGAC GACATCGCGC CCCTGGGCAC CCGCGGCGCC
CCGCGCACCT GGATGGACGG CGCCCACAAC GTCGCCGTGC TCTTCCGGGC CGGGCCCGGG
GGACGCCACG ACGCCGACGA GCGCGAACCC GTCGGCGTCG AGGGCGTCCT GGCCGACCTC
GACCCCTCCG GCCGCGACGG CCTCGACCCC GAGGACGTCA GCGGGACCGC CCTGGAGGCC
CTGCTCGGAC AGCCCTCCCG AACGGCCGCG GCCTCCGGCC CGGCGGCGTC CGGACGGCTC
TTCCGCTGCT CCCGCGCCGC GAACCGCCGC GGTTGCGCCG ACCCCGGCGG GGACACCGCC
GAGCCCCCCG TCCCGATCTC GGCGACCGCC CTCGACCAGT CCCAGTACGC GGTCCTGTGC
GCGGCCATGC GTGAGCAGCT CACCGTGGCC GCGGCCCCGC CCGGGAGCGG GGTGCACGAC
CTGGTCGACG CCCTCGTGCG CACCGCCGTC AGCAACGGCC AGCGCGTCCT GGTGTGCGGT
CGCACCGAGG CCGACGTCGC CGCGGTGCGC CTGCGCGCGG ACGCCTCCCC CGGGCACCCC
GTCGTGCGCG TGGGCGGGGA GGGACGGCGC ACGGCCGAGG CCGTGCTGCT CACCCGGCTC
CTCACCGAGC ACTCCCGGAG CCTGCCCGCG GCCCCCGCCG GTTCCGGCGG GGAGGACCCC
ACCGTCCACT GGGCGGACCT GGCCGCGGAC TGGACGGCGG TCCGCGAGGC GTGGCGGGCC
ATGGACACCA TGGCCTCCGG CGGCCACGCG CTCGCCCGCC TGGCCGAGGA ACGCGGCCGG
GTCGTCGCCC AGGGGTGGGA CCCCGACTCC CTCTTCACCC CGGAGCGGGG CGGCCCCGAG
TACTGGCTGC ACCGGGCCGA GCGCGCGGCG GCCGGGGGCC TGGCCGGGCT GGCGCACCGG
GGCGCCGTCC GCCGCGAACT GGGCGTGGAC ACCGACCCCG ACAGCCTCGC CCGCCTGCGC
GCGGTCGCCC GTCTGGAGGG CGAGTGGCGC GCGGCCGTGG ACCGGCGGAT CCGCTGCGCG
CCGCTGAACG CGCTCACCGC CGACCTGGCC GACGCCCTCG CCCGGCACCG CAGGTCCAGC
TCCGCCTGCC TGCGCGCGGT CGGCGACCCC CGGCTGTGGC GGGGACGCGC CGCCATCGAG
CACCGCCTGG AGAGCCTCAA CTGGCACCGC GGCCACGGCT GGCCGGGCCC GGGCGCGCTG
TTCGACACCC TGCCCGCGTG GGTGTGCCGC ACCGACCAGG TCCGGGCGCT GCAACCCCGG
GCGGGCCTGT TCGACCTGGC CGTCGTGGTC GGGGCCGAGC GCACCCGGCT GGCCGAACTG
CTGCCGGTGC TCTACCGCGC CAACCGCGCC GTGGTGTTCG GCGACCCCGC GCACCCGGGA
CCGGTGAGCG TGCTGGAACC GGACGAGGAG CGCCGCGCGC TGGCCGCCGC GGGTCTGGTC
GCGGGCCAGT TGGACGACCG GGGCCTGCGC TACGGCGGCG GTTCGGCGAT GCGCGCCCTG
TACCGGGCGG CGCCGCCCAT GCGGTGGCTG GACGAGCACG ACGGGGCGCC GCCCCAGCTC
GCCGGGGCCG CGTCGCGGCA CTGCTACGGG GGCAGGCTCG CGGTGCGCAC CCTCCCCGAC
CCCTCGGGCG GCCCCGCGTT CGAGTGGCGC GACGTCGCCG GGGTCTGCGA GGCCGCGCCC
GGGGCGTCCT TCGTCAACCG GGACGAGGCC TACCGGGTGG CGGTGGTGGT GGACGAACTC
GACGAGCACC TGCCCCGGGG CCGGGTGGTC GCGGTGGTCG CGCCGACCCA GCCCCAGGCG
GCCCTGGTGC GCCGCCTGCT GAGCAAGCGC GTCCTGCGCC ACGAGGTGCG GGTGGGCGGA
CCGGACCTGT TGGCGGACGA CCACGATCCG GCGGACGTCA CCGTCCTGAC GCCGACGCTG
GCCTCCGGGG CGCCCGCCGT CGCCGAGCGG CGTGTGCGCC GGATGGGTCA CCTGTGGTCG
TCGGTGCTCA CCCGCACCCG CCAGCGGCTC GTGGTGGTGG GCGACCGCGG GTACTGGTCG
GGCGACGACG GCCCGCTGGG CGAGCTGGAG GCGTCGGCGG CCGGGGGCCA CGCGGGGCGG
ACCGACGCGG CGGCCTCCGC GCTGGTCAGG GAGCTGCGCG GGGTGGGGAC GGAGGTGACC
CTCCAGCAGA CGGGCGAGGG CTGGACCGCC GACATGGTGG TCCGGTTCGG CTCCCGCCGC
CTGCTGCTCC TGCTCGACCG GGAGCCCGAC GGCCGCTCCC TGCGCTGGCT GCTCGCCCGG
GGGGAGACGC TCAACCGCAC CACGGGGGAC CCCGTGGTCG TGGTGCCCGC CTGGCGCTGC
CTGGCCGACC CGCGCGCCCT GGTGGAGGAG ATCCTCACCG CCCACTGA
 
Protein sequence
MRSLRDDEDE VGHAGVPGAP PGVTGPLPGA LGGPSGVGTA SLRWKPPIDR SEVSSLLAYL 
RACMRREAVH THVVPVGDLG SDSCACLPPG PEVLFSGAGE TLALRPESVR VLRAANDLGQ
SARYGYPLVV LGEGGDRAAL PLLTVDVRVV DETADTPAGA GDTLVRAVGP PDVNPALLER
LGLTDPEDLF ELRTRLRSGA PDPLRRPAAV ADLAARIRLL LAGLEIERVD DIAPLGTRGA
PRTWMDGAHN VAVLFRAGPG GRHDADEREP VGVEGVLADL DPSGRDGLDP EDVSGTALEA
LLGQPSRTAA ASGPAASGRL FRCSRAANRR GCADPGGDTA EPPVPISATA LDQSQYAVLC
AAMREQLTVA AAPPGSGVHD LVDALVRTAV SNGQRVLVCG RTEADVAAVR LRADASPGHP
VVRVGGEGRR TAEAVLLTRL LTEHSRSLPA APAGSGGEDP TVHWADLAAD WTAVREAWRA
MDTMASGGHA LARLAEERGR VVAQGWDPDS LFTPERGGPE YWLHRAERAA AGGLAGLAHR
GAVRRELGVD TDPDSLARLR AVARLEGEWR AAVDRRIRCA PLNALTADLA DALARHRRSS
SACLRAVGDP RLWRGRAAIE HRLESLNWHR GHGWPGPGAL FDTLPAWVCR TDQVRALQPR
AGLFDLAVVV GAERTRLAEL LPVLYRANRA VVFGDPAHPG PVSVLEPDEE RRALAAAGLV
AGQLDDRGLR YGGGSAMRAL YRAAPPMRWL DEHDGAPPQL AGAASRHCYG GRLAVRTLPD
PSGGPAFEWR DVAGVCEAAP GASFVNRDEA YRVAVVVDEL DEHLPRGRVV AVVAPTQPQA
ALVRRLLSKR VLRHEVRVGG PDLLADDHDP ADVTVLTPTL ASGAPAVAER RVRRMGHLWS
SVLTRTRQRL VVVGDRGYWS GDDGPLGELE ASAAGGHAGR TDAAASALVR ELRGVGTEVT
LQQTGEGWTA DMVVRFGSRR LLLLLDREPD GRSLRWLLAR GETLNRTTGD PVVVVPAWRC
LADPRALVEE ILTAH