Gene Ndas_3014 details

Gene Information

Locus tag: Ndas_3014
Symbol:
ID: 9246867
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 3597967
End bp: 3599994
Gene Length: 2028 bp
Protein Length: 675 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: excinuclease ABC, C subunit
Protein accession: YP_003680930
Protein GI: 297561956
COG category:
COG ID:
TIGRFAM ID:
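
The coordinates and lengths in the fields above are mutually consistent, which a little arithmetic can confirm. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the record states neither convention.

```python
# Values copied from the Gene Information fields.
start_bp = 3597967
end_bp = 3599994

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the result is 2028 bp and 675 aa, matching the listed Gene Length and Protein Length.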


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.45539
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGCGA CTCCTCACCT GCGCCCCGCG CCCGGTTCCA TCCCCACCGA CCCGGGCGTG 
TACCGGTTCC GGGACTCCTC CGGACGCGTG ATCTACGTGG GCAAGGCCAA GAACCTGCGG
TCCCGCCTCT CCTCCTACTT CCGCGACTTC GCCGGACTGC ACCCGCGCAC GCAGCAGATG
GTCTCCACCG CGGCCGACGT CGACTGGACG ATCGTGGGCA CCGAGGTGGA GGCCCTCCAG
CTGGAGTACT CCTGGATCAA GCAGTACGAC CCCCGGTTCA ACGTCAAGTA CCGCGACGAC
AAGAGCTACC CCTACCTGGC GGTCACCCTC CAAGAGGAGT TCCCCCGGGT GCAGGTCATG
CGCGGCGCCA AGCGCAGGGG GGTGCGCTAC TTCGGCCCCT ACTCCCACGC CTGGGCCATC
CGCGAGACCG TGGACCTGCT GCTGCGGGTC TTCCCCGTGC GCACCTGCTC CCCGGGGGTC
TTCCGCGGCG CCCGCAACAG CGGCCGCCCC TGCCTGCTCG GCTACATCGG CAAGTGCGTG
GCCCCCTGCG TGGGCAAGGC CTCGCCCGAG GACCACCGGG CCCTGGCCGA GGACTTCTGC
TCCTTCCTGG CGGGCGACAC CGGCCGGTTC CTGCGCGAGC TGGAGGGGCG GATGCGCGAG
GCCGCCGGGG AGATGGAGTA CGAGCGGGCC GCCCGCATCC GCGACGACAT CGAGGCCCTG
CGCGCCGCCC TGGAGAAGCA GGCCGTCGTC CTGCCCGACT CCACCGACTG CGACGTCATC
GCCGTGGCCG ACGACCAGTT GGAGGCGGCC GTGCAGATCT TCCACGTGCG CGGCGGGCGC
ATCCGCGGAC AGCGCGGCTA CGTGGTGGAC AAGGTGGCCG ACGACGGCCC CGGCGAGCTG
ATCGCGACCT TCCTCGGCCA GATCTACGGT CCCACCCGGG GCGACGACGA GAGCGGGGGC
ACCGGCACCG CCGTACCCCG CGAGGTCCTG GTCTCCCATG AGCCCGCCGA CCCCGAGGCC
ATGGCCGCCT GGCTGTCGGA GCACCGCGGC TCCTCGGTGG ACCTGCGGGT GCCGCAGCGG
GGCGACAAGA AGTCCCTCAT GGAGACCGTC GCCAAGAACG CGGCCGAATC GCTGGCCCGG
CACAAGACCC ACCGGGCCGG GGACCTGAGC ACCCGCGGCC GCGCCCTCCA GGAGATCCAG
GAGGCCCTGG AGCTGCCCGA GGCGCCGCTG CGCATCGAGT GCTTCGACAT CTCCAACCTC
CAGGGCGAGC ACGTGGTGGC GTCCATGGTC GTCTTCGAGG ACGGCCTGGC CCGCAAGTCC
GAGTACCGCC GCTTCTCCGT CCGCGGCAGC GGCGAGGGCG GCCGGGAACA GCACGACGTC
GCGGCCATGT ACGAGGTCGT CCACCGGCGC TTCCGGCGCT ACCTGGAGGA GAGCGCCCGC
AGCGGCGAGG TCGCCCGCAT GGGGGAGACC GGCGACCACG GTGGTCACCA AAGCGACGAC
GAGCCGTCAC CCGGGAAGTT CGCCTATCCG CCTAACCTGG TGGTGGTGGA CGGCGCCCGG
CCCCAGGCCG AGGCGGCGCG CCGCGCCCTG GACGAACTCG GGATCGAGGA CGTCGCCGTG
TGCGGTCTGG CCAAGCGCCT GGAGGAGGTG TGGTTGCCCG GCGACGAGGA CCCGGTGATC
CTGCCCCGCG CGGGCGAGGG GCTCTACCTG CTCCAGCGGG TGCGCGACGA GGCCCACCGC
TTCGCCATCC AGTACCACAG GCACAAGCGC GCCAAGGCGC TGACCGGCAG CAGCCTGGAC
GAGCTGCCCG GTCTGGGGCC GTCCCGCAGG ACGGCCCTGA TCAAGGCGTT CGGCTCGGTG
CGCAGGCTCG CCTCGGCCAC GGCCGAGGAG ATCGCGGCGG TGCCCGGGAT CGGCCCCAAG
CTGGCCGAGG CCGTGCACGC ACACCTGTCC GGCGGCCCCG CCACGACGGA GGGGCGGGGC
GACGGGGCAC CAGCACAGCA CATCACTGAC GGGGGAGGAG ACGCATGA
 
Protein sequence
MAATPHLRPA PGSIPTDPGV YRFRDSSGRV IYVGKAKNLR SRLSSYFRDF AGLHPRTQQM 
VSTAADVDWT IVGTEVEALQ LEYSWIKQYD PRFNVKYRDD KSYPYLAVTL QEEFPRVQVM
RGAKRRGVRY FGPYSHAWAI RETVDLLLRV FPVRTCSPGV FRGARNSGRP CLLGYIGKCV
APCVGKASPE DHRALAEDFC SFLAGDTGRF LRELEGRMRE AAGEMEYERA ARIRDDIEAL
RAALEKQAVV LPDSTDCDVI AVADDQLEAA VQIFHVRGGR IRGQRGYVVD KVADDGPGEL
IATFLGQIYG PTRGDDESGG TGTAVPREVL VSHEPADPEA MAAWLSEHRG SSVDLRVPQR
GDKKSLMETV AKNAAESLAR HKTHRAGDLS TRGRALQEIQ EALELPEAPL RIECFDISNL
QGEHVVASMV VFEDGLARKS EYRRFSVRGS GEGGREQHDV AAMYEVVHRR FRRYLEESAR
SGEVARMGET GDHGGHQSDD EPSPGKFAYP PNLVVVDGAR PQAEAARRAL DELGIEDVAV
CGLAKRLEEV WLPGDEDPVI LPRAGEGLYL LQRVRDEAHR FAIQYHRHKR AKALTGSSLD
ELPGLGPSRR TALIKAFGSV RRLASATAEE IAAVPGIGPK LAEAVHAHLS GGPATTEGRG
DGAPAQHITD GGGDA
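
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own pipeline: the helper names (`clean`, `translate`, `gc_content`) are hypothetical, and it relies on translation table 11 assigning the same amino acids to codons as the standard code (the two differ only in permitted start codons, and this gene starts with ATG).

```python
BASES = "TCAG"
# Amino acids for all 64 codons, listed in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGGCTGCGA CTCCTCACCT GCGCCCCGCG CCCGGTTCCA TCCCCACCGA CCCGGGCGTG"
last_row = "GACGGGGCAC CAGCACAGCA CATCACTGAC GGGGGAGGAG ACGCATGA"

print(translate(first_row))
print(translate(last_row))
```

The first row translates to MAATPHLRPAPGSIPTDPGV and the last row to DGAPAQHITDGGGDA, matching the start and end of the protein sequence. Passing the full gene sequence to `gc_content` gives a value to compare against the listed GC content.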