Gene Ndas_0001 details

Gene Information

Locus tag: Ndas_0001
Symbol:
ID: 9248755
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 308
End bp: 2266
Gene Length: 1959 bp
Protein Length: 652 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: chromosomal replication initiator protein DnaA
Protein accession: YP_003677960
Protein GI: 297558986
COG category:
COG ID:
TIGRFAM ID:
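
The coordinate and length fields above are related by simple arithmetic. The sketch below checks them under two assumptions that the record does not state explicitly: the start and end positions are 1-based and inclusive, and the gene length includes the terminal stop codon. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(308, 2266)
    print(length)                  # gene length in bp
    print(protein_length(length))  # protein length in aa
```

With the values from this record, 2266 - 308 + 1 gives 1959 bp, and 1959 / 3 - 1 gives 652 aa, matching the Gene Length and Protein Length fields.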


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000505913
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00770539
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
TTGGCTGACG CGCAGGTCAA CCTCGCAGTG GTCTGGTCCA GCGTCCTGGA CGGGATCGAC 
AACGACTCCC TTCCCGCGCA CCAGCGCGCA TGGCTGCCGC AGACCCGTCC GCTGGGCCTG
ATCGAGGACA CCGCGCTCAT CGCGGCGCCC AACGAGTTCA CCAAGAAGGT GCTGGAGACA
CGCCTGTACC CGGCGATCAG CAAGGCGCTC TCCGCCCACC TGGGCCGGGA GATCCGGGTC
GCGGTGACGG TCGACCCGAC CGCGGTGCCG ACCCCGCCGC CCACCCCGAC GCCGCCCGCC
GCTCCGGGGC CGCAGGGCCC GCGCAGCGTG GAGGGGCACA CCGCGCCCGC CTTCTCCCCC
GTCGTGCCGG GCACCGCCCC CGGCCCGACC CGGCCGACGG CTCCCCCGGC GGCGCCCGCG
GACGCCCAGC GGGCCGTGAC CGGCTCCCGG GGACAGCACG CGCGTCCGGC AGCCGAGGCG
GCCGAGCAGC CCGCACCGTG GCCCTCGGCC CCCGCTCCGG CCCCGGAGCA GGCCGACCTG
CTGGGACCCA CGCCCCCGGC CGACCACCCC TCCCCCACCG ACGAGGCGCC GCACACGGTC
CGCGACGCGC CCGCGCCCTG GGTCCAGCAG ACCTTCGCCT CCCCCCAGGA AACGCCCACC
GCGCCCCCGC TGTGGGAGCA GCCCTCGGCG CTGGAGCGGC CGCAGGAGCC CGCGGGGTGG
CGGCAGCAGT CCTGGACGGA ACCGGAGTGG GACCGGCCCA ACCGCTGGGA GACCCCGCGG
GAGGAGGCGC CCGCGCCGGG GGCGGAGGCG GCCGAGGAGG CCGCGCAGCC CGCTCCGGAG
GCCGTGGAGG CCCCTGTGGA GGACGAGGCC CCGAAGGCCC CCTCCTCTCC CCCGGGGCGC
TCGCAGGAGG TCCCGCCGGG CGAGCACGCG CGCCTGAACC CGAAGTACAC CTTCGACACC
TTCGTCATCG GCTCCAGCAA CCGCTTCGCG CACGCGGCGT CGGTGGCGGC CGCCGAGGCT
CCGGCCAAGG CCTACAACCC GCTGTTCATC CACGGCGGTT CGGGGCTGGG CAAGACCCAC
CTGCTGCACG CCATCGGGCA CTACACGCAC CGCCTCTACG AGGGTTCGCG GGTGCGGTAC
GTCAGCTCGG AGGAGTTCAC CAACGAGTTC ATCAACTCGA TCCGGGACGG CAAGGCGGAC
GGGTTCCGCC GCCGGTACCG CGACATCGAC GTGCTCCTGG TGGACGACAT CCAGTTCCTG
GAGAACAAGG AGCAGACGCA GGAGGAGTTC TTCCACACCT TCAACACGCT GCACAACTCC
GACAAGCAGA TCGTCATCTC CAGCGACCGG CCGCCCAAGC AGCTGACCAC GCTGGAGGAC
CGGATGCGCA GCCGGTTCGA GTGGGGCCTG CTCACCGACG TCCAGCCGCC CGAGCTGGAG
ACCCGGATCG CCATCCTGCG CAAGAAGGCC GCGCAGGAGG GGCTGGCGGC CCCTCCCGAG
GTGCTGGAGT TCATCGCGAG CAAGATCTCC ACCAACATCC GGGAACTGGA GGGCGCGCTG
ATCCGCGTGA CCGCCTTCGC CAGCCTGAAC CGGCAGTCGG TGGACCTGGA CCTGACCAGC
CAGGTGCTGC GGGACCTGGT GCCCAGCACG GAGGTCCCCG AGGTCACGGC GGGGGCGATC
ATGTCCCAGA CGGCCGCCTA CTTCGGCCTG ACGGTCGAGG ACCTGTGCGG GACCTCGCGC
TCCCGGGTGC TGGTGACCGC CCGGCAGATC GCCATGTACC TGTGCCGGGA GCTGACCGAG
CTGTCCCTGC CCAAGATCGG GCAGCAGTTC GGCCGCGACC ACACCACGGT CATGCACGCC
GAGCGCAAGG TCCGCGGGCT GATGGCCGAG CGCCGGTCCA TCTACAACCA GGTGCACGAG
CTCACCAGCC GGATCAAGGA CCAGCCCTCG CTGACCTGA
 
Protein sequence
MADAQVNLAV VWSSVLDGID NDSLPAHQRA WLPQTRPLGL IEDTALIAAP NEFTKKVLET 
RLYPAISKAL SAHLGREIRV AVTVDPTAVP TPPPTPTPPA APGPQGPRSV EGHTAPAFSP
VVPGTAPGPT RPTAPPAAPA DAQRAVTGSR GQHARPAAEA AEQPAPWPSA PAPAPEQADL
LGPTPPADHP SPTDEAPHTV RDAPAPWVQQ TFASPQETPT APPLWEQPSA LERPQEPAGW
RQQSWTEPEW DRPNRWETPR EEAPAPGAEA AEEAAQPAPE AVEAPVEDEA PKAPSSPPGR
SQEVPPGEHA RLNPKYTFDT FVIGSSNRFA HAASVAAAEA PAKAYNPLFI HGGSGLGKTH
LLHAIGHYTH RLYEGSRVRY VSSEEFTNEF INSIRDGKAD GFRRRYRDID VLLVDDIQFL
ENKEQTQEEF FHTFNTLHNS DKQIVISSDR PPKQLTTLED RMRSRFEWGL LTDVQPPELE
TRIAILRKKA AQEGLAAPPE VLEFIASKIS TNIRELEGAL IRVTAFASLN RQSVDLDLTS
QVLRDLVPST EVPEVTAGAI MSQTAAYFGL TVEDLCGTSR SRVLVTARQI AMYLCRELTE
LSLPKIGQQF GRDHTTVMHA ERKVRGLMAE RRSIYNQVHE LTSRIKDQPS LT
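
The gene sequence begins with TTG while the protein begins with M. This is consistent with translation table 11, where TTG is one of the alternative initiation codons and an initiation codon is read as methionine. The sketch below is a minimal, dependency-free illustration of that translation and of a GC-content calculation. The helper names are hypothetical, the codon table is built by hand rather than taken from a library, and the exact formula the source database uses for GC content is not known here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Table 11 shares the
# standard amino-acid assignments and differs in its initiation codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to format the sequence blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading an initiation codon in first position as M."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        codon = s[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # the initiator is methionine regardless of the codon
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    first_30_bases = "TTGGCTGACG CGCAGGTCAA CCTCGCAGTG"
    print(translate_cds(first_30_bases))  # MADAQVNLAV
    print(gc_percent("TTGGCTGACG"))       # 60.0
```

Applied to the first 30 bases of the gene sequence, the sketch yields MADAQVNLAV, the first ten residues of the listed protein sequence.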