Gene Ndas_1887 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1887 
Symbol 
ID9245737 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2300319 
End bp2302649 
Gene Length2331 bp 
Protein Length776 aa 
Translation table11 
GC content74% 
IMG OID 
Productexcinuclease ABC, A subunit 
Protein accessionYP_003679821 
Protein GI297560847 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.72737 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACAACC ACGACGACCG CATCCGGGTG CGCGGCGCCC GCATCCACAA CCTCAAGAAC 
GTGGACACGG AGCTGCCCCG CGACGCCCTG GTCGCCTTCA CCGGCGTCTC CGGGTCGGGG
AAGTCCTCCC TGGCCTTCGG AACCCTCTAC GCCGAGTCCC AGCACCGGTA CCTGGAGTCG
GTGGCGCCGT ACGCCAGGCG CCTGCTCCAG CAGCTCCCGG CCCCGGAGGT GGACGACGTC
ACCGGGATGC CGCCCGCCGT GGCGCTCGCC CAGCCGCGCT CGGCGCCCTC GGTCCGCTCC
ACCGTGGGCA CCCTCACCAC GCTGTCCAAC ACACTGCGCA TGCTCTTCTC CCGGGCGGGC
GACTACCCGC CCGGGGCGCG GCGCCTGGAC TCGGACTCCT TCTCCCCCAA CACCGCGGCC
GGGGCCTGCC CGACGTGTCA CGGGGAGGGG GTCGAGCACG AGGTCACCGA GGGGTCGCTG
GTGCCCGACC CCGGGCTGAG CATCGCCGAC GGGGCGGTCG CCTCCTGGCC CGGGGCATGG
CAGGGCAAGA ACCTGCGCGA CATCCTCGAC ACCCTCGGGT ACGACATCCA CCGGCCCTGG
CGGGAGCTGC CCCAAGGCGA GCGCGACTGG ATCCTGTTCA CCGAGGAGCA GCCCGTGGTG
ACCGTCCACC CGGTCCGCGA GGCGGGGCGT GTCCACCGGC CCTACCAGGG CAGGTACAGC
AGCGCGCGCA GGCTGGTGCT CAAGGCGTAC GCCTCCTCCG CCAGCGAGGC CCGTCGGCGC
AGGGCCGCCG AGTTCATGGT GGACCGGACC TGCCCGGAGT GCGGCGGTCG GCGGCTGCGG
CAGGAGGCGC TGCGCGTCAC CTTCGCCGGG CACACCATCG CCGAACTCGC GGCGCTGCCC
CTGACCGAGC TGGTGGCCCT GCTGCGCCCC TGGGCGCAGG ACCCGGGCGC GGCGGGGGCT
CTGGTCGGGG AGATCGCGGC ACGGGTCGGG GTGCTGTCGG AGCTGGGCCT GGGCTATCTG
AGCGCGGCCC GACCGGCCCC GACCCTGTCC ACCGGGGAGT TCCAGCGGAT CCGGCTGGCC
ACCCAGCTGG GCACGGGGCT CTTCGGGGTG GTCTACGTCC TGGACGAGCC CTCCGCGGGC
CTGCACCCGG CCGACGCCGA GGCGCTCTCC AGGACCCTGC GGCGCCTGCG CGACGGGGGC
AACACCGTGT GCTTCGTCGA GCACGACCTG GACGTGGTGC GCGGCGCGGA CTGGATCGTC
GACGTCGGGC CGGGGGCGGG CGAGCACGGC GGGCGCGTCC TGTACAGCGG CCCGGTCCCC
GGCCTGCGCG GGGTCCCGGA GTCGGTGACA CGCCGCTACC TGTTCGGCGA CGCCCTCCCG
GAGCACCGGC CCCGCACCCC CGGCGGATAC CTGGAACTGC GCGGGGCCAC CCGCAACAAC
CTCCAAGGGC TGGACGCCGA CGTCCCCCTC GGCGTGTTCA CCGCCCTCAC CGGCGTGTCC
GGCTCGGGCA AGTCCTCCCT GCTGGCGGAG CTCGGGGACC GGGCCGCCGA ACACGGCCGG
GTGGTGTGGG TCAGCCAGCA GCCCATCGGA CGGACGCCGC GCTCCAACCT GGCGACCTAC
ACCGGGCTCT TCGACACAGT GCGCAAGCTG TTCGCGGCCA CCGAGGAGGC CCGCTCGCTC
GGTTACGGGC CCGGCCGGTT CTCCTTCAAC GTCGTGGGGG GACGCTGCCC GGAGTGCGAG
GGCGAGGGGT TCGTGTCGGT GGAGCTGCTG TTCCTGCCCA CCACCTACGC GCCCTGTCCA
GCCTGTGGCG GCTCGCGCTA CAACGACGAC ACCCTCCGGG TGCGGTACCG GGGGCGCACG
GTCGCGGACG TGCTCGCGAT GTCGGTGGAG GAGGCGGCGG GGTTCTTCAC CGAGGAGCCG
TCGGTGCGGC GTTCTCTGGA GACCCTCACC GGGGTGGGCC TGGGCTACCT GCGCTTGGGC
CAGCCCGCGA CGGAGCTGTC CGGCGGCGAG GCCCAGCGGA TCAAACTGGC CACCGAACTC
CAGCGCCGCC GGGTCGCCGA CACGGTCTAC CTGCTCGACG AGCCCACCAC CGGTCTCCAT
CCGCACGACA CCGACGTCCT CGTCGGCCGC CTGCGCGACC TGGTCGGCGC GGGCGCCACC
GTGGTGGCGG CCGAGCACGA CATGCGGGTC GTGGCCACCG CCGACCACGT CATCGACCTG
GGCCCGGGCG GCGGATCGCA GGGGGGCCGG ATCGTCGCTC AGGGCACGCC CGCACAGGTC
GCGGCGGCCC CGGACAGCCG CACGGGCCCC TACCTCAAGG GGCTGCTCTG A
 
Protein sequence
MDNHDDRIRV RGARIHNLKN VDTELPRDAL VAFTGVSGSG KSSLAFGTLY AESQHRYLES 
VAPYARRLLQ QLPAPEVDDV TGMPPAVALA QPRSAPSVRS TVGTLTTLSN TLRMLFSRAG
DYPPGARRLD SDSFSPNTAA GACPTCHGEG VEHEVTEGSL VPDPGLSIAD GAVASWPGAW
QGKNLRDILD TLGYDIHRPW RELPQGERDW ILFTEEQPVV TVHPVREAGR VHRPYQGRYS
SARRLVLKAY ASSASEARRR RAAEFMVDRT CPECGGRRLR QEALRVTFAG HTIAELAALP
LTELVALLRP WAQDPGAAGA LVGEIAARVG VLSELGLGYL SAARPAPTLS TGEFQRIRLA
TQLGTGLFGV VYVLDEPSAG LHPADAEALS RTLRRLRDGG NTVCFVEHDL DVVRGADWIV
DVGPGAGEHG GRVLYSGPVP GLRGVPESVT RRYLFGDALP EHRPRTPGGY LELRGATRNN
LQGLDADVPL GVFTALTGVS GSGKSSLLAE LGDRAAEHGR VVWVSQQPIG RTPRSNLATY
TGLFDTVRKL FAATEEARSL GYGPGRFSFN VVGGRCPECE GEGFVSVELL FLPTTYAPCP
ACGGSRYNDD TLRVRYRGRT VADVLAMSVE EAAGFFTEEP SVRRSLETLT GVGLGYLRLG
QPATELSGGE AQRIKLATEL QRRRVADTVY LLDEPTTGLH PHDTDVLVGR LRDLVGAGAT
VVAAEHDMRV VATADHVIDL GPGGGSQGGR IVAQGTPAQV AAAPDSRTGP YLKGLL