Gene Information |
Locus tag | Ndas_1887 |
Symbol | |
ID | 9245737 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2300319 |
End bp | 2302649 |
Gene Length | 2331 bp |
Protein Length | 776 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | excinuclease ABC, A subunit |
Protein accession | YP_003679821 |
Protein GI | 297560847 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.72737 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACAACC ACGACGACCG CATCCGGGTG CGCGGCGCCC GCATCCACAA CCTCAAGAAC GTGGACACGG AGCTGCCCCG CGACGCCCTG GTCGCCTTCA CCGGCGTCTC CGGGTCGGGG AAGTCCTCCC TGGCCTTCGG AACCCTCTAC GCCGAGTCCC AGCACCGGTA CCTGGAGTCG GTGGCGCCGT ACGCCAGGCG CCTGCTCCAG CAGCTCCCGG CCCCGGAGGT GGACGACGTC ACCGGGATGC CGCCCGCCGT GGCGCTCGCC CAGCCGCGCT CGGCGCCCTC GGTCCGCTCC ACCGTGGGCA CCCTCACCAC GCTGTCCAAC ACACTGCGCA TGCTCTTCTC CCGGGCGGGC GACTACCCGC CCGGGGCGCG GCGCCTGGAC TCGGACTCCT TCTCCCCCAA CACCGCGGCC GGGGCCTGCC CGACGTGTCA CGGGGAGGGG GTCGAGCACG AGGTCACCGA GGGGTCGCTG GTGCCCGACC CCGGGCTGAG CATCGCCGAC GGGGCGGTCG CCTCCTGGCC CGGGGCATGG CAGGGCAAGA ACCTGCGCGA CATCCTCGAC ACCCTCGGGT ACGACATCCA CCGGCCCTGG CGGGAGCTGC CCCAAGGCGA GCGCGACTGG ATCCTGTTCA CCGAGGAGCA GCCCGTGGTG ACCGTCCACC CGGTCCGCGA GGCGGGGCGT GTCCACCGGC CCTACCAGGG CAGGTACAGC AGCGCGCGCA GGCTGGTGCT CAAGGCGTAC GCCTCCTCCG CCAGCGAGGC CCGTCGGCGC AGGGCCGCCG AGTTCATGGT GGACCGGACC TGCCCGGAGT GCGGCGGTCG GCGGCTGCGG CAGGAGGCGC TGCGCGTCAC CTTCGCCGGG CACACCATCG CCGAACTCGC GGCGCTGCCC CTGACCGAGC TGGTGGCCCT GCTGCGCCCC TGGGCGCAGG ACCCGGGCGC GGCGGGGGCT CTGGTCGGGG AGATCGCGGC ACGGGTCGGG GTGCTGTCGG AGCTGGGCCT GGGCTATCTG AGCGCGGCCC GACCGGCCCC GACCCTGTCC ACCGGGGAGT TCCAGCGGAT CCGGCTGGCC ACCCAGCTGG GCACGGGGCT CTTCGGGGTG GTCTACGTCC TGGACGAGCC CTCCGCGGGC CTGCACCCGG CCGACGCCGA GGCGCTCTCC AGGACCCTGC GGCGCCTGCG CGACGGGGGC AACACCGTGT GCTTCGTCGA GCACGACCTG GACGTGGTGC GCGGCGCGGA CTGGATCGTC GACGTCGGGC CGGGGGCGGG CGAGCACGGC GGGCGCGTCC TGTACAGCGG CCCGGTCCCC GGCCTGCGCG GGGTCCCGGA GTCGGTGACA CGCCGCTACC TGTTCGGCGA CGCCCTCCCG GAGCACCGGC CCCGCACCCC CGGCGGATAC CTGGAACTGC GCGGGGCCAC CCGCAACAAC CTCCAAGGGC TGGACGCCGA CGTCCCCCTC GGCGTGTTCA CCGCCCTCAC CGGCGTGTCC GGCTCGGGCA AGTCCTCCCT GCTGGCGGAG CTCGGGGACC GGGCCGCCGA ACACGGCCGG GTGGTGTGGG TCAGCCAGCA GCCCATCGGA CGGACGCCGC GCTCCAACCT GGCGACCTAC ACCGGGCTCT TCGACACAGT GCGCAAGCTG TTCGCGGCCA CCGAGGAGGC CCGCTCGCTC GGTTACGGGC CCGGCCGGTT CTCCTTCAAC GTCGTGGGGG GACGCTGCCC GGAGTGCGAG GGCGAGGGGT TCGTGTCGGT GGAGCTGCTG TTCCTGCCCA CCACCTACGC GCCCTGTCCA 
GCCTGTGGCG GCTCGCGCTA CAACGACGAC ACCCTCCGGG TGCGGTACCG GGGGCGCACG GTCGCGGACG TGCTCGCGAT GTCGGTGGAG GAGGCGGCGG GGTTCTTCAC CGAGGAGCCG TCGGTGCGGC GTTCTCTGGA GACCCTCACC GGGGTGGGCC TGGGCTACCT GCGCTTGGGC CAGCCCGCGA CGGAGCTGTC CGGCGGCGAG GCCCAGCGGA TCAAACTGGC CACCGAACTC CAGCGCCGCC GGGTCGCCGA CACGGTCTAC CTGCTCGACG AGCCCACCAC CGGTCTCCAT CCGCACGACA CCGACGTCCT CGTCGGCCGC CTGCGCGACC TGGTCGGCGC GGGCGCCACC GTGGTGGCGG CCGAGCACGA CATGCGGGTC GTGGCCACCG CCGACCACGT CATCGACCTG GGCCCGGGCG GCGGATCGCA GGGGGGCCGG ATCGTCGCTC AGGGCACGCC CGCACAGGTC GCGGCGGCCC CGGACAGCCG CACGGGCCCC TACCTCAAGG GGCTGCTCTG A
|
Protein sequence | MDNHDDRIRV RGARIHNLKN VDTELPRDAL VAFTGVSGSG KSSLAFGTLY AESQHRYLES VAPYARRLLQ QLPAPEVDDV TGMPPAVALA QPRSAPSVRS TVGTLTTLSN TLRMLFSRAG DYPPGARRLD SDSFSPNTAA GACPTCHGEG VEHEVTEGSL VPDPGLSIAD GAVASWPGAW QGKNLRDILD TLGYDIHRPW RELPQGERDW ILFTEEQPVV TVHPVREAGR VHRPYQGRYS SARRLVLKAY ASSASEARRR RAAEFMVDRT CPECGGRRLR QEALRVTFAG HTIAELAALP LTELVALLRP WAQDPGAAGA LVGEIAARVG VLSELGLGYL SAARPAPTLS TGEFQRIRLA TQLGTGLFGV VYVLDEPSAG LHPADAEALS RTLRRLRDGG NTVCFVEHDL DVVRGADWIV DVGPGAGEHG GRVLYSGPVP GLRGVPESVT RRYLFGDALP EHRPRTPGGY LELRGATRNN LQGLDADVPL GVFTALTGVS GSGKSSLLAE LGDRAAEHGR VVWVSQQPIG RTPRSNLATY TGLFDTVRKL FAATEEARSL GYGPGRFSFN VVGGRCPECE GEGFVSVELL FLPTTYAPCP ACGGSRYNDD TLRVRYRGRT VADVLAMSVE EAAGFFTEEP SVRRSLETLT GVGLGYLRLG QPATELSGGE AQRIKLATEL QRRRVADTVY LLDEPTTGLH PHDTDVLVGR LRDLVGAGAT VVAAEHDMRV VATADHVIDL GPGGGSQGGR IVAQGTPAQV AAAPDSRTGP YLKGLL
|
| |
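The coordinate and length fields in the record above can be cross-checked against each other. The sketch below is a minimal illustration, not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates, that the gene is unspliced (the record says it is not spliced), and that the gene coordinates include the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for this example.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the Ndas_1887 record above.
start_bp, end_bp = 2300319, 2302649

length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)

print(length_bp)  # 2331, matching "Gene Length"
print(length_aa)  # 776, matching "Protein Length"
```

Under these assumptions both computed values agree with the Gene Length (2331 bp) and Protein Length (776 aa) fields reported in the record.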