Gene Ndas_3018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3018 
Symbol 
ID9246871 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3602954 
End bp3605800 
Gene Length2847 bp 
Protein Length948 aa 
Translation table11 
GC content70% 
IMG OID 
Productexcinuclease ABC, A subunit 
Protein accessionYP_003680934 
Protein GI297561960 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.244171 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.490973 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTCACT CGATGGTCGA ACAACTGGTA GTCCGGGGAG CCCGGGAGCA CAACCTCAAG 
GACGTCTCGC TGGACCTGCC GCGCGACTCC ATGATCGTGT TCACCGGCCT GTCCGGTTCG
GGCAAGTCGT CGCTGGCCTT CGACACGATC TTCGCCGAGG GCCAGCGGCG CTACGTGGAG
TCGCTGTCGG CCTACGCCCG CCAGTTCCTG GGGCAGATGG ACAAGCCGGA CGTGGACTTC
ATCGAGGGCC TGTCCCCGGC GGTGTCCATC GACCAGAAGT CCACCAGCCG CAACCCCCGC
TCCACAGTCG GCACCATCAC CGAGGTCTAC GACTACCTGC GGCTGCTGTG GGCGCGCGTC
GGCGTCCCGC ACTGTCCCGA GTGCCGCCGC GAGATCGCCC GCCAGACCCC GCAGCAGATC
GTGGACCGCG TCCTGGAGAT GGAGGAGGGC ACCCGCTTCC AGGTGCTCGC CCCCGTCGTG
CGCGGACGCA AGGGCGAGTA CGTCGAGCTG TTCAAGGACC TGCAGTCCAA GGGCTACACC
CGCGCGGTGG TGGACGGGCA GGCCGTGCGC CTGGACGAGG CGCCCAAGCT GGGCCGCTAC
GACAAGCACG ACATCGCCGT GGTCGTGGAC CGCCTCAGCG TCAAGCCCTC CTCCCGCGGG
CGCCTGACCG ACTCGGTGGA GACGGCGCTC AAGCTGGCCG GGGGCACCAT CATCCTGGAC
TTCGTGGACG TGGAGGCCGG GGACCCCGAC CGCGAGAAGG TCTTCTCCGA GCACCTGTAC
TGCCCTTACG ACGACCTGTC CTTCGAACAG CTCGAACCGC GCTCCTTCTC CTTCAACGCC
CCCTACGGCG CCTGCGCCGA GTGCTCGGGC CTGGGCACCC GCATGGAGGT CGACCCCGAA
CTCCTGGTCC CCGACCCCGA GAAGACGCTG GCCGAGGGCG CCATCGGCCC CTGGTCCGGC
GGGCCCAACA GCGGCTACTG GGAGCGCATC CTCAAGGCGG TGGGCGAGGC GATCGGCTTC
GACCTGGACA CCCCCTGGGA GCGGCTGCCG CGCCGCGCGC GCAAGGCCCT GCTGGAGGGG
CACGACACCC AGGTCCACGT CAGCTACCGC AACAGGTACG GGCGCAACCG CTCCTACTAC
ACCGAGTTCG AGGGCGTCAT CCCCTGGGTC AAGCGCCGCC ACTCCGAGAC CGAGAGCGAC
TACGGCCGCG AACGGCTGGA GGGGTACATG CGCACCGTGC CCTGCCCGAC CTGCGAGGGC
ACCCGCCTCA AGCCGGTCGT GCTCGCGGTG ACCGTGGGCG GCAAGTCCAT CGCCGAGGTG
GCCCAGATGC CGCTCAGCGA CAGCGCGGCC TTCCTGGCCG GGCTCACGCT CTCCGAGCGC
GACGCGGTCA TCGCGGCCCA GGTGCTCAAG GAGATCAACG CCCGGCTCGG CTTCCTGCTC
GACGTCGGCC TGGACTACCT GAGCCTGGCG CGCTCCTCGG GCTCGCTCTC GGGCGGGGAG
GCCCAGCGCA TCCGCCTGGC CACCCAGATC GGCTCCGGCC TGGTGGGCGT GCTGTACGTG
CTGGACGAGC CCTCCATCGG CCTGCACCAG CGCGACAACG CGCGCCTGCT GGAGACCCTC
CAGCGCCTGC GCGACATCGG CAACACGCTC ATCGTCGTGG AGCACGACGA GGACACCATC
CGCGCCGCCG ACTGGGTCGT GGACATCGGC CCCGGCGCGG GTGAGCACGG CGGCCACGTC
GTGGTCTCGG GGATCGTGGA CGAGCTGCTC ACCTCCGAGG ACTCCCTCAC CGGCGAGTAC
CTGTCCGGCA AGCGCGGCAT CGAGGTGCCC GTGGAGCGCC GCCCCCTCAC CCGGGGGCAC
GAGCTGGTGG TCCGGGGAGC GCGCGAGAAC AACCTCCACG GGGTCGACGT CGCCTTCCCG
CTGGGCGTGT TCACCGCCGT CACCGGCGTG TCCGGCTCGG GCAAGTCCAC CCTGGTCAAC
GAGATCCTCT ACAAGGCGCT GGCCAAGGAG CTCAACGGGG CGCGCGACGT GCCCGGCCGC
CACCTGCGGG TCAACGGCAT GAACAAGGTC GACAAGGTCG TGCACGTGGA CCAGAGCCCC
ATCGGGCGGA CCCCGCGCTC CAACCCGGCC ACCTACTCGG GGGTCTTCGA CCACATCCGC
AAGCTGTTCG CGCAGACCAC CGACGCCAAG ACGCGCGGCT ACCAGCCGGG CCGGTTCTCC
TTCAACGTCA AGGGCGGCCG CTGCGAGGCC TGCTCCGGCG ACGGCACGCT CAAGATCGAG
ATGCAGTTCC TGCCCGACGT CTACGTGCCC TGCGAGGTGT GCCACGGCGC CCGGTACAAC
CGGGAGACCC TCCAGGTCCG CTACAAGGGC AAGAACATCT CCGAGGTCCT CAACATGCCG
ATCTCGGAGG CCCTGGAGTT CTTCGAGCCG ATCAACGCCA TCCGCCGCCA CCTCCAGACC
CTGGCCGACG TCGGCCTGGG CTACGTGCGG CTGGGCCAGC CCGCCACGAC GCTGTCGGGC
GGTGAGGCGC AGCGGGTCAA GCTCGCCGCC GAACTCCAGC GCCGCTCCAC CGGGCGGACG
GTGTACGTGC TCGACGAGCC CACGACCGGC CTGCACTTCG AGGACATCCG CAAGCTGCTG
GGCGTGCTCA ACCGCCTGAC CGACACCGGC AACACGGTGA TCGTCATCGA GCACAACCTC
GACGTCATCA AGACGGCCGA CCACGTCATC GACATGGGCC CCGAGGGCGG CTCCGGTGGC
GGCACCGTGG TCGCGCAGGG AACCCCGGAG GAGGTCGCCG CGGTGGCCGA GTCCTACACC
GGGCGTTTCC TGGCCAAGAT GCTCTGA
 
Protein sequence
MSHSMVEQLV VRGAREHNLK DVSLDLPRDS MIVFTGLSGS GKSSLAFDTI FAEGQRRYVE 
SLSAYARQFL GQMDKPDVDF IEGLSPAVSI DQKSTSRNPR STVGTITEVY DYLRLLWARV
GVPHCPECRR EIARQTPQQI VDRVLEMEEG TRFQVLAPVV RGRKGEYVEL FKDLQSKGYT
RAVVDGQAVR LDEAPKLGRY DKHDIAVVVD RLSVKPSSRG RLTDSVETAL KLAGGTIILD
FVDVEAGDPD REKVFSEHLY CPYDDLSFEQ LEPRSFSFNA PYGACAECSG LGTRMEVDPE
LLVPDPEKTL AEGAIGPWSG GPNSGYWERI LKAVGEAIGF DLDTPWERLP RRARKALLEG
HDTQVHVSYR NRYGRNRSYY TEFEGVIPWV KRRHSETESD YGRERLEGYM RTVPCPTCEG
TRLKPVVLAV TVGGKSIAEV AQMPLSDSAA FLAGLTLSER DAVIAAQVLK EINARLGFLL
DVGLDYLSLA RSSGSLSGGE AQRIRLATQI GSGLVGVLYV LDEPSIGLHQ RDNARLLETL
QRLRDIGNTL IVVEHDEDTI RAADWVVDIG PGAGEHGGHV VVSGIVDELL TSEDSLTGEY
LSGKRGIEVP VERRPLTRGH ELVVRGAREN NLHGVDVAFP LGVFTAVTGV SGSGKSTLVN
EILYKALAKE LNGARDVPGR HLRVNGMNKV DKVVHVDQSP IGRTPRSNPA TYSGVFDHIR
KLFAQTTDAK TRGYQPGRFS FNVKGGRCEA CSGDGTLKIE MQFLPDVYVP CEVCHGARYN
RETLQVRYKG KNISEVLNMP ISEALEFFEP INAIRRHLQT LADVGLGYVR LGQPATTLSG
GEAQRVKLAA ELQRRSTGRT VYVLDEPTTG LHFEDIRKLL GVLNRLTDTG NTVIVIEHNL
DVIKTADHVI DMGPEGGSGG GTVVAQGTPE EVAAVAESYT GRFLAKML