Gene Information |
Locus tag | Ndas_4694 |
Symbol | |
ID | 9248576 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 5570732 |
End bp | 5573695 |
Gene Length | 2964 bp |
Protein Length | 987 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | |
Product | Lantibiotic dehydratase domain protein |
Protein accession | YP_003682586 |
Protein GI | 297563612 |
COG category | |
COG ID | |
TIGRFAM ID | |
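The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes inclusive 1-based coordinates and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 5570732
end_bp = 5573695

# Assumption: coordinates are inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: an unspliced CDS, so the length is a whole number of codons
# and the last codon is a stop codon that adds no residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 2964 987
```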
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.908748 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.810572 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGAGCGGGC CGGACTTCGG CCTGGTCCGC GTCCCCCTGC TCAGCGTCGA GGAGTCCGCC CGCATGGTCG CCGCGGGCGA CGTCCACGAC CCCGTCGCCG CCATGGCCGT CGACCTGGCC GCCGACCCCC ACCTGCGCAC CGCGTCCGCG TCCGAGGAGC GCGCCCGCGC CACCCTGCTG CGCTACCTCG CGCGCATGGG CGGGCGCGCC ACCCCCTACG GGCTGTTCGC CGGAACCGCG CCGCTGTCGG TCGGGGACCG CCGCGACCTG GAGGCCGACC GGCGCGACCA ACACCGCGTG CGGGTACGCG TGGACGTGCG CGCCCTGGAG GAGACGGTCG CCGACGCCCT CGCCGACGCC GACCCCGACC ACGTCCCCCT GCGGCTCAAC CCCCTGGCCG GCGTCGGTCC CTCCGCCGTG CGCTTCGCCG CTCCGGGCGA CGCCACCGCG GCCGTGGTGA GCGTTCGGCG CACCGAGGCC ATCGACACCG CCCTGGAGGT CCTCGGCGGC GCCGAGATGA GCGCCGCCGA CCTCGCCGAA GCGCTCTCCG AGCGGCTGCC CGGCGTGGAG CCCGACCGCC TGCGCGCCTT CGTGCGCGGA CTGAGGGACA GGGGCCTGCT CCAGCCCTCC GACGGCCTGA TCGCCCCCGG TGACGAGCCC GCCGACCGGG CCGTGCGCCT CCTGGACGCC GTCGGCGACC GTGACCGGGC CGCCGCCGTG CGCGTCCTGC TGGCCGACGC CGCAGGGGAG CGCCCCTTCG AGCCGGGGCT GCGCGACCGT CTCGACACCG CCTGGGACCG AGCCGCCGAC CACGCGCCCG CCCTGGCCCG GACCAGGTAC GCCGAACGCT TCGACCTGCA CCCCGAACTC GCCATGCGCG CCGCCAGCCT GGACCGGCGC ACCGTCGCCG ACCTGCGCTC CGCCGTGCGC CGCCTCACCG CCCTCTCCTC CCCCGGCGGC GGCCCAGGCT TCGACATGGC CTCCTTCCGC GCCGCCTTCG CCCAGCGCTT CGAGGACGCC GAGGTGCCGC TGCTGAGCGC GCTGGACCTG GAGTCGGGCG TGCTGCGGCC CGCTCGGCGC GGCGCCTCCG AACTCGCCGC CAGGGCGGGC CTGCGCGCCG GTTCCCGCCC CGCCGAACCC ACCGTCAACC CCGAGCTGCT CGACCTGCTC GGACGCTGGA CCGCCGACGG CGGCCACCTC GACGGCGGCT CCGTGGACAT CGCCCACCTG CCCGAGTCCG ACACCGACGG CTCCCGCGCG CTGCTGGCCG TCCTGCTCGG CGACGCCGAC CCCTCCTCCC ACGACGGACC GCACAGCATG CTCGTCGGCG GGGTCGGCCG CGCCCCCCAC GCCCTGGTGG CCCGGTTCGG CCTCCACCGC CCCGCCGTCG CCGACCGCGT CCGGGAGCAG GTCGACCGCG CCCGCGGGCG GCACGGAGCC GCGGACCCCG CGCGGAACCC CCTCCACGCC GAACTCGTCT ACCACCCGGG CGGACGCATC GGCAACGTCC TGGTGCGCCC CCGCGTGCTG GACGAGACCA TCGCCCTGAC CGGCGCCCAC GCGGGCACCC TGCACCTGGA CCGGCTGCTC CTGCGCCTGT GCCCGGACGG CTTCCGCCTG CGCGACGCCC GCACCGGCCG ACCCGTCCTC GTGGAGCTCA ACACCGCCCA CAACGTCGAC TTCCACGGCC TGGACCCGGT CTACGCGGTG CTGGGCCACC TGGCCACGTC CGGCGGAGCG GGCTGGTCGT GGGGCCCGCT GGCCCGTCTG CCCCACCTGC CCCGCGTCAC CTGCGGCCGG 
GTCGTGGTCA CCCCCGAGCG CTGGCTGCTG CGCCCCGGGG ACGTCGCCGC GGTCCTGTCC GCGCCCTCCC CGGCCGCCGC GCTGCGCGAT CGCCTGCCCG GCCTGGGCGG GCGCACCTGG GTGGGCACCG GCGAGTACGA CCACGTGCTC CCCGTGGACC TGCGCGAGGA CGCCTCGGTG CGCGCCGCCC TGGCACGCGC CGGCGAACGC GACACCGCTT TCGTGGAGAT GCCCCAGGCC GAGGCGCCCG CCGTGCGCGG CCCCGGCGGG GGCCACGTCG CCGAGGTGGT CGTGCCCACC GGGCCCGTCC TGCGCGAGCC GCCCGGCACC GGCGCGGGGA CGGCCGTCCT GGACCGCGGA CACGGCCGGG CCTGGATCTA CGCGCGCCTG TACTGCGGGC ACGCCACCGC CGACCAGGTG GTGGCGCGCG CCCACCGGCT CTCCTCGGAC CTGCGCGCCG CCGGTGAGGC CGACCAGTGG TTCTTCCTCC GCTACCAGGA CGGGGACGGC TACCACGTGC GGGTGCGGGT CCGCCCGGCC GAACCCGCCG CGCGGCCCGG CGTGCTGACC GCCGTGGACG CCCTCGGGGC CCGGCTGGCC GCCGAGGGCC TGGTCAGCAG GGTCGTCCTG GACGAGTACG TGCCCGAGGT GGCGCGCTAC GGCGGCACAG AGGGCCTGCG GGCGGCCGAG GGGCTGTTCA CCGCCTCCAG CGACCGCGTC GCCGCCGCGC TGCCGGAGCT GGCCGACGAG TCGGCCCGCC TCTACCGGGC GGTCGCCGAC GTCACCCACT GGTGCACCGA GCTGTTCGCC GCCTTCGACG AGCGCGAGGA GTTCCTGCGC GCGTGCCAGG GCGGTCTGGA CGTGGCCCCC ACCCGCGAGG GCAACCGCCT CGGCAAGTTC GCCCGCACGC ACGAGGCCGC CCTGCGCGCC CACCTGGAGG GGGTCCGCTC CGACGAGGGC GTGGCCAAGG CCCTGGGCGC GCTGGCCGCC GCGCTGGAGC CCGGGACCGG GACCCGCGAC CGGTGGTCGG TGTTCGGGTC GGCGCTGCAC CTGCACCTGA ACCGGACCTT CGCCTTCGAC GCGGTGCGCA TGGAGTACCT GGCGCACGAA CTCGCCCGGC GCCACCTGCG CCGTCTGCAC GCACTGGAGG GCAGGAAACG ATGA
Protein sequence | MSGPDFGLVR VPLLSVEESA RMVAAGDVHD PVAAMAVDLA ADPHLRTASA SEERARATLL RYLARMGGRA TPYGLFAGTA PLSVGDRRDL EADRRDQHRV RVRVDVRALE ETVADALADA DPDHVPLRLN PLAGVGPSAV RFAAPGDATA AVVSVRRTEA IDTALEVLGG AEMSAADLAE ALSERLPGVE PDRLRAFVRG LRDRGLLQPS DGLIAPGDEP ADRAVRLLDA VGDRDRAAAV RVLLADAAGE RPFEPGLRDR LDTAWDRAAD HAPALARTRY AERFDLHPEL AMRAASLDRR TVADLRSAVR RLTALSSPGG GPGFDMASFR AAFAQRFEDA EVPLLSALDL ESGVLRPARR GASELAARAG LRAGSRPAEP TVNPELLDLL GRWTADGGHL DGGSVDIAHL PESDTDGSRA LLAVLLGDAD PSSHDGPHSM LVGGVGRAPH ALVARFGLHR PAVADRVREQ VDRARGRHGA ADPARNPLHA ELVYHPGGRI GNVLVRPRVL DETIALTGAH AGTLHLDRLL LRLCPDGFRL RDARTGRPVL VELNTAHNVD FHGLDPVYAV LGHLATSGGA GWSWGPLARL PHLPRVTCGR VVVTPERWLL RPGDVAAVLS APSPAAALRD RLPGLGGRTW VGTGEYDHVL PVDLREDASV RAALARAGER DTAFVEMPQA EAPAVRGPGG GHVAEVVVPT GPVLREPPGT GAGTAVLDRG HGRAWIYARL YCGHATADQV VARAHRLSSD LRAAGEADQW FFLRYQDGDG YHVRVRVRPA EPAARPGVLT AVDALGARLA AEGLVSRVVL DEYVPEVARY GGTEGLRAAE GLFTASSDRV AAALPELADE SARLYRAVAD VTHWCTELFA AFDEREEFLR ACQGGLDVAP TREGNRLGKF ARTHEAALRA HLEGVRSDEG VAKALGALAA ALEPGTGTRD RWSVFGSALH LHLNRTFAFD AVRMEYLAHE LARRHLRRLH ALEGRKR
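As a rough cross-check of the two sequence fields, the first and last codons of the gene sequence can be translated with translation table 11 and compared against the ends of the protein sequence. This is a minimal sketch, not the database's own pipeline: the `translate` helper and its `first_is_start` parameter are hypothetical, the amino acid string is assumed to follow the NCBI table 11 layout with bases ordered T, C, A, G, and an initiator codon such as GTG is assumed to be read as M.

```python
from itertools import product

# Assumption: NCBI translation table 11, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: codons that table 11 allows as initiators (translated as M at position 1).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna, first_is_start=True):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()  # the record groups bases in blocks of 10
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and first_is_start and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 20 bases and last 15 bases of the gene sequence in this record.
head = translate("GTGAGCGGGC CGGACTTCGG")
tail = translate("GGCAGGAAACGATGA", first_is_start=False)
print(head, tail)  # MSGPDF GRKR
```

The two fragments match the start (`MSGPDF`) and end (`GRKR`) of the protein sequence, and the gene sequence terminates in the stop codon TGA.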