Gene Ndas_4694 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4694 
Symbol 
ID9248576 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5570732 
End bp5573695 
Gene Length2964 bp 
Protein Length987 aa 
Translation table11 
GC content78% 
IMG OID 
ProductLantibiotic dehydratase domain protein 
Protein accessionYP_003682586 
Protein GI297563612 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.908748 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.810572 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGGGC CGGACTTCGG CCTGGTCCGC GTCCCCCTGC TCAGCGTCGA GGAGTCCGCC 
CGCATGGTCG CCGCGGGCGA CGTCCACGAC CCCGTCGCCG CCATGGCCGT CGACCTGGCC
GCCGACCCCC ACCTGCGCAC CGCGTCCGCG TCCGAGGAGC GCGCCCGCGC CACCCTGCTG
CGCTACCTCG CGCGCATGGG CGGGCGCGCC ACCCCCTACG GGCTGTTCGC CGGAACCGCG
CCGCTGTCGG TCGGGGACCG CCGCGACCTG GAGGCCGACC GGCGCGACCA ACACCGCGTG
CGGGTACGCG TGGACGTGCG CGCCCTGGAG GAGACGGTCG CCGACGCCCT CGCCGACGCC
GACCCCGACC ACGTCCCCCT GCGGCTCAAC CCCCTGGCCG GCGTCGGTCC CTCCGCCGTG
CGCTTCGCCG CTCCGGGCGA CGCCACCGCG GCCGTGGTGA GCGTTCGGCG CACCGAGGCC
ATCGACACCG CCCTGGAGGT CCTCGGCGGC GCCGAGATGA GCGCCGCCGA CCTCGCCGAA
GCGCTCTCCG AGCGGCTGCC CGGCGTGGAG CCCGACCGCC TGCGCGCCTT CGTGCGCGGA
CTGAGGGACA GGGGCCTGCT CCAGCCCTCC GACGGCCTGA TCGCCCCCGG TGACGAGCCC
GCCGACCGGG CCGTGCGCCT CCTGGACGCC GTCGGCGACC GTGACCGGGC CGCCGCCGTG
CGCGTCCTGC TGGCCGACGC CGCAGGGGAG CGCCCCTTCG AGCCGGGGCT GCGCGACCGT
CTCGACACCG CCTGGGACCG AGCCGCCGAC CACGCGCCCG CCCTGGCCCG GACCAGGTAC
GCCGAACGCT TCGACCTGCA CCCCGAACTC GCCATGCGCG CCGCCAGCCT GGACCGGCGC
ACCGTCGCCG ACCTGCGCTC CGCCGTGCGC CGCCTCACCG CCCTCTCCTC CCCCGGCGGC
GGCCCAGGCT TCGACATGGC CTCCTTCCGC GCCGCCTTCG CCCAGCGCTT CGAGGACGCC
GAGGTGCCGC TGCTGAGCGC GCTGGACCTG GAGTCGGGCG TGCTGCGGCC CGCTCGGCGC
GGCGCCTCCG AACTCGCCGC CAGGGCGGGC CTGCGCGCCG GTTCCCGCCC CGCCGAACCC
ACCGTCAACC CCGAGCTGCT CGACCTGCTC GGACGCTGGA CCGCCGACGG CGGCCACCTC
GACGGCGGCT CCGTGGACAT CGCCCACCTG CCCGAGTCCG ACACCGACGG CTCCCGCGCG
CTGCTGGCCG TCCTGCTCGG CGACGCCGAC CCCTCCTCCC ACGACGGACC GCACAGCATG
CTCGTCGGCG GGGTCGGCCG CGCCCCCCAC GCCCTGGTGG CCCGGTTCGG CCTCCACCGC
CCCGCCGTCG CCGACCGCGT CCGGGAGCAG GTCGACCGCG CCCGCGGGCG GCACGGAGCC
GCGGACCCCG CGCGGAACCC CCTCCACGCC GAACTCGTCT ACCACCCGGG CGGACGCATC
GGCAACGTCC TGGTGCGCCC CCGCGTGCTG GACGAGACCA TCGCCCTGAC CGGCGCCCAC
GCGGGCACCC TGCACCTGGA CCGGCTGCTC CTGCGCCTGT GCCCGGACGG CTTCCGCCTG
CGCGACGCCC GCACCGGCCG ACCCGTCCTC GTGGAGCTCA ACACCGCCCA CAACGTCGAC
TTCCACGGCC TGGACCCGGT CTACGCGGTG CTGGGCCACC TGGCCACGTC CGGCGGAGCG
GGCTGGTCGT GGGGCCCGCT GGCCCGTCTG CCCCACCTGC CCCGCGTCAC CTGCGGCCGG
GTCGTGGTCA CCCCCGAGCG CTGGCTGCTG CGCCCCGGGG ACGTCGCCGC GGTCCTGTCC
GCGCCCTCCC CGGCCGCCGC GCTGCGCGAT CGCCTGCCCG GCCTGGGCGG GCGCACCTGG
GTGGGCACCG GCGAGTACGA CCACGTGCTC CCCGTGGACC TGCGCGAGGA CGCCTCGGTG
CGCGCCGCCC TGGCACGCGC CGGCGAACGC GACACCGCTT TCGTGGAGAT GCCCCAGGCC
GAGGCGCCCG CCGTGCGCGG CCCCGGCGGG GGCCACGTCG CCGAGGTGGT CGTGCCCACC
GGGCCCGTCC TGCGCGAGCC GCCCGGCACC GGCGCGGGGA CGGCCGTCCT GGACCGCGGA
CACGGCCGGG CCTGGATCTA CGCGCGCCTG TACTGCGGGC ACGCCACCGC CGACCAGGTG
GTGGCGCGCG CCCACCGGCT CTCCTCGGAC CTGCGCGCCG CCGGTGAGGC CGACCAGTGG
TTCTTCCTCC GCTACCAGGA CGGGGACGGC TACCACGTGC GGGTGCGGGT CCGCCCGGCC
GAACCCGCCG CGCGGCCCGG CGTGCTGACC GCCGTGGACG CCCTCGGGGC CCGGCTGGCC
GCCGAGGGCC TGGTCAGCAG GGTCGTCCTG GACGAGTACG TGCCCGAGGT GGCGCGCTAC
GGCGGCACAG AGGGCCTGCG GGCGGCCGAG GGGCTGTTCA CCGCCTCCAG CGACCGCGTC
GCCGCCGCGC TGCCGGAGCT GGCCGACGAG TCGGCCCGCC TCTACCGGGC GGTCGCCGAC
GTCACCCACT GGTGCACCGA GCTGTTCGCC GCCTTCGACG AGCGCGAGGA GTTCCTGCGC
GCGTGCCAGG GCGGTCTGGA CGTGGCCCCC ACCCGCGAGG GCAACCGCCT CGGCAAGTTC
GCCCGCACGC ACGAGGCCGC CCTGCGCGCC CACCTGGAGG GGGTCCGCTC CGACGAGGGC
GTGGCCAAGG CCCTGGGCGC GCTGGCCGCC GCGCTGGAGC CCGGGACCGG GACCCGCGAC
CGGTGGTCGG TGTTCGGGTC GGCGCTGCAC CTGCACCTGA ACCGGACCTT CGCCTTCGAC
GCGGTGCGCA TGGAGTACCT GGCGCACGAA CTCGCCCGGC GCCACCTGCG CCGTCTGCAC
GCACTGGAGG GCAGGAAACG ATGA
 
Protein sequence
MSGPDFGLVR VPLLSVEESA RMVAAGDVHD PVAAMAVDLA ADPHLRTASA SEERARATLL 
RYLARMGGRA TPYGLFAGTA PLSVGDRRDL EADRRDQHRV RVRVDVRALE ETVADALADA
DPDHVPLRLN PLAGVGPSAV RFAAPGDATA AVVSVRRTEA IDTALEVLGG AEMSAADLAE
ALSERLPGVE PDRLRAFVRG LRDRGLLQPS DGLIAPGDEP ADRAVRLLDA VGDRDRAAAV
RVLLADAAGE RPFEPGLRDR LDTAWDRAAD HAPALARTRY AERFDLHPEL AMRAASLDRR
TVADLRSAVR RLTALSSPGG GPGFDMASFR AAFAQRFEDA EVPLLSALDL ESGVLRPARR
GASELAARAG LRAGSRPAEP TVNPELLDLL GRWTADGGHL DGGSVDIAHL PESDTDGSRA
LLAVLLGDAD PSSHDGPHSM LVGGVGRAPH ALVARFGLHR PAVADRVREQ VDRARGRHGA
ADPARNPLHA ELVYHPGGRI GNVLVRPRVL DETIALTGAH AGTLHLDRLL LRLCPDGFRL
RDARTGRPVL VELNTAHNVD FHGLDPVYAV LGHLATSGGA GWSWGPLARL PHLPRVTCGR
VVVTPERWLL RPGDVAAVLS APSPAAALRD RLPGLGGRTW VGTGEYDHVL PVDLREDASV
RAALARAGER DTAFVEMPQA EAPAVRGPGG GHVAEVVVPT GPVLREPPGT GAGTAVLDRG
HGRAWIYARL YCGHATADQV VARAHRLSSD LRAAGEADQW FFLRYQDGDG YHVRVRVRPA
EPAARPGVLT AVDALGARLA AEGLVSRVVL DEYVPEVARY GGTEGLRAAE GLFTASSDRV
AAALPELADE SARLYRAVAD VTHWCTELFA AFDEREEFLR ACQGGLDVAP TREGNRLGKF
ARTHEAALRA HLEGVRSDEG VAKALGALAA ALEPGTGTRD RWSVFGSALH LHLNRTFAFD
AVRMEYLAHE LARRHLRRLH ALEGRKR