Gene Ndas_3764 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3764 
Symbol 
ID9247633 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4519811 
End bp4523224 
Gene Length3414 bp 
Protein Length1137 aa 
Translation table11 
GC content77% 
IMG OID 
Productglycosyl transferase family 2 
Protein accessionYP_003681668 
Protein GI297562694 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCTGACC AGGATTTCGC CCGACACGTC GTCACCGCGG TCATCGTCTC CCACGACGGC 
TCCCGCTGGC TGCCCGAGAC GATGCAGGCG CTGCGCGGCC AGTCCTGGCC CGTGCAGCGG
GCCGTCGCCG CCGACACCGG CAGCACCGAC GACAGCGCCG AGGTCCTGGC CCGGTTCCTG
CCCGCCGACG CCGTCGTGGA GCTGCCCGCC GACACCGGGT ACGGCGACGC CGTCCGCGCC
GCCCTGGAAC TGCCGCGCTC CACCAGCGCC GTACGCGGCT TCGATGAGGA CGCCACCGAG
TGGATCTGGC TCATCCACGA CGACCTCACC CCGGACCGGG ACGCCCTCGC CCACCTGCTG
GACGCCGCCG ACCAGGACCC CCGCGCCGCC GTCCTGGGCC CCAAGCTGCG CGATTGGTTC
GACCGCCGCC TCCTGGTGGA GGCGGGCGTG ACCATCGACG GCGCGGGCCG CCGCGAGACC
GGCCTGGAGC AGCGCGAGTT CGACCACGGC CAGCACGACG GCACACGCCA GGTCCTGGCG
GTCTCCAGCG CGGGCATGCT CGTGCGCCGC GACGTGTGGG ACCGCCTCGG CGGCTTCGAC
CCCTCCCTGC CGCTGTTCCG CGACGACATC GACTTCTGCT GGCGGGTGGG CGGCGCCGGA
CTGCGCGCCG TCCTCGTCAC CGACGCGGTC GCCTACCACG CCGAGGCCTC CGCCCGGCGC
CGCCGGCCCA TCTCCGCCAC CCGCAACCAC CCGCGCCGGG TGGACCGCCG CCACGCCGTC
TTCGTGCTGC TGGCCAACCT GCCCCTGGGC GGCATGGTGG CCGCGCTGCT GCGCAACTCC
GTGGCCTCGC TGCTGCGGGT GGTGGGGTAC CTGCTCATCA AGCAGCCCGC CAACGCCCTG
GACGAGGCCG CCGCGATCAC CCTGGTCTAC CTGCGGCCGC TCCGGCTGAT GCGCGCCCGC
TTCCGGCGCC GCCGCGACCG CCGCCGCACC TACAGCGCCA TCCGGCCCTT CCTCGCGCGC
GGCGTGGCCA TGCGCCAGGT CCACGACGTG CTCACCGGGA TCCTGTCCGG GGAGCCCGTG
CGCAACACCC CCGGCCGTCA CCAGGCCGTC ACCGCGCCGC CCGCCGCCGA GGACGAGGAC
GAGATCCCCA CGGACACCGC CGGCTTCGTG CGCAGCGTGC TCCTGCGGCC CGGGGTGCTG
CTGGTCCTGG CCCTGGCCGC GGTGGCCCTC GTCGCCGAGC GCTCCCTGCT CCTGGGCGAC
CTCCTGGCGG GCGGCGCCCT GCCCCCGGTC GCGGGCGGCG CGGGCGACCT GTGGAACCTG
TACCTGTCCG GCAGCCCCGA CAGCGGGCTC GGCGCGGCCG ACCCCGTCCC GCCCTACGTC
GGACTGCTCG CCCTGCTGTC CACGCTCACC CTGGGCAAGC CCTGGCTGGC CGTCACGATC
GTCCTGCTGG GCTGCGTGCC CCTGTCCGGG CTGACCGCCT ACCTGCTCGC CCGCGAGGTG
CTGGGCTTTC GCCCCGCACG CCTGTGGACG GCCGCGGCCT ACGCGCTGCT GCCGGCGGCC
ACCGGCGCGG TCGCCCAGGG CCGCCTGGGC ACCGCCCTGG TCCACGTCCT GCTGCCGGTG
CTGGGACTGC TGCTGGTGCG GCTGGTGTCG ATGCCGCCCA AGCCCTCGCG CCGAGCGGCC
TGGGGAGTGG GCCTGGTCCT GACGGTCGCC ACCGCGTTCG TGCCGATGGT GTGGCTCCTG
TCGCTGGTCA CCGGCGTGCT GGTGGCCGTC GCCTTCGGCC ACCTCGGCCG CCGGATCTAC
GTCAGCGTCG CGCTGGGCCT GGCCGTCCCG CTCGTGCTGC TGATGCCGTG GACGCTCGAA
CTGCTCCTGC ACCCCAGCCT GTGGCTGCTG GAGGCGGGAC TGCACCGCCC CGAGCTGTCG
GCGCCGGGCC CGACCCCGCA GGAACTGCTC ATGCTCTCGC CGGGCGGCCC CGGGACACCG
CCGTTCTGGG TCACGGCCGG GTTCGCCGTG GCCGCGCTGT GCTCCCTGCT GCTGCTCCGC
AACCGGATGC TGGCCGCGGC GGGGTGGTCG CTCGCCCTGT TCGGGATCCT CGTGGCCGTC
CTGACCAGCC GGGTGGTCGT CGAGCCCTAC TACGGCGGCC CCCCGGCGCC GGTCTGGCCC
GGGGTCGCCC TGACCTTCGC CGCCACGGCG GTGCTGCTGT CGGCGGCGAC CGCCGCGCGC
TCCTTCGGCG ACATGGTGCG CCTGGGCGGG CCCAGGCGGG TCTTCGCGCT CGCCGTGGGA
CTGCTCGCCC TGGCGACCCC CGTCTGCGCC GCGGGCGTGT GGATGTGGGA GGGCGTGCGG
GGGCCGGTCA CCGCGCACGC CGAACCCGTC GTCCCCGGCG CGCTGACCGG GGCCGGGACC
GTCGAGGGCG GCGGGTCCGA CGGGGCGGGG ACCCGGCCGC GCACCCTCGT GGTCACCGCC
GACGGTGAGG GCGGCGTGGA CTACCTGGTC GTGCGCGGAC GCGAACCCCG TCTGGGCGAG
GAGCACCTCG TGCCCGAGAG CGGGATCCGC GGGGCGATGG ACCGGGCCGT GGCCGAACTC
ACCGTCGGCC AGGGCGGCGA CCAGATGTAC ACCCTGGCCG ACTTCGGCAT CCAGTACGTG
CTCTACCCGC GCCCGCGGAT CAGCGGGCCC GCCGACGTCA CCATGGTCGA CACCCTGGAC
GGCACACCGG GGCTGGAGCG GCAGTCGCTG TCGCGCCACT ACGCGCTCTG GCGGCTGGCC
GCCCCCACCG GCGCGCTGCG GGTGGTCTCC GAGGACGGCG TCGAAGCCGA GGTCCTCGCG
GTGCGCGGGG ACGCGGACGA GGTGAGCGCC CCGGTCCCCG AGGGCGGCAC CGGGCGGCGG
CTCGCCCTGG CCGAGGCCGC AGACAGCGGC TGGCGCGCCA GCCTGGACGG CGTCGAACTG
GACCCCGTCC CCACCCAGAA CGGAACCCAG GCCTGGGCGC TGCCCGTGGA GGGCGGCGAC
CTGCGCGTCT GGCACACCGA CTACGTCCAC GCGGCCTGGC TGCTCACGCA GGGGGTCCTG
CTCACGGTGG TCGCGGTCCT CGCCGCGCCG GGTGTGCGGA CCGAGGAGGA GGCGCGGCTG
ATCGAGGCCA CGCCCACACC CCGCCCGCGG CGGCCGGAGC GGCTGCGGCG CTCGGGGAGG
TCCCGGGCGT CCTCGCGCCG GGGCTCACGC GCCAGGCCCG GCCGGTCGCG GCCCGGCGGT
GAGGACCCCG GCGTGCGCGC GGACGGGGAC GCCGGAGCGG ACGCGCCGGA GGACGGCGTC
CCGCCCGGTT CCGCGTCCGA GGAGGACACC GGCACGCTGC CCGCCGTGCG GGGCGGGGGC
CGCCGCCGGG GCACCCGCGG CGTGCGCAGG GGGGAGCGGC GCCGTGGCCG GTGA
 
Protein sequence
MPDQDFARHV VTAVIVSHDG SRWLPETMQA LRGQSWPVQR AVAADTGSTD DSAEVLARFL 
PADAVVELPA DTGYGDAVRA ALELPRSTSA VRGFDEDATE WIWLIHDDLT PDRDALAHLL
DAADQDPRAA VLGPKLRDWF DRRLLVEAGV TIDGAGRRET GLEQREFDHG QHDGTRQVLA
VSSAGMLVRR DVWDRLGGFD PSLPLFRDDI DFCWRVGGAG LRAVLVTDAV AYHAEASARR
RRPISATRNH PRRVDRRHAV FVLLANLPLG GMVAALLRNS VASLLRVVGY LLIKQPANAL
DEAAAITLVY LRPLRLMRAR FRRRRDRRRT YSAIRPFLAR GVAMRQVHDV LTGILSGEPV
RNTPGRHQAV TAPPAAEDED EIPTDTAGFV RSVLLRPGVL LVLALAAVAL VAERSLLLGD
LLAGGALPPV AGGAGDLWNL YLSGSPDSGL GAADPVPPYV GLLALLSTLT LGKPWLAVTI
VLLGCVPLSG LTAYLLAREV LGFRPARLWT AAAYALLPAA TGAVAQGRLG TALVHVLLPV
LGLLLVRLVS MPPKPSRRAA WGVGLVLTVA TAFVPMVWLL SLVTGVLVAV AFGHLGRRIY
VSVALGLAVP LVLLMPWTLE LLLHPSLWLL EAGLHRPELS APGPTPQELL MLSPGGPGTP
PFWVTAGFAV AALCSLLLLR NRMLAAAGWS LALFGILVAV LTSRVVVEPY YGGPPAPVWP
GVALTFAATA VLLSAATAAR SFGDMVRLGG PRRVFALAVG LLALATPVCA AGVWMWEGVR
GPVTAHAEPV VPGALTGAGT VEGGGSDGAG TRPRTLVVTA DGEGGVDYLV VRGREPRLGE
EHLVPESGIR GAMDRAVAEL TVGQGGDQMY TLADFGIQYV LYPRPRISGP ADVTMVDTLD
GTPGLERQSL SRHYALWRLA APTGALRVVS EDGVEAEVLA VRGDADEVSA PVPEGGTGRR
LALAEAADSG WRASLDGVEL DPVPTQNGTQ AWALPVEGGD LRVWHTDYVH AAWLLTQGVL
LTVVAVLAAP GVRTEEEARL IEATPTPRPR RPERLRRSGR SRASSRRGSR ARPGRSRPGG
EDPGVRADGD AGADAPEDGV PPGSASEEDT GTLPAVRGGG RRRGTRGVRR GERRRGR