Gene Ndas_4927 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4927 
Symbol 
ID9248814 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp61356 
End bp63689 
Gene Length2334 bp 
Protein Length777 aa 
Translation table11 
GC content72% 
IMG OID 
Productglycoside hydrolase family 31 
Protein accessionYP_003682816 
Protein GI297563843 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.606594 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTTCA CCGACGGCTT CTGGCAGATG AGGGAGGGCG TGCGCGCCAA CCGCGCGCGC 
GAGGCCCGCG ACGTGCGGGT GCACGACGAC CGGTTCACCC TCTACGCCCC GGTGCGCCCG
ATCGGGCACC GGGGGGACAC GCTCAACACA CCGCTGATCA CGGTGGACTG CTGGTCGCCC
GCGCCCGGTG TGATCGGTGT GCGGAGCACG CACCTGGCGG GCTCGGTCCG GCGCGCGCCG
GAGTTCGACG TGCGCAGCGA CCCCGGCGCC GCCCCCTCGG TGGCGCGGGA CGGCTCCTCG
GTGGAGCTGA CCAGCGGGGA GCTGTCGCTG CGGGTGGCCA CCGAGGGCCC CTGGCGGATG
GAGTTCCGCG CGGGCGGCGC CGTGCTGACC GGCTCCGACG CCAAGGGCAC CGCGTTCATG
GAGGACGCGG ACGGCTCCCA CCACATGCTC GGGCAGCTGT CGCTGGGGAT CCGCGAGCTG
GTGTACGGCA TGGGCGAGCG GTTCACGCCG TTCACGCGCA ACGGCCAGAC CGTGGACATC
TGGCAGGCCG ACGGCGGCAC GAGCAGCGAG CAGGCCTACA AGAACGTGCC GTTCTACCTG
ACCAACCGGG GCTACGGGGT GTTCGTGGCG CACTCGGGGC CGGTGTCGTT CGAGGTCGGC
TCGGAGTCGG TGGGCCGCGT GCAGTTCAGC GTGGAGGACC ACGCCCTGAC CTACTACGTG
ATCCACGGGG AGAGCCCCAA GGAGATCCTG GCCCGGTACA CCGCGCTCAC CGGCCGCCCC
GCGCTGCCGC CGCGGTGGTC GTTCGGGCTG TGGCTGTCCA CGTCGTTCAC CACCTCCTAC
GACGAGGACA CGGTCAACCG CTTCATCGAC GGCATGGCGG AGCGGGGCGT GCCGCTCAGC
GTGTTCCACT TCGACTGCTT CTGGATGCGC GAGTTCCACT GGTGCGACTT CGAGTGGGAC
CCGGAGCTGT TCCCCGATCC CGTGGGCATG CTCTCCCGGC TCAAGGGGCG CGGGCTGCGG
ACCTGCGTGT GGATCAACCC CTACATCGCC CAGCGGTCGG CGCTGTTCGA GGAGGGCTCC
CGGCTGGGGC ACCTGGTCCG GCGGCCGGAC GGGACCGTGT GGCAGTGGGA CATGTGGCAG
GCGGGGATGG CGCTGGTGGA CTTCACGTCC GCCGACGCCC GCGCGTGGTA CGCCGGAAAG
CTCAAGGTCC TGCTCGACAT GGGTGTGGAC TGCTTCAAGA CCGACTTCGG CGAGCGGGTG
CCGACCGACG TGGTGTGGTC GGACGGCTCC GACCCGCAGG CCATGCACAA CTACTACACG
CACCTGTACA ACGAGACGGT GTTCGACCTG CTGAAGCGCG AGCGCGGTGA GGGCGAGGCC
GTCCTGTTCG CGCGCTCGGC CACGGCGGGC GGGCAGAGCT TCCCGGTGCA CTGGGGCGGC
GACTGCGCCT CGACGTTCGA GGCGATGGCG GAGAGCCTGC GCGGCGGCCT GTCCCTGGGG
CTGTCGGGGT TCGGGTTCTG GAGCCACGAC ATCGGCGGCT TCGAGGGCAC CCCCGACGCC
GCGGTGTTCA AGCGCTGGCT CGCGTTCGGC CTGCTCTCCT CGCACAGCAG GCTGCACGGC
AGCCGCTCCT ACCGGGTGCC GTGGGACTTC GACGAGGAGT CCACCGAGGT GGCCCGCGTG
TTCACCCGGC TCAAGTGCGC GCTCATGCCC TACCTGTTCG GCGCGGCCGT GCAGGCCCAC
CGGGAGGGGA CGCCCGTGAT GCGCGCGATG CTGCTGGAGT TCCCCGACGA CCCGACCTGC
CACCACCTGG ACACGCAGTA CATGCTGGGT GAGGACCTGC TGGTCGCCCC GGTGCTGAGC
GCGGACGGCT CCGTGGAGTA CTACGTCCCC GAGGGCGTGT GGACCCACCT GATCACGGGC
GAGACGGTGC GGGGCCCGGT CTGGCGCCGC GAGACCCACG GGTTCGACTC CCTGCCCCTG
CTGGTGCGGC CGAACGCGGT CCTGCCGGTC GGCGCGGTGG ACGACCGGCC CGACTACGAC
TACACGGACG GGCTGACCCT GCGCGTGTAC GGTGCCGGGG AGGCGGCCAC GGCGACCACC
ACGGTCGTGC CGTCCGCCGA CGGCTCGGCC GCCGCGGTCT TCCGGACCGA GCGCTCGGGG
GGCGGGGTCA CCGTGGAGGC CGGGGCCGCT CCGGCCCACG GGTGGCGGGT GCTGCTGGTC
GGCACTGGCG GGGCCGAGAC CGATGGCACC GGGGCGGAGG TCACCGTGAC CGACGACGGC
ACCCTGGTGT CGGTTCCGGC AGGGACCGCC CGCGTGGACC TGCGCCTGTC CTGA
 
Protein sequence
MKFTDGFWQM REGVRANRAR EARDVRVHDD RFTLYAPVRP IGHRGDTLNT PLITVDCWSP 
APGVIGVRST HLAGSVRRAP EFDVRSDPGA APSVARDGSS VELTSGELSL RVATEGPWRM
EFRAGGAVLT GSDAKGTAFM EDADGSHHML GQLSLGIREL VYGMGERFTP FTRNGQTVDI
WQADGGTSSE QAYKNVPFYL TNRGYGVFVA HSGPVSFEVG SESVGRVQFS VEDHALTYYV
IHGESPKEIL ARYTALTGRP ALPPRWSFGL WLSTSFTTSY DEDTVNRFID GMAERGVPLS
VFHFDCFWMR EFHWCDFEWD PELFPDPVGM LSRLKGRGLR TCVWINPYIA QRSALFEEGS
RLGHLVRRPD GTVWQWDMWQ AGMALVDFTS ADARAWYAGK LKVLLDMGVD CFKTDFGERV
PTDVVWSDGS DPQAMHNYYT HLYNETVFDL LKRERGEGEA VLFARSATAG GQSFPVHWGG
DCASTFEAMA ESLRGGLSLG LSGFGFWSHD IGGFEGTPDA AVFKRWLAFG LLSSHSRLHG
SRSYRVPWDF DEESTEVARV FTRLKCALMP YLFGAAVQAH REGTPVMRAM LLEFPDDPTC
HHLDTQYMLG EDLLVAPVLS ADGSVEYYVP EGVWTHLITG ETVRGPVWRR ETHGFDSLPL
LVRPNAVLPV GAVDDRPDYD YTDGLTLRVY GAGEAATATT TVVPSADGSA AAVFRTERSG
GGVTVEAGAA PAHGWRVLLV GTGGAETDGT GAEVTVTDDG TLVSVPAGTA RVDLRLS