Gene Ndas_4760 details

Gene Information

Locus tag: Ndas_4760
Symbol:
ID: 9248642
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 5647956
End bp: 5649137
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: Glucan endo-1,3-beta-D-glucosidase
Protein accession: YP_003682651
Protein GI: 297563677
COG category:
COG ID:
TIGRFAM ID:
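
The length fields above are consistent with 1-based inclusive coordinates: the gene length equals End bp minus Start bp plus one, and the protein length equals the codon count minus the stop codon. The following is a minimal sketch of that check; the function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a single stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(5647956, 5649137)
print(length, protein_length(length))  # 1182 393
```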


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.68587
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.425556
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGACC GAAGACTGTT CCTCGGCGGC GCATCGCTCG CCACCGTCAG CACCCTGCTC 
GTCACGACCG GTTGCTCCCG TGCGGAGAGC AACGGCGAAC TCACCTCGGG CGGCTCTCAG
GCCGGCGCCC TTCCGGTGAG GGTCACCAAC GAGACCGGGA CCTTCCCCGA CAGCGAGATC
CGCGCCTACA TCGTGGGAAC CGACCTGACC ACGAACACGC AGTCCTGGGT CGACGCCAGC
GGCGTCGCGC ACCCGGTCGC GGAGTCCGAC AACACCGACG ACGGGTTCAC CGACTACTCG
ATCCCGCTGG ACTCGGTCAG CGAGCTGTCC CTGCCGTTCA TGTCCGGCCG GGTGTACTTC
GCCCTCGGCG GCAAGCTCCG GTTCAAGGTC GTCACCGACG GCAACGGCAG GCCCGCGCTC
CAGTACCCGG CGGGCTGGGT CGAGTCCGAC CCCAGCCACG GGGTGCTCCA CGACACCGTG
GAGTTCACCC ACAACGAGAC CGGCATGTAC TGCAACACCA GCATGGTGGA CCAGTTCAGC
GTGCCGCTGG CCATCCGCCT CCAGGGCGAG GCCGACCAGA CCACCGGCAC GTTCGAGCCG
GGCGGCCGCG ACGCGGTCTT CGCCACGCTC TCCGCCGACC CCGTCTTCTC GTCGCTCGTG
CTGAGCGAGG GAGAACTCGT CATCGCCCCC GGGCACGGGC TGGACGCCGG ACTGTTCCCG
GCCGACTACT ACGACTCCTA CATCGACCAG GTCTGGGAGC GCTACAGCAC CACCGACCTG
CGCGTCACCA CCAACGCCGG GACCTTCACC GGCCGCGTGG ACGGCTCGGG CAACCTGGTC
TTCGACGGCG GCGTGGCCCC GATCCCCAAG CCCTCCACCC GCGACGTCTT CTTCTGCGAC
GGCGCCCTGG CGGCCCCCAA CGACGGCATC ACCGGTCCGG TGGCCGCGAT CCTGGGCGCG
GCGTTCAACC GCTCGACGCT GCTGGACACC GCCGAGCACC CGGTCACCGA CCCCGCGGCG
TTCTACAACC ACGAGACCAC CAACCGCTAC GCGGCGGTCT TCCACGAGAA CACCGTGGAC
GGCAAGGCCT ACGGCTTCGC CTTCGACGAC GTGTCGAACT TCGCCTCCTA CGTCCAGGAC
CACGCGCCGG TGTCCCTCGA CGTGACGCTG ACCGCGTTCT AG
 
Protein sequence
MIDRRLFLGG ASLATVSTLL VTTGCSRAES NGELTSGGSQ AGALPVRVTN ETGTFPDSEI 
RAYIVGTDLT TNTQSWVDAS GVAHPVAESD NTDDGFTDYS IPLDSVSELS LPFMSGRVYF
ALGGKLRFKV VTDGNGRPAL QYPAGWVESD PSHGVLHDTV EFTHNETGMY CNTSMVDQFS
VPLAIRLQGE ADQTTGTFEP GGRDAVFATL SADPVFSSLV LSEGELVIAP GHGLDAGLFP
ADYYDSYIDQ VWERYSTTDL RVTTNAGTFT GRVDGSGNLV FDGGVAPIPK PSTRDVFFCD
GALAAPNDGI TGPVAAILGA AFNRSTLLDT AEHPVTDPAA FYNHETTNRY AAVFHENTVD
GKAYGFAFDD VSNFASYVQD HAPVSLDVTL TAF
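
The gene and protein sequences above are printed in space-separated blocks of ten residues. As a hedged sketch, the helpers below (hypothetical names, not provided by the source database) strip that grouping, compute the GC fraction, and translate a coding sequence. The codon string follows the NCBI layout for the standard code; translation table 11 assigns the same amino acids and differs only in which codons may act as starts, which this sketch does not model.

```python
BASES = "TCAG"
# Amino acids for codons in the order TTT, TTC, TTA, TTG, TCT, ..., GGG.
# Table 11 uses the same assignments as the standard code; "*" marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group residues."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Assumption: the sequence contains only A, C, G, T and starts in frame.
    Alternative start codons (e.g. GTG, TTG) are not converted to M.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        first, second, third = dna[i:i + 3]
        index = 16 * BASES.index(first) + 4 * BASES.index(second) + BASES.index(third)
        residue = AMINO_ACIDS[index]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last codons of the gene sequence listed above.
print(translate("ATGATCGACC GAAGACTGTT CCTCGGCGGC"))  # MIDRRLFLGG
print(translate("ACCGCGTTCT AG"))  # TAF
```

Passing the full gene sequence to `translate` should yield the listed protein sequence once its spacing is removed with `clean`, and `gc_fraction` on the same input gives one way to check the reported GC content.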