Gene Ndas_3519 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3519 
Symbol 
ID9247388 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4226866 
End bp4228632 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content68% 
IMG OID 
Product1, 4-beta cellobiohydrolase 
Protein accessionYP_003681426 
Protein GI297562452 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACAC CCCCCCGTGC CGCGAACCTA CGTTCGTGGA CACGACGCGG CCTGGCCGCG 
ACCTCCGCCC TGGTGCTCGG CAGCACCCTG GCGGTCGCCT CCTCCGTACC CGCCAGCGCG
GCGGCCGGTT GCGAGGTCGA CTACCAACTC AACGACTGGG GCTCCGGGTT CACCGCGAGC
GTGGAGATCA CCAACCTCGG CGACGCGGTC AACGGCTGGA CCCTGGAATG GGACTTCGCC
GGAAACCAGC GGATCACCAA CTCCTGGAAC GGCACCGTCA CCCAGAGCGG ACAGAGCGTC
TCGGTCACCA ACGCCGGGCA CAACGCCTCA CTCTCCACCG ACGGCACCGC CAGCTTCGGC
TTCCAGGGAA GCTACACCGG AAGCAACGCC GCGCCCACGG CCTTCGAACT GAACGGCGTG
CTGTGCAGCG GTGACGTCGA GGAGCCGGAG GAACCGGAGG AACCGGAAGA GCCCGGGGAG
CCCGAGGAGC CCGGCAGCAA CGGTCGGGTG GACAACCCGT ACGTGGGCGC TGAGGTGTAC
GTCAACCCGA TCTGGTCGGC CAACGCCGCC GCCGAGCCCG GCGGGGACGC CGTGGCCGAC
GAGCCCACCG GGGTGTGGCT GGACCGCATC AGCGCCATCG AGGGCAACGA CAGCCCCACC
ACCGGCAGCA TGGGACTGCG CGATCACCTG GACGAGGCCC TGGCCCAGGC CAACGGTGAA
CCCCTGGTGT TCCAGGTGGT CATCTACAAC CTGCCCGGCC GTGACTGCGC CGCTTTGGCC
TCCAACGGCG AGCTGGGCCC GGACGAGATC GACCGGTACA AGAACGACTA CATCGATCCC
ATCGCCCAGA TCCTGGCCGA TTACGAGGAC ACCGAGCTGC GGGTGGTGAC CACGGTCGAG
ATCGACTCGC TGCCCAACCT GGTCACCAAC GTCTCCCCGC GCGAGACCGC GACCGAGAAC
TGCGACGAGA TGCTGGCCAA CGGCAACTAC GTCGAGGGGG TGGGCTACGC GCTGGCGCAG
CTGGGCGCGA TCGACAACGT CTACAACTAC GTCGACGCCG GCCACCACGG GTGGATCGGT
TGGCAGGACA ACTTCACCGC CTCGGCGGCG CTGTTCTTCG AGGCGGCCAA CGCCGCCGGT
GCCAGCCCCG ACGATGTGCA CGGGTTCATC GCCAACACCG CGAACTACTC GGCTCTGGTG
GAGGAACACT TCTCCGTCGA CCAGACCATC GCCGGTACGC CGGTGCGCCA GTCGGAGTGG
GTGGACTGGA ACCAGTTCAC CGACGAGTTG TCCTTCGCCC AGGCTCTGCG TGAGGAGCTG
GTGGGTCAGG GGTTCGATTC GGGGATCGGG ATGCTCATCG ACACCTCCCG TAACGGGTGG
GGTGGGCCGG ACCGTCCGGA CGGGCCGGGG CCGAGCACGG ATGTGAACGC CTACGTGGAC
GGGGGCCGCT ACGACCGCCG TCTCCAGTCG GGGAACTGGT GCAACCAGTC CGGTGCGGGG
CTGGGTGAGC GCCCCCAGGC CGCTCCGGAG GCGGGGATCG ACGCCTATGT GTGGATGAAG
CCGCCGGGTG AGTCCGACGG GTCCAGTGAG TTCATCGAGA ACCCCGAGGG CAAGGGGTTC
GACCGGATGT GCGATCCGAC CTATGAGGGC AACCCGCGCA ACAACTACAA CATGAGTGGC
GCGCTGCCCA ACGCGCCGAT CTCGGGGCAC TGGTTCTCCG CGCAGTTCCA GGAGCTTTTG
GCCAACGCCT ACCCGCCCAT CCAGTAG
 
Protein sequence
MSTPPRAANL RSWTRRGLAA TSALVLGSTL AVASSVPASA AAGCEVDYQL NDWGSGFTAS 
VEITNLGDAV NGWTLEWDFA GNQRITNSWN GTVTQSGQSV SVTNAGHNAS LSTDGTASFG
FQGSYTGSNA APTAFELNGV LCSGDVEEPE EPEEPEEPGE PEEPGSNGRV DNPYVGAEVY
VNPIWSANAA AEPGGDAVAD EPTGVWLDRI SAIEGNDSPT TGSMGLRDHL DEALAQANGE
PLVFQVVIYN LPGRDCAALA SNGELGPDEI DRYKNDYIDP IAQILADYED TELRVVTTVE
IDSLPNLVTN VSPRETATEN CDEMLANGNY VEGVGYALAQ LGAIDNVYNY VDAGHHGWIG
WQDNFTASAA LFFEAANAAG ASPDDVHGFI ANTANYSALV EEHFSVDQTI AGTPVRQSEW
VDWNQFTDEL SFAQALREEL VGQGFDSGIG MLIDTSRNGW GGPDRPDGPG PSTDVNAYVD
GGRYDRRLQS GNWCNQSGAG LGERPQAAPE AGIDAYVWMK PPGESDGSSE FIENPEGKGF
DRMCDPTYEG NPRNNYNMSG ALPNAPISGH WFSAQFQELL ANAYPPIQ