Gene Ndas_3770 details


Gene Information

Locus tag: Ndas_3770
Symbol:
ID: 9247639
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4530116
End bp: 4531378
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: glycoside hydrolase family 18
Protein accession: YP_003681674
Protein GI: 297562700
COG category:
COG ID:
TIGRFAM ID:
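
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon (the gene sequence in this record ends in TGA). The constant and function names are hypothetical and chosen only for illustration.

```python
# Values copied from the record above.
START_BP = 4530116
END_BP = 4531378
REPORTED_GENE_LENGTH = 1263     # bp
REPORTED_PROTEIN_LENGTH = 420   # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == REPORTED_GENE_LENGTH)
print(computed_protein_length == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions, both comparisons agree with the reported values: 1263 bp spans 421 codons, of which the last is the stop codon.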


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGAAC ACCGCCTCAG GAAGGCCCCC AGGGCCGCGA AGGCCGCGCT GACCGGCGCC 
CTCGCCCTCC TGACCGCCGG ACTCCTCGCC GGTTTCAGCC AGCTCTCCCC CGCTCCCGCC
GAGGTCGAGA CCGTGTCCGA CAGCGACTCA GCCCAGGTCC AGCAGACCGG CCAGTGGCTC
ACCGGGTACT GGCACAACTT CAACAACGGC TCCACCGTGA TGCCGCTGTC GGAGATCCCC
TCCGAGTACA ACCTGGTGGC CGTCGCCTTC GCCGACAACC ACCCCACCCT CGACGGCGGA
ATCACGTTCA ACCTGGCCAG CGGCGAGCTG GGCGGCTACA CCGACCAGCA GTTCCGCGAC
GACATCGCGG CCATCCAGGC CGAGGGCCGC AAGGTCATCA TCTCCGTGGG CGGCGAGCGC
GGCCACGTGG ACGTCACCAA CGCCACCCAG GCGGGCAACT TCGCCGACAC CGTGTACCAG
CTGATGCAGG ACTACGGCTT CGACGGGGTC GACATCGACC TGGAGCACGG CATCAACGCG
CAGTACATGT CCCAGGCCCT GCACGACCTG AGCGGCCGGG CCGGGTCGGA CCTGATCATC
ACGATGGCGC CGCAGACCAT CGACTTCCAG AGCCCCGACC GGGAGTACTA CAAGCTGGCG
TCGGACATCT CCGACATCCT CACCATCGTC AACATGCAGT ACTACAACTC CGGCTCGATG
CTGGGGTGCG ACGACCAGGT GTACCACCAG GGCACCCCGG ATTTCGTCGC CGCGCTGGCC
TGCATCCAGC TGGAGATGGG GCTGAGCCCG GACCAGGTGG GCCTCGGGCT GCCCGCCGTG
CAGTCCGCCG CCGGGGGCGG GTACATGGCG CCCGGCCAGG TCGTGCGCGC CCTGGACTGC
CTGGAGGCCG GGACCGACTG CGGCTCCTTC TCCCCCGCCG CCCCGTACGG CCCCATCGGC
GGTGTGATGA CCTGGTCCAT CAACTGGGAC GCCACCAGCG GCTACGCCTT CGCCGAGACC
ATCTCGGCGG GCCTGGCCAC CGGCCCCGGC AACGGCGGCG GGCCCACCGA GGAGCCGACC
GAGGAGCCCA CCGAGCAGCC CGGCGACTGC ACCGCCGCGG CCTGGACGGC CGACAGCGTC
TACACCGGCG GCGACGTGGT CTCCCACGAG GGCTCGGAGT ACCGCGCCCA GTGGTGGACC
CGCGGCGAGG AGCCGGGCAC CACCGGCGAG TGGGGCGTCT GGAGGCTCGT CGGCACCTGC
TGA
 
Protein sequence
MPEHRLRKAP RAAKAALTGA LALLTAGLLA GFSQLSPAPA EVETVSDSDS AQVQQTGQWL 
TGYWHNFNNG STVMPLSEIP SEYNLVAVAF ADNHPTLDGG ITFNLASGEL GGYTDQQFRD
DIAAIQAEGR KVIISVGGER GHVDVTNATQ AGNFADTVYQ LMQDYGFDGV DIDLEHGINA
QYMSQALHDL SGRAGSDLII TMAPQTIDFQ SPDREYYKLA SDISDILTIV NMQYYNSGSM
LGCDDQVYHQ GTPDFVAALA CIQLEMGLSP DQVGLGLPAV QSAAGGGYMA PGQVVRALDC
LEAGTDCGSF SPAAPYGPIG GVMTWSINWD ATSGYAFAET ISAGLATGPG NGGGPTEEPT
EEPTEQPGDC TAAAWTADSV YTGGDVVSHE GSEYRAQWWT RGEEPGTTGE WGVWRLVGTC
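
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It uses only the first 60 bases of the gene sequence and builds the codon table by hand; translation table 11 assigns the same amino acids to codons as the standard code (it differs in permitted start codons), so the standard assignments are used here. The helper names `translate` and `gc_percent` are hypothetical, and the GC helper is shown only as one plausible way a GC content figure might be computed, not necessarily how the database derived its value.

```python
from itertools import product

# Standard codon assignments in TCAG order; shared by translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence from the record above.
first_60 = "ATGCCCGAAC ACCGCCTCAG GAAGGCCCCC AGGGCCGCGA AGGCCGCGCT GACCGGCGCC"

print(translate(first_60))  # MPEHRLRKAPRAAKAALTGA
print(round(gc_percent(first_60)))
```

The translated fragment matches the first 20 residues of the protein sequence in this record. Passing the full gene sequence to the same functions would allow the complete protein and the overall GC content to be compared against the reported fields.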