Gene Ndas_3986 details


Gene Information

Locus tag: Ndas_3986
Symbol:
ID: 9247857
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4767938
End bp: 4769122
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: Endo-1,4-beta-xylanase
Protein accession: YP_003681889
Protein GI: 297562915
COG category:
COG ID:
TIGRFAM ID:
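
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4767938, 4769122)
print(length_bp)                  # 1185
print(protein_length(length_bp))  # 394
```

Both values agree with the Gene Length and Protein Length fields of the record.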


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACCGC TCCGCCTCGT CACCGCCGTC CTCGCCGCGA CGGCGCTGGC GGTCCCCCTG 
GCCGCCCCCG CGTCCGCCGC CCCCGGGCAC GACCGCGGCG ACGGCCACCG GTCGCCGCAC
CACCCCCGCC ACGCCAAGCC GGGCACGCTG CGCTGGGCCG CGCCAGACGG CCTGGTCATC
GGCACCGCCG TCGCGGGCGG CGGGCACCAC GTGGACCAGG ACTACCCCGA CCCCTTCACC
TCGGACCGGC GGTACCGCCG GATCCTGGCC CGGGAGTTCA GCTCGGTCTC CCCCGAGAAT
CAGATGAAGT GGGAGTACAT CCACCCCGAG CGGGACGAGT ACGCGTTCGG GATGGCCGAC
AGGATCGTGG ACTTCGCCGA GCGCCACGGC CAGGACGTGC GCGGGCACAC CCTGCTGTGG
CACAGCCAGA ACCCCGCGTG GCTGGAGGAG GGCGACTTCA CCGACGAGGA GCTGCGGGAG
ATCCTGCGCG ACCACATCAC CACCGCCGTC GGCCGCTACG AGGGCCGGAT CTCGCAGTGG
GACGTGGCCA ACGAGATCTT CGACGAGTCG GGGCGGCTGC GCACCGAGGA CAACATCTGG
ATCCGCGAGC TCGGGCCGGG GATCATCGCC GACGCCTTCC GCTGGGCGCA CGAGGCCGAC
CCGTCGGCGG AGCTGTACTT CAACGACTAC GGCGTCGAGG ACGTCAACGC CAAGAGCGAC
GCCTACCACG CGCTGGTCCA GGAGCTGCTG GCGGACGGCG TGCCGGTGCA CGGGTTCTCC
GCCCAGGTGC ACCTGAGCAT GCGGTACGGG TTCCCGTCCG GCCTGGAGGA GAACCTCCAG
CGCTTCGACG ACCTCGGCCT GGGCACGGCG CTGACCGAGG TGGACGTGCG CATGGACGTG
CCCGGGGGCG GGGAGCCGAC CGGGGAGCAG CTGGAGCGGC AGGCCGACTA CTACGGTCGG
GCGCTGGAGG CATGCCTCGG GGTGGAGGGC TGCGACTCCT TCACCATCTG GGGCTTCACC
GACAAGTACT CGTGGGTACC GGTGTTCTTC GAGGGCGAGG GAGCGGCGAC CGTCATGGAC
GGGGACTTCG ACCGCAAGCC CGCCTACTTC ACCCTCCGGT CCCTCCTGGA GGAGGCCGAG
GACGACGGGC GGGGCCGCCC GCCCCGGGCG GGCCGCGGCC GGTAG
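
The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field of the record. The helper names are illustrative assumptions, and only the first displayed line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First displayed line of the gene sequence, used here only as sample input.
first_line = "ATGAAACCGC TCCGCCTCGT CACCGCCGTC CTCGCCGCGA CGGCGCTGGC GGTCCCCCTG"
seq = clean_sequence(first_line)
print(len(seq))          # 60
print(gc_fraction(seq))
```

Running the same functions on the full 1185 bp sequence gives a value to compare with the reported GC content.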
 
Protein sequence
MKPLRLVTAV LAATALAVPL AAPASAAPGH DRGDGHRSPH HPRHAKPGTL RWAAPDGLVI 
GTAVAGGGHH VDQDYPDPFT SDRRYRRILA REFSSVSPEN QMKWEYIHPE RDEYAFGMAD
RIVDFAERHG QDVRGHTLLW HSQNPAWLEE GDFTDEELRE ILRDHITTAV GRYEGRISQW
DVANEIFDES GRLRTEDNIW IRELGPGIIA DAFRWAHEAD PSAELYFNDY GVEDVNAKSD
AYHALVQELL ADGVPVHGFS AQVHLSMRYG FPSGLEENLQ RFDDLGLGTA LTEVDVRMDV
PGGGEPTGEQ LERQADYYGR ALEACLGVEG CDSFTIWGFT DKYSWVPVFF EGEGAATVMD
GDFDRKPAYF TLRSLLEEAE DDGRGRPPRA GRGR
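
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal translator, not the tool used to produce this record. It relies on the fact that translation table 11 shares its codon-to-amino-acid assignments with the standard code and differs only in the permitted start codons. Since this gene starts with ATG, no special handling of alternative start codons is included, which is a simplification.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS given as text; whitespace is ignored.

    A single trailing stop codon is dropped. Alternative start codons
    are not converted to M (simplifying assumption).
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence.
print(translate("ATGAAACCGC TCCGCCTCGT CACCGCCGTC"))  # MKPLRLVTAV
print(translate("GGCCGCGGCC GGTAG"))                  # GRGR
```

The two fragments reproduce the first ten and last four residues of the protein sequence shown in the record.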