Gene Ndas_2218 details

Gene Information

Locus tag: Ndas_2218
Symbol:
ID: 9246068
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2649757
End bp: 2650878
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 67%
IMG OID:
Product: Endo-1,4-beta-xylanase
Protein accession: YP_003680146
Protein GI: 297561172
COG category:
COG ID:
TIGRFAM ID:
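
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2649757, 2650878)
print(length)                  # 1122
print(protein_length(length))  # 373
```

Both values agree with the Gene Length and Protein Length fields listed for this gene.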


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000248753
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000085459
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGCCCAGCA CGGCCTCCTC CCCCTGGCGC CCGCTCACCC GCGGACGCGG AGCCCTGCTG 
ACCGCCACCG CCCTGTGCGC GATCATCCTC CTGGTCACCG CGATCCTGGC CCTCACCCTG
CCCGCCCAAC CAGAATCCAC CGGCGCCGCA CAACTCCCCC ACCTGGCCCG CCAGAGCGAC
ATCGACCTGG GCGTCGCCGT AGCCGTCAAC CCCCTCGTCC ACGACCGCGA GTACCGCTCG
GTGGTCACCG AGCACTACAC CTCCCTGACC GCCGAGAACA CCATGAAATG GGAGTACGTC
CAACCCCGAC GCCACACCTT CGACTGGAGC GGCCCCGACA CCGTCGTCGA CTTCGCCGAA
CGCAACAACC TCACCGTGCA CGGCCACACC CTGCTCTGGC ACAACCAACA ACCCGCCTGG
CTCGCACAAG GCACCTGGAC CCCCGACGAA CTACGCCAGG TCATCCGCGA ACACATGCAG
GCCCTCATGG GCCGCTACCA AGGACGCGTC ACCTCCTGGG ACATCATCAA CGAACCCTTC
GAGGACAGCG GCCCCCGCCT ACGCGACAAC CTCTGGTACC AGGTCCTGGG CGAGGACTAC
ATCGCCGAGG CCCTCACCAC GGCCCACGAG ATCGACCCCC AGGCCCGCCT CTACATCAAC
GAGTTCGGCA TCGAAGGCGG CGGCCCCAAG ACCGACGCCC TCTACCAACT GGTCACCACC
CTGCTCGAAC GCGACGTCCC CCTGCACGGC ATCGGCTTCC AGAGCCACTT CATCCACGGA
CACGTCCCCG ACGACCTCGC CGAACAACTA CGCCGCTTCA CCGACCTGGG CCTGGAGGTC
AGCATCAGCG AACTCGACGT ACGCATCCCC GAACCCGTCC CCAACGGAGC CCTCCAGGAC
CAGGCCCGCG AATACGCACA GGTCGTCCAG GCCTGCCTGG ACGTACCCCG CTGCGTACGC
GTGTCCGTGT GGGGAGTCAG CGACCAGCAC TCCTGGATCC CCGAATGGTT CCCCGGCTAC
ACCGCCGCAC TGCCCTTCGA CGACTCCTAC GCACCCAAAC CAGCACTCAA CGCCATGGTC
CAGACCCTCT CCCGCCGACA CCCCCGTCCC CCGACGCACT GA
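
The GC content field can be recomputed from the gene sequence. The following is a minimal Python sketch, assuming GC content is defined as the fraction of G and C bases over all bases; the function name `gc_content` is illustrative, and the example uses only the first three 10-base blocks of the sequence above rather than the full gene.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases; spaces and line breaks are ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First three 10-base blocks of the gene sequence, as a small example.
example = "ATGCCCAGCA CGGCCTCCTC CCCCTGGCGC"
print(f"{gc_content(example):.0%}")
```

Passing the full gene sequence, with its block spacing left in place, should give a value comparable to the GC content field.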
 
Protein sequence
MPSTASSPWR PLTRGRGALL TATALCAIIL LVTAILALTL PAQPESTGAA QLPHLARQSD 
IDLGVAVAVN PLVHDREYRS VVTEHYTSLT AENTMKWEYV QPRRHTFDWS GPDTVVDFAE
RNNLTVHGHT LLWHNQQPAW LAQGTWTPDE LRQVIREHMQ ALMGRYQGRV TSWDIINEPF
EDSGPRLRDN LWYQVLGEDY IAEALTTAHE IDPQARLYIN EFGIEGGGPK TDALYQLVTT
LLERDVPLHG IGFQSHFIHG HVPDDLAEQL RRFTDLGLEV SISELDVRIP EPVPNGALQD
QAREYAQVVQ ACLDVPRCVR VSVWGVSDQH SWIPEWFPGY TAALPFDDSY APKPALNAMV
QTLSRRHPRP PTH
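
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal Python sketch, not the database's own method: it builds the codon table from the conventional TCAG ordering and stops at the first stop codon. Translation table 11 assigns the same amino acids as the standard code and differs only in its permitted start codons; since this gene begins with ATG, no special handling of alternative starts is included. The name `translate_cds` is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()  # drop block spacing and newlines
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two 10-base blocks of the gene sequence (the trailing partial codon is ignored).
print(translate_cds("ATGCCCAGCA CGGCCTCCTC"))  # MPSTAS
```

The output matches the first six residues of the protein sequence above.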