Gene Ndas_0755 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0755 
Symbol 
ID9244597 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp925470 
End bp926993 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content73% 
IMG OID 
Productalpha-L-arabinofuranosidase domain protein 
Protein accessionYP_003678706 
Protein GI297559732 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.426486 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCCGAG CCTCCGTCAC CGTCGACCCG GCGGCCATCG TCTCCCCCGT GCACCGCCGC 
ACCTTCGGCT CGTTCGTCGA GCACATGGGC CGCTGCGTCT ACACCGGCAT CTACGAACCC
GGGCACCCGA CGGCCGACGC CGACGGCTTC CGACGCGACG TCGCGGACCT GGTCCGGGAA
CTGGGCGTCA CCACCGTGCG CTACCCGGGC GGCAACTTCG TGTCCGGGTA CCGGTGGGAG
GACGGCGTCG GCCCCCGGGA CCGCCGACCG GTCCGCCGCG ACCTGGCCTG GCACAGCATC
GAGACCAACC AGTTCGGCCT CGACGAGTTC ACCGCCTGGT GCCGCGGCCT GGACATCGAG
CCGATGATGG CGGTCAACCT CGGCACCCGC GGCCTGGCCG AGGCCCTGGA CCTGCTGGAG
TACTGCAACC ACCCCGGCGG CACCCACCTG TCCGACCAGC GCGCCGCCAA CGGGCACCCC
GAACCGCACG GCATCCGCAT GTGGTGCCTG GGCAACGAGA TGGACGGCCC CTGGCAGATC
GGCCACCTCG ACGCGCGCTC CTACGGACGC AAGGCCGGGC AGGTGGCGCG CGCCATGAGG
ATGGCCGACC GCGACCTCGA ACTGGTCGTC TGCGGCAGCT CCGGATCGGC CATGCCCACC
TTCGGCCAGT GGGAGGCCAC CGTCCTGGAG GAGACCTACG ACGCGGTGGA CCACATCTCG
CTGCACGCGT ACTACGAGGA GCGCGACGGG GACCTGGCCG ACTTCCTCGG CTCGTCGACC
GACATGGACC GCTTCATCGA CTCCGTCGTC TCCACCGCCG ACGCCGTCGG CGCGCGCCTG
CGCGACCCCA AACGCATCCA GCTCTCCTTC GACGAGTGGA ACGTGTGGTA CCTGAGCCGC
CACCAGGCCC GGGCCGCCGC GCAGCCCGCA GACGACTGGC AGGTGGCGCC CCGCGTCATC
GAGGACCGCT ACAGCGTCGC CGACGCCGTC GTCGTGGGCA ACATGCTCAT CAGCCTGCTC
CGGCACGGCG ACCGGGTCAC CGCCGCCAGC CAGGCCCAGC TCGTCAACGT CATCGCGCCG
ATCATGACCG AGCCCGGCGG CCCCGCGTGG CGCCAGACCG TCTTCCACCC CTTCGCCCTG
ACCGCTCGGG CCGCACGGGG CCGGGTGCTG CACACCGGCG TCACCGCGCC CCGGTACACG
ACCGCCAGCC ACGGCGAGAT CCCGCTCCTG GACGCCGTCG TCACCTTCGA CGAGGAGGAG
GGCACCGCGT CGCTGTTCGC GGTCAACCGC TCCACCGACC AGCACCTCGC CCTCGCCGCC
GACCTGCGCG GCCTGGCCCC GACGGCCGTC ACGGACGCCC GGACCCTCAG CGACGAGGAC
CCCTACGCCC ACAACACCAT GGACGCCCCC GACCGGGTCG TCCCGCGACC GGCCGGGGGC
GTGACCCTGG ACGGCGGCAG GCTCTCCGCC GTCCTGCCCC CCGTGTCCTG GTCCGTCATC
ACCCTCTCGA CCAGTCGGAA CTGA
 
Protein sequence
MLRASVTVDP AAIVSPVHRR TFGSFVEHMG RCVYTGIYEP GHPTADADGF RRDVADLVRE 
LGVTTVRYPG GNFVSGYRWE DGVGPRDRRP VRRDLAWHSI ETNQFGLDEF TAWCRGLDIE
PMMAVNLGTR GLAEALDLLE YCNHPGGTHL SDQRAANGHP EPHGIRMWCL GNEMDGPWQI
GHLDARSYGR KAGQVARAMR MADRDLELVV CGSSGSAMPT FGQWEATVLE ETYDAVDHIS
LHAYYEERDG DLADFLGSST DMDRFIDSVV STADAVGARL RDPKRIQLSF DEWNVWYLSR
HQARAAAQPA DDWQVAPRVI EDRYSVADAV VVGNMLISLL RHGDRVTAAS QAQLVNVIAP
IMTEPGGPAW RQTVFHPFAL TARAARGRVL HTGVTAPRYT TASHGEIPLL DAVVTFDEEE
GTASLFAVNR STDQHLALAA DLRGLAPTAV TDARTLSDED PYAHNTMDAP DRVVPRPAGG
VTLDGGRLSA VLPPVSWSVI TLSTSRN