Gene Ndas_1082 details


Gene Information

Locus tag: Ndas_1082
Symbol:
ID: 9244928
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1329328
End bp: 1330881
Gene Length: 1554 bp
Protein Length: 517 aa
Translation table: 11
GC content: 73%
IMG OID:
Product: Alpha-N-arabinofuranosidase
Protein accession: YP_003679030
Protein GI: 297560056
COG category:
COG ID:
TIGRFAM ID:
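
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1329328
end_bp = 1330881
gene_length_bp = 1554
protein_length_aa = 517


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus one stop codon (assumes an unspliced, complete CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Compare the derived values with the lengths reported in the record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions both checks print `True`: the span covers 1554 bp, which is 518 codons, that is, 517 amino acids plus one stop codon.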


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.286531
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCACCAC GCACGTCCCC TCGCCCGCGA CACCACCTGA CGGCTCTCGG CGCCCTGACC 
CTGGCGCTGC TCGTACTCCC CCTCGCCACC ACCACCACCG CCGCGTCCTC CAGCGGTACC
GCCACCACGG CGGCCGATGC CGCCGGACCG GCAGTTCAGG CGGTTCCCGC CGAGTCCGCC
CAGGAGGGAA CGTTCCGCAA CCCCCTCAAC GCCGGAGCAG ACCCCACGAT CGTGCACCAC
GACGGGAACT ACTACCTGTC CACCACCCAG GGCGACCGCA TCTCCGTGTG GAGCTCGCCC
AGCCTGGCCA CCCTGGCCAC CGCCGAACCC GTGGAGGTGT GGCGCGACAG CGATCCCAGC
CGCGACACCG AACTGTGGGC CCCGGCGCTG CACCGGTTCC AGACCGGGGA CGGGCCGCGC
TGGTACCTCT ACTACACGGC CGCCGACAGC AGGCTGACCG ACCCCGTGGA GCGGGACGCC
AGCCACCGGC TCTACGTCCT GGAGTCGGCC GACGACGACC CGGCCGGACC CTACGAGTTC
AAGGCACGGA TCGCCGACAC CGGCACCTAC GCCATCGACG GCGAGCCGTT CGTGCACGAC
GGGCAGCCCT ACTTCGCCTG GAGCAGCCCC GGACGCGGGT TCGACGGCGG CCCCCAGCAG
CTCTACGCCG CGCGGATGAG CAACCCCTGG ACGATCGAGG GGGAACCCGT CGCGCTGCCC
AACGAGGGCG GCTGCCCCGA GGTCCGGGAG GGGCCGACCC CGCTGTACCG CGACGGCCGG
ACCTTCCTCA CCTACTCCAC CTGCGACACC GGCAAACCCG ACTACCAGAT CTGGTCGATC
GCGCTGGACG GGGGCGCCGA CCCGCTGTCG GCGGACGCCT GGGAGCAGCT GCCGGGGCCG
CTGTTCAGCC GCGACGACGC GGCCGGGGTC TGGGGGCCCG GGCACCACTT CTTCTTCCGC
TCGCCCGACG GCACCGAGGA CTGGATCGCC TACCACGCCA AGAACACGCC CGAGTACACC
TACTCCTTCC GGTCCACGCG CGCCCAGCGC ATCGGCTGGA CCCCCGAGGG GACCCCCGAC
CTCGGACGGC CGCTGGCGGC GGGGGCGACC CAGCGCCTCC CCTCCGGGGA CCCGGGCGCG
GGCAGCACGG CGGTCAACGA CACCGACACC GGCCGGGGCG GGCCGCGGGT CTCCTACGAG
GGCGACTGGA CCACGGGGGA CCGGTGCGGG GCGCACTGCT TCCACGGCGA CGACCACTAC
ACCGCCCAGG CCGGGGCCAC GGCCACCTAC CACTTCACCG GGTCGCGGAT CGCGGTGTAC
GGGTCGCTGG ACACCGACCA CGGCTACGCG ACGTTCTCGG TGGACGGCGG GCCGCCCTCG
GAGCCGGTGA GCTACCACCA CCCGTTCCGG GTCGGGGAGC AGCGCGTGTA CCTGAGCCCC
GAACTCGGCC CCGGCGAGCA CACCCTGACC GTCACGGTCA CCGGTGACCG GCCCGCCGGG
TCGAGCGACG CCATCGTCAC CGTCGACCGC GCGGAGGTCT ACCCCGCGCC CTGA
 
Protein sequence
MSPRTSPRPR HHLTALGALT LALLVLPLAT TTTAASSSGT ATTAADAAGP AVQAVPAESA 
QEGTFRNPLN AGADPTIVHH DGNYYLSTTQ GDRISVWSSP SLATLATAEP VEVWRDSDPS
RDTELWAPAL HRFQTGDGPR WYLYYTAADS RLTDPVERDA SHRLYVLESA DDDPAGPYEF
KARIADTGTY AIDGEPFVHD GQPYFAWSSP GRGFDGGPQQ LYAARMSNPW TIEGEPVALP
NEGGCPEVRE GPTPLYRDGR TFLTYSTCDT GKPDYQIWSI ALDGGADPLS ADAWEQLPGP
LFSRDDAAGV WGPGHHFFFR SPDGTEDWIA YHAKNTPEYT YSFRSTRAQR IGWTPEGTPD
LGRPLAAGAT QRLPSGDPGA GSTAVNDTDT GRGGPRVSYE GDWTTGDRCG AHCFHGDDHY
TAQAGATATY HFTGSRIAVY GSLDTDHGYA TFSVDGGPPS EPVSYHHPFR VGEQRVYLSP
ELGPGEHTLT VTVTGDRPAG SSDAIVTVDR AEVYPAP