Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1082 |
Symbol | |
ID | 9244928 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 1329328 |
End bp | 1330881 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | Alpha-N-arabinofuranosidase |
Protein accession | YP_003679030 |
Protein GI | 297560056 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.286531 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCACCAC GCACGTCCCC TCGCCCGCGA CACCACCTGA CGGCTCTCGG CGCCCTGACC CTGGCGCTGC TCGTACTCCC CCTCGCCACC ACCACCACCG CCGCGTCCTC CAGCGGTACC GCCACCACGG CGGCCGATGC CGCCGGACCG GCAGTTCAGG CGGTTCCCGC CGAGTCCGCC CAGGAGGGAA CGTTCCGCAA CCCCCTCAAC GCCGGAGCAG ACCCCACGAT CGTGCACCAC GACGGGAACT ACTACCTGTC CACCACCCAG GGCGACCGCA TCTCCGTGTG GAGCTCGCCC AGCCTGGCCA CCCTGGCCAC CGCCGAACCC GTGGAGGTGT GGCGCGACAG CGATCCCAGC CGCGACACCG AACTGTGGGC CCCGGCGCTG CACCGGTTCC AGACCGGGGA CGGGCCGCGC TGGTACCTCT ACTACACGGC CGCCGACAGC AGGCTGACCG ACCCCGTGGA GCGGGACGCC AGCCACCGGC TCTACGTCCT GGAGTCGGCC GACGACGACC CGGCCGGACC CTACGAGTTC AAGGCACGGA TCGCCGACAC CGGCACCTAC GCCATCGACG GCGAGCCGTT CGTGCACGAC GGGCAGCCCT ACTTCGCCTG GAGCAGCCCC GGACGCGGGT TCGACGGCGG CCCCCAGCAG CTCTACGCCG CGCGGATGAG CAACCCCTGG ACGATCGAGG GGGAACCCGT CGCGCTGCCC AACGAGGGCG GCTGCCCCGA GGTCCGGGAG GGGCCGACCC CGCTGTACCG CGACGGCCGG ACCTTCCTCA CCTACTCCAC CTGCGACACC GGCAAACCCG ACTACCAGAT CTGGTCGATC GCGCTGGACG GGGGCGCCGA CCCGCTGTCG GCGGACGCCT GGGAGCAGCT GCCGGGGCCG CTGTTCAGCC GCGACGACGC GGCCGGGGTC TGGGGGCCCG GGCACCACTT CTTCTTCCGC TCGCCCGACG GCACCGAGGA CTGGATCGCC TACCACGCCA AGAACACGCC CGAGTACACC TACTCCTTCC GGTCCACGCG CGCCCAGCGC ATCGGCTGGA CCCCCGAGGG GACCCCCGAC CTCGGACGGC CGCTGGCGGC GGGGGCGACC CAGCGCCTCC CCTCCGGGGA CCCGGGCGCG GGCAGCACGG CGGTCAACGA CACCGACACC GGCCGGGGCG GGCCGCGGGT CTCCTACGAG GGCGACTGGA CCACGGGGGA CCGGTGCGGG GCGCACTGCT TCCACGGCGA CGACCACTAC ACCGCCCAGG CCGGGGCCAC GGCCACCTAC CACTTCACCG GGTCGCGGAT CGCGGTGTAC GGGTCGCTGG ACACCGACCA CGGCTACGCG ACGTTCTCGG TGGACGGCGG GCCGCCCTCG GAGCCGGTGA GCTACCACCA CCCGTTCCGG GTCGGGGAGC AGCGCGTGTA CCTGAGCCCC GAACTCGGCC CCGGCGAGCA CACCCTGACC GTCACGGTCA CCGGTGACCG GCCCGCCGGG TCGAGCGACG CCATCGTCAC CGTCGACCGC GCGGAGGTCT ACCCCGCGCC CTGA
|
Protein sequence | MSPRTSPRPR HHLTALGALT LALLVLPLAT TTTAASSSGT ATTAADAAGP AVQAVPAESA QEGTFRNPLN AGADPTIVHH DGNYYLSTTQ GDRISVWSSP SLATLATAEP VEVWRDSDPS RDTELWAPAL HRFQTGDGPR WYLYYTAADS RLTDPVERDA SHRLYVLESA DDDPAGPYEF KARIADTGTY AIDGEPFVHD GQPYFAWSSP GRGFDGGPQQ LYAARMSNPW TIEGEPVALP NEGGCPEVRE GPTPLYRDGR TFLTYSTCDT GKPDYQIWSI ALDGGADPLS ADAWEQLPGP LFSRDDAAGV WGPGHHFFFR SPDGTEDWIA YHAKNTPEYT YSFRSTRAQR IGWTPEGTPD LGRPLAAGAT QRLPSGDPGA GSTAVNDTDT GRGGPRVSYE GDWTTGDRCG AHCFHGDDHY TAQAGATATY HFTGSRIAVY GSLDTDHGYA TFSVDGGPPS EPVSYHHPFR VGEQRVYLSP ELGPGEHTLT VTVTGDRPAG SSDAIVTVDR AEVYPAP
|
| |