Gene Information |
Locus tag | Ndas_2446 |
Symbol | |
ID | 9246296 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2899680 |
End bp | 2900744 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | chitin-binding domain 3 protein |
Protein accession | YP_003680372 |
Protein GI | 297561398 |
COG category | |
COG ID | |
TIGRFAM ID | |
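The gene and protein lengths above follow from the coordinates. The sketch below reproduces that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2899680, 2900744)
print(length)                  # 1065
print(protein_length(length))  # 354
```

Both values agree with the Gene Length and Protein Length fields of this entry.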
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGTCAC GCAGCACCCT CGCCATGGCC GTGGCGCTCG GCGGCCTCGC CGTCGGCACC ACCGTCGTCG CGCTGCCCGA CACGGCCCAG GCCCACGGCG CGTTCACCTA CCCGGCCAGC CGCACCTACG CCTGCTTCCA GGACGCCACC GGCGGCAGCA GCGGCGGCGC GCTCGCGCCG ACCAACGACG CCTGCGCCGA CGCGCTGGCC GAGGGCGGCA ACTACCCGTT CTGGAACTGG TTCGGCAACC TCATCAGCAA CTCCGACGGG CGGCACCAGG AGTTCGTGGC CGACGGCGAG CTGTGCGGAC CCACCGACAG CTTCTCCGCC TTCAACGCGG TCCGCGCCGA CTGGCCGACC ACCGAGCTGC CCGCCGACAC GACGGTGGAG TTCCACCACA ACGCCTGGGC CGCGCACCCG GGCACCTTCT ACACCTACGT CACCGAGGAC GGGTTCGACC CCGCGACCGA CGCGCTGACC TGGGACTCCC TGGAGCTCAT CGACGAGGTC ACCGACCCGC CGCTGCGCAG CGGCGGCGTC GCCGGAGCCG AGTACTACTG GGACGTGGAC CTGCCCGACA AGGAGGGCCA GCACGTCATC TACGTGGTGT GGGAGCGCTC CGACAGCCCC GAGGCGTTCT ACAACTGCTC CGACGTGGTC TTCGAAGGGG GCTCGGGCGG CAACCCGGAC CCCGAGCCGG AGCCGGAGCC GGAACCCGAG CCGGAGCCGG AGCCGGAGCC GGAGCCCACG GACCCGCCGG CGGGTGAGTA CTGCACGGCC GAGTACACGG TCCTCAACGA GTGGCAGGGC GGCTTCCAGG CCGAGGTCGA GGTCACCGCG GGCGAGAACG GGACCGACGG CTGGATGGTG GACTGGGTGT TCGCCAACGG CCAGGCGGTC AGCAGCGCCT GGAACGCCTC GCTCGTCAAC CACGGCGCCC ACTTCGAGGC CAGGAACGCC GCGCACAACG GCTCGCTCGC GGCCGGGGAG AGCGCCAGCT TCGGGTTCAC CGCCACCTCG GGGAACGTGA ACATCGAGCC GCGGGTGACC TGCCAGGAGC CGTGA
|
Protein sequence | MRSRSTLAMA VALGGLAVGT TVVALPDTAQ AHGAFTYPAS RTYACFQDAT GGSSGGALAP TNDACADALA EGGNYPFWNW FGNLISNSDG RHQEFVADGE LCGPTDSFSA FNAVRADWPT TELPADTTVE FHHNAWAAHP GTFYTYVTED GFDPATDALT WDSLELIDEV TDPPLRSGGV AGAEYYWDVD LPDKEGQHVI YVVWERSDSP EAFYNCSDVV FEGGSGGNPD PEPEPEPEPE PEPEPEPEPT DPPAGEYCTA EYTVLNEWQG GFQAEVEVTA GENGTDGWMV DWVFANGQAV SSAWNASLVN HGAHFEARNA AHNGSLAAGE SASFGFTATS GNVNIEPRVT CQEP
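The GC content and protein sequence fields can be checked against the gene sequence with a short script. This is a minimal sketch using only the Python standard library. It relies on translation table 11 sharing its amino-acid assignments with the standard genetic code (the two differ only in permitted start codons), and on the gene sequence being listed in coding orientation, which is consistent with it beginning with ATG and ending with TGA. The helper names `clean`, `gc_content`, and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
prefix = "ATGCGGTCAC GCAGCACCCT CGCCATGGCC"
print(translate(prefix))  # MRSRSTLAMA
```

Pasting the full Gene sequence value into `translate` and `gc_content` allows a comparison with the Protein sequence and GC content fields.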