Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4760 |
Symbol | |
ID | 9248642 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 5647956 |
End bp | 5649137 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | Glucan endo-1,3-beta-D-glucosidase |
Protein accession | YP_003682651 |
Protein GI | 297563677 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.68587 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.425556 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGACC GAAGACTGTT CCTCGGCGGC GCATCGCTCG CCACCGTCAG CACCCTGCTC GTCACGACCG GTTGCTCCCG TGCGGAGAGC AACGGCGAAC TCACCTCGGG CGGCTCTCAG GCCGGCGCCC TTCCGGTGAG GGTCACCAAC GAGACCGGGA CCTTCCCCGA CAGCGAGATC CGCGCCTACA TCGTGGGAAC CGACCTGACC ACGAACACGC AGTCCTGGGT CGACGCCAGC GGCGTCGCGC ACCCGGTCGC GGAGTCCGAC AACACCGACG ACGGGTTCAC CGACTACTCG ATCCCGCTGG ACTCGGTCAG CGAGCTGTCC CTGCCGTTCA TGTCCGGCCG GGTGTACTTC GCCCTCGGCG GCAAGCTCCG GTTCAAGGTC GTCACCGACG GCAACGGCAG GCCCGCGCTC CAGTACCCGG CGGGCTGGGT CGAGTCCGAC CCCAGCCACG GGGTGCTCCA CGACACCGTG GAGTTCACCC ACAACGAGAC CGGCATGTAC TGCAACACCA GCATGGTGGA CCAGTTCAGC GTGCCGCTGG CCATCCGCCT CCAGGGCGAG GCCGACCAGA CCACCGGCAC GTTCGAGCCG GGCGGCCGCG ACGCGGTCTT CGCCACGCTC TCCGCCGACC CCGTCTTCTC GTCGCTCGTG CTGAGCGAGG GAGAACTCGT CATCGCCCCC GGGCACGGGC TGGACGCCGG ACTGTTCCCG GCCGACTACT ACGACTCCTA CATCGACCAG GTCTGGGAGC GCTACAGCAC CACCGACCTG CGCGTCACCA CCAACGCCGG GACCTTCACC GGCCGCGTGG ACGGCTCGGG CAACCTGGTC TTCGACGGCG GCGTGGCCCC GATCCCCAAG CCCTCCACCC GCGACGTCTT CTTCTGCGAC GGCGCCCTGG CGGCCCCCAA CGACGGCATC ACCGGTCCGG TGGCCGCGAT CCTGGGCGCG GCGTTCAACC GCTCGACGCT GCTGGACACC GCCGAGCACC CGGTCACCGA CCCCGCGGCG TTCTACAACC ACGAGACCAC CAACCGCTAC GCGGCGGTCT TCCACGAGAA CACCGTGGAC GGCAAGGCCT ACGGCTTCGC CTTCGACGAC GTGTCGAACT TCGCCTCCTA CGTCCAGGAC CACGCGCCGG TGTCCCTCGA CGTGACGCTG ACCGCGTTCT AG
|
Protein sequence | MIDRRLFLGG ASLATVSTLL VTTGCSRAES NGELTSGGSQ AGALPVRVTN ETGTFPDSEI RAYIVGTDLT TNTQSWVDAS GVAHPVAESD NTDDGFTDYS IPLDSVSELS LPFMSGRVYF ALGGKLRFKV VTDGNGRPAL QYPAGWVESD PSHGVLHDTV EFTHNETGMY CNTSMVDQFS VPLAIRLQGE ADQTTGTFEP GGRDAVFATL SADPVFSSLV LSEGELVIAP GHGLDAGLFP ADYYDSYIDQ VWERYSTTDL RVTTNAGTFT GRVDGSGNLV FDGGVAPIPK PSTRDVFFCD GALAAPNDGI TGPVAAILGA AFNRSTLLDT AEHPVTDPAA FYNHETTNRY AAVFHENTVD GKAYGFAFDD VSNFASYVQD HAPVSLDVTL TAF
|
| |