Gene Information |
Locus tag | Ndas_2267 |
Symbol | |
ID | 9246117 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2709862 |
End bp | 2711691 |
Gene Length | 1830 bp |
Protein Length | 609 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | malto-oligosyltrehalose trehalohydrolase |
Protein accession | YP_003680195 |
Protein GI | 297561221 |
COG category | |
COG ID | |
TIGRFAM ID | |
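The coordinate and length fields above are consistent with one another, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced for illustration and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for Ndas_2267.
length = gene_length_bp(2709862, 2711691)
print(length, protein_length_aa(length))
```

Under those assumptions the computed values agree with the Gene Length (1830 bp) and Protein Length (609 aa) fields in the record.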
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.767056 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGAGCGAAG CCCAGACCAC CCCGACCACG TCCGACGCCC TCGCCGCCCC CGCCGTCGCG GGGGTCGGCG TCCGCGGCGA CTTCGCCGTG TGGGCGCCCC ACCGCGAACG CGTCCGCCTG CGCCTGTACG GCACCGCGGA GCACGGCGGC ACGGGGGAGC GCGACGTCGC CATGCGGCCC GACGACGACG GCTGGTGGCG CGTCCGCGTC GAGGACGCCG GGCCCGGCAC CGAGTACGCC TACCTCCTGG ACGACGACCC CCAGCCCCTG CCCGACCCCC GCTCACTCCA CCAGCCCCAC GGCGTCCACG GCCCCAGCCG CGTCCACGAC CACGCAGCCT TCGCCTGGAC CGACGCCGAC TGGACCGGGC GCCCCCTCGC GGGCGCCGTC GTCTACGAAC TCCACGTGGG CACCTTCACC CCCCAGGGCA CCCTCGCCGC CGTCGCCGAC CACCTCGACC ACCTGGCCGA CCTCGGCGTC ACCCACGTCG AACTCATGCC GGTCAACGCC TTCGACGGCA CCCACGGCTG GGGCTACGAC GGCGTCCTGT GGGCGGCTGT CCACGACCCC TACGGCGGCC CCGACGCCCT CAAGGCCCTC GTCGACGCCT GCCACCGCCG CGGCCTGGCC GTCCTGCTCG ACGTGGTCTA CAACCACCTC GGCCCCTCGG GCGCCTACAT GCCCCGCTTC GGCCCCTACT TCCGCGGCGA GAACGCCTGG GGCCCCTCCC TCAACCTGGA CGGCCCCGAC TCCGACCCGG TCCGCCGCAC GGTCCTGGAC AACGCCCTGG ACTGGCTGCG CCACTACCAC CTGGACGGGC TGCGCCTGGA CGCCGTGCAC GCCCTGCGCG ACGACCGCGC CACCCCGCTG CTGGCCGAAC TCGCCGAGGA GGTGGACGCC CTCGCCACCG CCCTGAACCG GCCGCTGTCC CTGGTCGCCG AGTCCGACCG CAACGACCCC CGCACCGTCC TGCCCCGCGA GGCGGGCGGC CTGGGCATGA CCGCCCAGTG GTCCGACGAC CTCCACCACG CCCTGCACGT CGCCCTCACC GGGGAGACGC ACGGCTACTA CGCCGACTTC GCCGACCCGG GGGCGCTGCC CGCCGCGCTC ACCCGAGCGT TCTGGCACGC GGGCACCCGC TCCAGCTTCC GCGGCCGCAC CCACGGCGCG CCCGTGGACA CCGCGCGCGT CCCCGGCAGC CGCTTCCTGG CCTACCTGAG CACCCACGAC CAGATCGGCA ACCGGGCCCG GGGCGACCGC ATGGGCGAAC ACCTCTCCCC TGGCCTGCTC GCCTGCGGCG CCGCGCTGGT GCTGTGCTCC CCCTACACCC CCATGATCTT CATGGGGGAG GAGTGGGGGG CCGCCACGCC CTGGCCGTTC TTCGCCTCCT TCACCGACCC CGACCTGGTC AGGGGCGTGC GCGAGGGACG CCGCCGCGAG TTCGCCGCGC TGGGGTGGGC CGAGGAGGAG ATCCCCGACC CCATGGACCC GGCCACCCGC GACGGCGCCG TCCTGGACTG GTCCGAGCCC GGACGCGAAC CGCACGGGCT GGTCCTGGAC ACCTACCGGG CGCTCATCGC CCTGCGGCGC GTGGAACCGG AGCTGTCCGA CCCGCGCCTG GACCGCTCCT CGGTCGAGGT GGGCGGCGGC GGACGCCTGC TCGTCCTGGC CCGGGGAAGC CTGCGCGTGG TGTGCAACCT GGACGCCGAC GGCGCCGAGG TGGAGCTGGA CGCGGCCCCG CGCGAACTCC TGCTCGCCAA CGGCGAGCCC AGGACCGCGG GGTCCACCGT CACCGTGCCG 
GGGGAGTGCT TCGCCGTCCT GAGGGTGTAG
|
Protein sequence | MSEAQTTPTT SDALAAPAVA GVGVRGDFAV WAPHRERVRL RLYGTAEHGG TGERDVAMRP DDDGWWRVRV EDAGPGTEYA YLLDDDPQPL PDPRSLHQPH GVHGPSRVHD HAAFAWTDAD WTGRPLAGAV VYELHVGTFT PQGTLAAVAD HLDHLADLGV THVELMPVNA FDGTHGWGYD GVLWAAVHDP YGGPDALKAL VDACHRRGLA VLLDVVYNHL GPSGAYMPRF GPYFRGENAW GPSLNLDGPD SDPVRRTVLD NALDWLRHYH LDGLRLDAVH ALRDDRATPL LAELAEEVDA LATALNRPLS LVAESDRNDP RTVLPREAGG LGMTAQWSDD LHHALHVALT GETHGYYADF ADPGALPAAL TRAFWHAGTR SSFRGRTHGA PVDTARVPGS RFLAYLSTHD QIGNRARGDR MGEHLSPGLL ACGAALVLCS PYTPMIFMGE EWGAATPWPF FASFTDPDLV RGVREGRRRE FAALGWAEEE IPDPMDPATR DGAVLDWSEP GREPHGLVLD TYRALIALRR VEPELSDPRL DRSSVEVGGG GRLLVLARGS LRVVCNLDAD GAEVELDAAP RELLLANGEP RTAGSTVTVP GECFAVLRV
|