Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2269 |
Symbol | |
ID | 9246119 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2712903 |
End bp | 2715287 |
Gene Length | 2385 bp |
Protein Length | 794 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | malto-oligosyltrehalose synthase |
Protein accession | YP_003680197 |
Protein GI | 297561223 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0794544 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACGC ACGACACGTC CACCGCTCCG ACGTCAACCT ACCGCCTCCA ACTGCGCCCC GGGTTCACCC TGGAGGACGC CGCGGACCTG CTCGACTACC TGGACCGCCT GGGAACCGGG GCGGTCTACC TGTCCCCCAT GCTCGCCGCC GCGCCGGGCT CCCAGCACGG CTACGACGTG GTCGACCCCA CCCGGGTCTC GCCCGAACTG GGCGGGGACG CCGCCCGCGA GGCCCTGGCC GCCAAGGCGC ACGAGCTGGG CCTGGGCGTG GTCGCCGACA TCGTCCCCAA CCACATGTCC GTGGTGCGCG CCGACGCCAA CCGCTGGTGG TGGGACGTCC TCGCCCACGG CCGGGACTCG GTGTACGCGC GCTGCTTCGA CATCGACTTC GACGCCGGAC CCCTGCTCAT CCCCGTGCTG GGCGACGACG GCGACGGCGG ACGGGCCGCC CTGGCCGACC TCGTCCTCGC CGACGGCTGC CTGGCCTACC ACGACAAGCG CTACCCGCTC GCCCCGGGCA CCCACGAACC CGGCGACTCC GTCGAGGCGG TGCACGAGCG CCAGCACTAC CGGCTGGTGT CCTGGCGGCG CGGCGACGGC GAGCTCACCT ACCGCCGGTT CTTCGACGTC AGCGAGCTGG CCGCCGTGCG GGTGGAGGAC ACGGGGGTCT TCGACGCCAC CCACGCCGAG ATCCTGCGCT GGGCCGAGCT CGGCCAGCTC GACGGTCTGC GGGTGGACCA CGTGGACGGG CTCACCGACC CCGGCGGCTA CCTGCGCCGC CTCGCCGAGC GCTTCGGCGG CTGGGTCGTG GTGGAGAAGA TCCTCGCCCC CGGCGAGGAC CTGCCGCAGT CCTGGCCGGT CGCCGGGACC ACCGGGTACG ACGCCCTGCG CGAGCTCTGC GGCGTGTTCG TCGACCCCTC CGGTGAGGCC GATCTCACCA CCCTCGCCCT CGACCAGGGG GTGGAGGTGG ACGTCGCCCG GGCCGACCTG CGCGCACGCC GCCACGCCGC CACCGAGCTG CTGCGCGCCG AGGTGCGGCG GATCGCCGCC CTGCTGCCGC CCCGGCCGGA ACGGGGGCGG GAGACGCCCT CGCCCGAGGC GGTGCGCCGC CGCGAGGAGG CCGTGGCCGA GCTGCTGTGC GCCTTCGACG TGTACCGCTC CTACCTGCCC GAGGGCGAGC GCGACTGGGC CCGGGCTGTG GAGACCGCCA TGGGGCACCG CCCCGACCTG GCCCACGACC TGGAGGCCCT CGACATGCGG GTGCGGCGCG ACTCCCGGGG CGAGGCCGCG CGGCGCATCC AGCAGACCAG CGGGATGGTC GTGGCCATGG GCACCGAGAA CACGGCCTTC TACCGCACCA CCAGGTTCGT CGCCCTCAAC GAGGTGGGCG GTGACCCCGC CGACTTCGCC GTCACGCCGC GCGAGTTCCA CGACGGCGCG GCCCGCCGCG AGGCCGCCCG GCCGCACACC ATGACCACCC TGTCCACGCA CGACACCAAG CGCTCGGAGG ACGTGCGCGC CCGCCTGGCC GCCCTGTCGG AGATGGCCGA ACCCTTCGCC GAGGCCGTGC GCCGCTGGAC CGGGCGGTGC GCCCTGCCCG AACCCGCGCT CAACCTCCTG GGGTGGCAGA CCCTGGTCGG CGCCTGGCCC ATCGGGGAGG AGCGCCTGGC CGACTACCTG CTCAAGGCCG CCCGCGAGGC GCGGCTGGGC ACCTCCTGGA CCGAGCCGGA CCCCGACTTC GAGGACCGCG TCCGCTCCTG GCCAGCGCGT GTGCTGGGCG ACAGGGTCCT GCGCGGGGAG GTGGCCTCCG TGGTGGAGTC GGTGCGCGAG GCCGGGTGGA GCAACTCCCT GGGGCAGAAG GCCGTGCAGC TGCTGGGCCC GGGCGTCCCG GACCTGTACC AGGGCACCGA GCTGTGGGAC CTGTCCCTGG TGGACCCCGA CAACCGCAGG CCCGTGGACC ACCAGGCCCG CGCGGACCTG CTGACCCGGA TCGAGGACGG CTGGCGCCCG CCGGTGGACG CCACCGGCGC GGCCAAGCTC CACCTGGTCC GCACGTGCCT GCGCGCGCGC CGCGAGCTGC GCCCCCGGGG GTACCTGCCG CTCACCGCCT CCGGCCCGGC CCGGGACCAC GCGCTGGCCT TCGCGCGGAC CTCGGCCGGA GCCGACCGGC CCGACCTGGC GGTGATCGCC ACCCGGCTGC CCGTCGGGCT GGCCGACGCG GGGGGCTGGG CGGACACGGT CGTGGCGCTG CCCTCGGGGC CGGGGACGTG GACCGACCGG CTCACGGGCC GGGCCGTGGT GTCCGACAGC CCCGACGGGC CCACCCTGGC GCACCTGGCC GGGGTCCTCG ACCGGTACCC GGTGGCGGTC CTGACCAGGG AGTGA
|
Protein sequence | MSTHDTSTAP TSTYRLQLRP GFTLEDAADL LDYLDRLGTG AVYLSPMLAA APGSQHGYDV VDPTRVSPEL GGDAAREALA AKAHELGLGV VADIVPNHMS VVRADANRWW WDVLAHGRDS VYARCFDIDF DAGPLLIPVL GDDGDGGRAA LADLVLADGC LAYHDKRYPL APGTHEPGDS VEAVHERQHY RLVSWRRGDG ELTYRRFFDV SELAAVRVED TGVFDATHAE ILRWAELGQL DGLRVDHVDG LTDPGGYLRR LAERFGGWVV VEKILAPGED LPQSWPVAGT TGYDALRELC GVFVDPSGEA DLTTLALDQG VEVDVARADL RARRHAATEL LRAEVRRIAA LLPPRPERGR ETPSPEAVRR REEAVAELLC AFDVYRSYLP EGERDWARAV ETAMGHRPDL AHDLEALDMR VRRDSRGEAA RRIQQTSGMV VAMGTENTAF YRTTRFVALN EVGGDPADFA VTPREFHDGA ARREAARPHT MTTLSTHDTK RSEDVRARLA ALSEMAEPFA EAVRRWTGRC ALPEPALNLL GWQTLVGAWP IGEERLADYL LKAAREARLG TSWTEPDPDF EDRVRSWPAR VLGDRVLRGE VASVVESVRE AGWSNSLGQK AVQLLGPGVP DLYQGTELWD LSLVDPDNRR PVDHQARADL LTRIEDGWRP PVDATGAAKL HLVRTCLRAR RELRPRGYLP LTASGPARDH ALAFARTSAG ADRPDLAVIA TRLPVGLADA GGWADTVVAL PSGPGTWTDR LTGRAVVSDS PDGPTLAHLA GVLDRYPVAV LTRE
|
| |