Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1544 |
Symbol | |
ID | 9245394 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1891073 |
End bp | 1892296 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | cysteine/1-D-myo-inosityl 2-amino-2-deoxy-alpha-D-glucopyranoside ligase |
Protein accession | YP_003679479 |
Protein GI | 297560505 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.145189 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.965113 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTTCAT GGTCTGCGCC TGACATCGTC CCCCTGCCGG GTACCGGCGG TCCCCTTCGT GTCCACGACA CCGCCACCGG CCGGATAAGG ACGACGACGC CGGGCCCCCG GGCCGGGATG TACGCCTGCG GCATCACCCC CTACGACGCC GCCCACCTGG GCCACGCCTT CACCTACCTC ACCTTCGACC TGGTCAACCG GGTCTGGCGC GACGCGGGCC ACGACGTCAA CTACGTGCAG AACACCACCG ACATCGACGA CCCGCTCCTG GAGCGCGCGG AGGCCACCGG CGTCGACTGG CGCGACCTCG CCCACCGCGA GATCGACGTC TTCCGCGAGG ACATGGCCGC CCTGCGGATC ATCCCCCCGA CCTCCTACGT CGGCGTGGTG GAGTCCGTCG ACCTCATCAG CGACCTCGCC GCCCGCATCC GCGACACCGG CGCCGCCTAC GAGCTGGACG GGGACCTGTA CTTCTCCGTC GCCGAGGCGC CCGAGTTCGG CGAGATCAGC AACCTGGACC GCGGGCAGAT GCTGGAGCTG TTCGGAGAAC GCGGCGGCGA CCCCCAGCGC ACCGGCAAGA AGGACCCGCT CGACTGGCTG CTCTGGCGTG CCGAGCGCCC CGGCGAGCCC GCCTGGGACA GCCCCCTGGG CCGCGGGCGC CCCGGCTGGC ACATCGAGTG CAGCGCCATC GCCCTGGACC GGCTCGGCCC GGCCTTCGAC CTCAACGGCG GCGGCAGCGA CCTGATCTTC CCCCACCACG AGATGGGCGC GGCCGAGACC CGATGTGCCA CGGGCGGACC CAACGCCCAC AACCACCTGC ACGTGGGCAT GGTCGGCCTC GACGGCGAGA AGATGTCCAA GTCCCTGGGC AACCTGGTCT TCGTCTCCAA GCTGCGCCAG CAGGGCGTGG ACCCGGCCGT CATCCGCCTG GCCATGCTCG CCCACCACTA CCGCGCCCCG TGGGAGTGGA CCGACGCCGA ACTCCCCGCC GCCACCGCCC GCGCCGAGCG CTGGCGCTCC GCCCTCGCCC TGGGCGCGGC GCCCGACGCC GCCCCGGTGC TCGCCGCCGT GCGCGCGGCC CTGTCCGAGG ACCTGGACTC CCCGGCGGCC CTGGCCGCGG TGGACGCCTG GGCCGACACC GCCCTCACCG AGGGCGGCGC CGACACCGGC GCGCCCGCCC TGGTGCGCGC GACCGTGGAC ACCCTGCTGG GCGTGCGCCT GTAA
|
Protein sequence | MRSWSAPDIV PLPGTGGPLR VHDTATGRIR TTTPGPRAGM YACGITPYDA AHLGHAFTYL TFDLVNRVWR DAGHDVNYVQ NTTDIDDPLL ERAEATGVDW RDLAHREIDV FREDMAALRI IPPTSYVGVV ESVDLISDLA ARIRDTGAAY ELDGDLYFSV AEAPEFGEIS NLDRGQMLEL FGERGGDPQR TGKKDPLDWL LWRAERPGEP AWDSPLGRGR PGWHIECSAI ALDRLGPAFD LNGGGSDLIF PHHEMGAAET RCATGGPNAH NHLHVGMVGL DGEKMSKSLG NLVFVSKLRQ QGVDPAVIRL AMLAHHYRAP WEWTDAELPA ATARAERWRS ALALGAAPDA APVLAAVRAA LSEDLDSPAA LAAVDAWADT ALTEGGADTG APALVRATVD TLLGVRL
|
| |