Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1000 |
Symbol | |
ID | 9244846 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 1224329 |
End bp | 1225270 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | transcriptional regulator, GntR family |
Protein accession | YP_003678950 |
Protein GI | 297559976 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.801369 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGGAG CAGTCGAGAC GGGCGCCAGG AAGAGCGCGG ACGGCCGGGC CTCCGCGCTC GCGCTCTCGC TAGCGACCCA GATCAGCGAG CTGGCGCGGA CGGGCGGCCT GGACGAGGGC GCGCACCTGA CCGAGCAGTG GATCGCCGAC CGGCTCCAGG TGTCGCGGTC CCCGGTGCGG CGCGCGCTGG CGCTGCTGGA CGAGATGGGG ATCGTCGAGC ACATCCCGAA CCGGGGGTAC TTCCTCACCC GGCCGGGCAG CGAACTGGCC TCCGTGGACG CCGTTCCGGA GCGGGACGGG CAGGAGGACC TGTACTTCCG CGTGGTGGAC GACTTCCTCA ACGACCGGCT CGACCGGGAG TTCACGGCCG CGGAGATCGC GCGCCGCCAC GGGGTGGCGG CCCGGCACGT GCAGCGGGTC CTGGTCCGGA TGGAGGCCGA GGACCTGGTG CGGCGGCGGA CGGGCCGCGG CTGGGAGTTC CAGGAGGTGC TCTCCACCGC GGAGGGGCAC GACCACAGCT ACCGGTTCCG GATGATCGTG GAACCCGCCG CGCTGCTGGA GCCGGGGTTC GCGGTGGACG CGGAGGCCTT CGCGCTCCAC CGGGAGCGCC AGGAGGGCCT GGTACGGGGC AGGGCGCTGT CGTCGGCGCG CGGCACCCTG TTCCAGACGG GGGCGGAGTT CCACGAGATG CTCGTGGGGT GCGCGAACAA CCCGGTCCTG CTCGACGCGG TGCGTCGGCA GAACCGGGTG CGCAGGCTCA TCGAGTACCG TCACCAGGTG GACCGCACGC GGATGGTGCA CCAGGCCCGC GAGCACCTGC TCCTGATGGA CCTGCTCCAG GAGGGGCGGA TCGAGGAGGC CTCCCGGGCC CTGCGCGCGC ACCTGGACCG GGTCCGGTGG ATCAAGACCG GGATCGGCGC GGAGCCGCCC GCCCTGCTGT GA
|
Protein sequence | MTGAVETGAR KSADGRASAL ALSLATQISE LARTGGLDEG AHLTEQWIAD RLQVSRSPVR RALALLDEMG IVEHIPNRGY FLTRPGSELA SVDAVPERDG QEDLYFRVVD DFLNDRLDRE FTAAEIARRH GVAARHVQRV LVRMEAEDLV RRRTGRGWEF QEVLSTAEGH DHSYRFRMIV EPAALLEPGF AVDAEAFALH RERQEGLVRG RALSSARGTL FQTGAEFHEM LVGCANNPVL LDAVRRQNRV RRLIEYRHQV DRTRMVHQAR EHLLLMDLLQ EGRIEEASRA LRAHLDRVRW IKTGIGAEPP ALL
|
| |