Gene Information |
Locus tag | Ndas_3284 |
Symbol | |
ID | 9247146 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 3919944 |
End bp | 3921287 |
Gene Length | 1344 bp |
Protein Length | 447 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | |
Product | putative transcriptional regulator, PucR family |
Protein accession | YP_003681196 |
Protein GI | 297562222 |
COG category | |
COG ID | |
TIGRFAM ID | |
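The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the reported 1344 bp) and that the CDS is unspliced and ends in a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(3919944, 3921287)
print(length)                     # 1344
print(protein_length_aa(length))  # 447
```

Both values agree with the Gene Length and Protein Length fields of this record.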
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.946669 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCCTACC CCCTGCGGGA GAGCCGTCCG ACGGACCCGT TCGCCCGTAT CCCCCCGGAG TTGGGCGCCC GGCTGCGCCC GCTGGTGCCG TCGCTGGCGG AGGAGGCCCT GGAGGGCATC CGGCGCCGGG CGGAGTTCCA CGCCGCGCCC CTGGAGCCCG CCGACGCCGA GGGTCTGCGG CGGACGGTGG AAGCCGGTCT GCGCGCCTTC CTGGACCGGG TGGAGGGCGG CCGCTCCCCC GGGCACGTGG ACGCGCTGGC CGCCGAGTTC CGGGCCTTCG GCGCGGACGA GGCCCGGCAC GGGCGCACCC TGGAGTGCCT GCACACGGGG ATGCGCACGG CGGCGGCCGT GGCCTGGCGG CGGCTGGCCG ACGCGGACGC GCTCACCCGC GACCACATGG CGGTGCTGGG AGAGGCGATC TTCGCGTTCC AGGAGGAGGT CTCGACCGCC GCGGCCGAGG GCCACGCGCG GGTGCGCGGC GCCGGGGTGG ACGCGCTGCG CAGGCGCCGG GCGCGGTTGC TGGAGGTGCT GTTGGCCGAG ACGGGCCCGC GGCGGGAGGA GCAGGCGGTG CCCGCGCTGG CCCGGGCCGC CCGGTGGCGG CTGCCCGGGC GGGTGGCGGT GGCGCTGCTG TACCGGGGTA CGGACGGCGC GGCGGCGGCC CCGCTGGAGG CGACCCCGAG CGGCCCGTCG GCGGTGGCCC AGACGACCGT GGGCGGCCCG TCGGCGGACG CGCTGTCCGC CTCGGCGGGG GCGTTCTCAC ATCTCAGCGC GTTGCCGTCG GACGTGCTCG TGGGCCTGGA CCGCGTGGAG CCGTGCGCGG TGGTGCCCGA TCCCGAGGGG CCGGGGCGGC TGCGGTCGCT GGAACGCTCG CTGCGCGGTT CCTGCGCGGT GCTGGGCCCG AGCGTGCCCC TGGACCGGGC GCCGCTGTCG CTGGCCCGGG CCCGGGACCT CGCGGAACTC GTGCGCACGG GCGTGGTGCC GGGCGGGGGC GTGGTGCGGT GGGACGACCA TCTGCCCGTG CTGCTGCTGT CGCGCGATCC CGAGCTGGTC GCGGAGATGA CGCGGTCCCG GCTCGCTCCG CTGGCGCCGT TGCGCCCGCC GCAGCGGGAG CGGATGGCCG ACACCCTGCT GGCCTGGCTG GAGTCGGGGC TCAACGCCAA CGAGGCCGCC GAGCGGCTGC GCATCCACCC GCAGACCGTG CGCTACCGGC TGCGCCAGCT GGAGGAGCTG TTCGGCGAAC GGCTGCGCGA GCCCGGAGAG CGTTTCGAAC TGGAACTGGT CCTGCGGGCC CGCCGTCTGC TCGGTCCCCT GGAGGAGGGT CCGGTGTGGC CGGACGGCGG GTGA
|
Protein sequence | MSYPLRESRP TDPFARIPPE LGARLRPLVP SLAEEALEGI RRRAEFHAAP LEPADAEGLR RTVEAGLRAF LDRVEGGRSP GHVDALAAEF RAFGADEARH GRTLECLHTG MRTAAAVAWR RLADADALTR DHMAVLGEAI FAFQEEVSTA AAEGHARVRG AGVDALRRRR ARLLEVLLAE TGPRREEQAV PALARAARWR LPGRVAVALL YRGTDGAAAA PLEATPSGPS AVAQTTVGGP SADALSASAG AFSHLSALPS DVLVGLDRVE PCAVVPDPEG PGRLRSLERS LRGSCAVLGP SVPLDRAPLS LARARDLAEL VRTGVVPGGG VVRWDDHLPV LLLSRDPELV AEMTRSRLAP LAPLRPPQRE RMADTLLAWL ESGLNANEAA ERLRIHPQTV RYRLRQLEEL FGERLREPGE RFELELVLRA RRLLGPLEEG PVWPDGG
|
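The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch of how that might be done, together with a GC-fraction calculation and a basic sanity check for a complete CDS. All function names are hypothetical, and the start-codon set is a simplification covering only the most common bacterial initiators rather than the full list for translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
# Simplification: only the most common bacterial start codons are listed.
COMMON_START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if length is a multiple of 3 with a common start and a stop codon."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in COMMON_START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# Only the first two blocks of the gene sequence are used here for brevity.
fragment = clean_sequence("ATGTCCTACC CCCTGCGGGA")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.65
```

Passing the full Gene sequence field through `clean_sequence` and `gc_fraction` would allow a comparison with the GC content reported in the record.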