Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3662 |
Symbol | |
ID | 9247531 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4395727 |
End bp | 4397172 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | |
Product | transcriptional regulator, PucR family |
Protein accession | YP_003681566 |
Protein GI | 297562592 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0446387 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCACCGA CCCTGGGGGC CCTGATGCGC ACGCCCCGCC TGCGCCTGAA CCTGCTCACC GGCCAGGAGA ACCTGGACCG GAGGGTGGAG TGGGTCGCGG TCAGCGAGCT GGAGGACCCC ACGCCCTACC TGGCGGGCGG CGAACTCCTG CTCACCACCG GCGTGCGCTG GGCCACCGGC GTCCCCGACC TGCGCGACTA CACGCGCCGC CTGGCCGAGC GCCGCGTCAC CGCGCTGGGC TTCGCGGTCG GCGTGGTCCT GGAGCGCACC CCCGAGCCGC TCCGCGAGGC CGCCGCCGAG TTCGGCCTCA CCCTGCTGGA GGTGGCCAGG GAGACCCCCT TCATCGCGAT CGGCAAGGAG GTGTCCCGGC TCCTGGCCAA GGAGGAGTAC GAGGGGCTGA GCCGGGCCTT CGCCGCCCAG CGCGACCTCA CCCGCGCCGC GCTCACCGGG GAGGCGGCGA TCGTGGACCG GCTGGCCCGC GAACTCGGCG CCTGGGTGCT GCTGCTGTCC GCCGACGGCG CCCCCCGGCA CGCCGCCCCC GCCGGGGCCT CCCCCCGCGC CGCCGGACTC GCCGAGGAGC TGGACCGGCT GCGCGAGGCG GGCGTGCGCG CCAGCGTCTC CCTCACCTCG GGCGGCGAAC ACGTCTCCGT GCAGCCCCTG GCCACCGGCC GCCGGGTCCG CGGCTTCCTT GCCGTGGGCA CCGGCGGGCG CCTCGGCTCG GACGAGCGCA CGCTGGTCAA CGCGGCGGTG TCGCTGCTCT CCCTCGAACT GGAGCGCACC GCGCATGACG CCACGGCCCG GGTGCGCGAG GGGGTGCTCG CGGCGCTGCT CACCGGGGCG CTGGACCCCC TGCACCCGGG GGCGGAGCGG CTGCGCGGGG TCCTTCCCGC CGGACCCGTC CTCGTGGCGG CCGCCGACGG GGTCGGGCCC GCACAGCCGC CCGAGGGCGT CCTGGTCACC GAGCACGACG GCCGCGTCCT GCTGCTGGCC CCCGCCGACA CCGGAACGGG AGTGCTCGCC GAGGTCCTGG AGGGGCCGGT CGGGGTGAGC GACCCCTCCC CCTACGCGGA ACTGTCCGCG GCGCTGTCCC AGGCCGAACG CGCGCTGGCC GCCGCGCGCG ACGCGGGCGG GGGCCTGCTC CGCGTCGGCG ACCTGCCCGG CGGACTGCTC GGGCTGGCCG ACACCCCCGC CGGCGCCCGC ATGGCCGGGG ACCTGCTCGC CCCGCTGCTG CGCCAGCGCA CCTCGGCCGA ACTGCTGGCC TCGCTGCGCG CCTACCTCGC GGCCTCGGGC CGGTGGGACG CGGCGTCGGA GGCACTGGGG ATCCACCGGC ACACGCTGCG CTACCGCATG CGGCGCATCC GCGACCTGCT GCCCGGCGAC CTGGACGATC CCGACTACCG CACCGAGCTG TGGATCGCCC TGCGCGTCCA CGGCAGCGCG GGCTGA
|
Protein sequence | MPPTLGALMR TPRLRLNLLT GQENLDRRVE WVAVSELEDP TPYLAGGELL LTTGVRWATG VPDLRDYTRR LAERRVTALG FAVGVVLERT PEPLREAAAE FGLTLLEVAR ETPFIAIGKE VSRLLAKEEY EGLSRAFAAQ RDLTRAALTG EAAIVDRLAR ELGAWVLLLS ADGAPRHAAP AGASPRAAGL AEELDRLREA GVRASVSLTS GGEHVSVQPL ATGRRVRGFL AVGTGGRLGS DERTLVNAAV SLLSLELERT AHDATARVRE GVLAALLTGA LDPLHPGAER LRGVLPAGPV LVAAADGVGP AQPPEGVLVT EHDGRVLLLA PADTGTGVLA EVLEGPVGVS DPSPYAELSA ALSQAERALA AARDAGGGLL RVGDLPGGLL GLADTPAGAR MAGDLLAPLL RQRTSAELLA SLRAYLAASG RWDAASEALG IHRHTLRYRM RRIRDLLPGD LDDPDYRTEL WIALRVHGSA G
|
| |