Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2023 |
Symbol | |
ID | 9245873 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 2443577 |
End bp | 2444812 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | Microsomal epoxide hydrolase |
Protein accession | YP_003679955 |
Protein GI | 297560981 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.128391 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGCACCC CGCACGCCTT CCCCCTGGAG CCCGCTCCGA TCCACGTGCC CGACGGGGTC CTGGACGACC TGCGCGCCCG CCTCGCGTCG ACCCGTGCGC CGCTGGACGA GGGAAACGAG GACTGGTCCT ACGGCGTCCC CGACAGCTAC CTGCGTGAGC TGGTCGCCCA CTGGCGGGAC GGCTACGACT GGCGCCGGGC CGAGGCCGCC ATCAACGCCC ACGAGCACTA CCGGGTGAGC GTCGCCGGTG TCCCGGTGCA CTTCATGCGC GAGCCCGGCC GCGGACCCCG GCCGATCCCG CTGATCCTCA CCCACGGCTG GCCGTGGACG TTCTGGCACT GGTCGAAGGT GATCGGCCCG CTCGCCGACC CGGCCGCGTT CGGCGGCGAC CCCGCCGACG CCTTCGACGT CATCGTGCCG TCCCTGCCGG GCTTCGGTTT CCCCGGCCCG CTCACCGGCT TTCCCGACGT CAACTTCTGG AAGGTCTCCG ACCTCTGGCA CACCCTGATG ACCCGGACCC TGGGATACGA GAAGTACGCC GCCGGGGGCT GCGACATCGG CGGGATCGTC TCCAGCCAGC TCGGCCACAA GTACGCCGAC CAGCTGTACG GCGTCCACAT CGGCTCCGGG CTGCCGCTCG ACTTCTTCAA CGGCCCCCGG GCCTGGGACT TCGCCCGGAA CCAGCCCCTC ACCGACGACC AGCCCGCCGA CGTGCGCGCC CGGATCGTGG AGACGGACCA CCGCTCGGCC TCCCACCTGG CCGTCCACAT GCTCGACGGG GCCACCCTGG CCCACGGGCT GAGCGACTCG CCCGCCGGGC TGCTCGCCTG GCTGCTGGAG CGCTGGAGGT CCTGGAGCGA CAACGGCGGC GACGTCGAGT CGGTCTTCAC CAAGGACGAC CTGCTCACCC ACGCCACGAT CTACTGGGCG AACAACTCCA TCGCCACGTC GATGCGCTAC TACGCCAACG CCAACCGCTA CCCCTGGGTC CCCGCCCACG ACCGCACCCC GGTCGTGCAG GCCCCGGTCG GCCTCACCCT GGTCACGTAC GAGAACCCGC CCGGCGTCCA CACCGCCGAC GAGCGCGTCC GGGCGTTCAG GGAGGGCCCA CAGGGCGCCT GGTTCAACCA CGTCAACGTC ACCGCCCACG AGCGCGGCGG CCACTTCATC CCCTGGGAGA ACCCCGACGC CTGGGTGGAC GACCTGCGCC GCACCTTCCG CGGCCGCAGG CCCTGA
|
Protein sequence | MSTPHAFPLE PAPIHVPDGV LDDLRARLAS TRAPLDEGNE DWSYGVPDSY LRELVAHWRD GYDWRRAEAA INAHEHYRVS VAGVPVHFMR EPGRGPRPIP LILTHGWPWT FWHWSKVIGP LADPAAFGGD PADAFDVIVP SLPGFGFPGP LTGFPDVNFW KVSDLWHTLM TRTLGYEKYA AGGCDIGGIV SSQLGHKYAD QLYGVHIGSG LPLDFFNGPR AWDFARNQPL TDDQPADVRA RIVETDHRSA SHLAVHMLDG ATLAHGLSDS PAGLLAWLLE RWRSWSDNGG DVESVFTKDD LLTHATIYWA NNSIATSMRY YANANRYPWV PAHDRTPVVQ APVGLTLVTY ENPPGVHTAD ERVRAFREGP QGAWFNHVNV TAHERGGHFI PWENPDAWVD DLRRTFRGRR P
|
| |