Gene Ndas_2023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2023 
Symbol 
ID9245873 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2443577 
End bp2444812 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content72% 
IMG OID 
ProductMicrosomal epoxide hydrolase 
Protein accessionYP_003679955 
Protein GI297560981 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.128391 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGCACCC CGCACGCCTT CCCCCTGGAG CCCGCTCCGA TCCACGTGCC CGACGGGGTC 
CTGGACGACC TGCGCGCCCG CCTCGCGTCG ACCCGTGCGC CGCTGGACGA GGGAAACGAG
GACTGGTCCT ACGGCGTCCC CGACAGCTAC CTGCGTGAGC TGGTCGCCCA CTGGCGGGAC
GGCTACGACT GGCGCCGGGC CGAGGCCGCC ATCAACGCCC ACGAGCACTA CCGGGTGAGC
GTCGCCGGTG TCCCGGTGCA CTTCATGCGC GAGCCCGGCC GCGGACCCCG GCCGATCCCG
CTGATCCTCA CCCACGGCTG GCCGTGGACG TTCTGGCACT GGTCGAAGGT GATCGGCCCG
CTCGCCGACC CGGCCGCGTT CGGCGGCGAC CCCGCCGACG CCTTCGACGT CATCGTGCCG
TCCCTGCCGG GCTTCGGTTT CCCCGGCCCG CTCACCGGCT TTCCCGACGT CAACTTCTGG
AAGGTCTCCG ACCTCTGGCA CACCCTGATG ACCCGGACCC TGGGATACGA GAAGTACGCC
GCCGGGGGCT GCGACATCGG CGGGATCGTC TCCAGCCAGC TCGGCCACAA GTACGCCGAC
CAGCTGTACG GCGTCCACAT CGGCTCCGGG CTGCCGCTCG ACTTCTTCAA CGGCCCCCGG
GCCTGGGACT TCGCCCGGAA CCAGCCCCTC ACCGACGACC AGCCCGCCGA CGTGCGCGCC
CGGATCGTGG AGACGGACCA CCGCTCGGCC TCCCACCTGG CCGTCCACAT GCTCGACGGG
GCCACCCTGG CCCACGGGCT GAGCGACTCG CCCGCCGGGC TGCTCGCCTG GCTGCTGGAG
CGCTGGAGGT CCTGGAGCGA CAACGGCGGC GACGTCGAGT CGGTCTTCAC CAAGGACGAC
CTGCTCACCC ACGCCACGAT CTACTGGGCG AACAACTCCA TCGCCACGTC GATGCGCTAC
TACGCCAACG CCAACCGCTA CCCCTGGGTC CCCGCCCACG ACCGCACCCC GGTCGTGCAG
GCCCCGGTCG GCCTCACCCT GGTCACGTAC GAGAACCCGC CCGGCGTCCA CACCGCCGAC
GAGCGCGTCC GGGCGTTCAG GGAGGGCCCA CAGGGCGCCT GGTTCAACCA CGTCAACGTC
ACCGCCCACG AGCGCGGCGG CCACTTCATC CCCTGGGAGA ACCCCGACGC CTGGGTGGAC
GACCTGCGCC GCACCTTCCG CGGCCGCAGG CCCTGA
 
Protein sequence
MSTPHAFPLE PAPIHVPDGV LDDLRARLAS TRAPLDEGNE DWSYGVPDSY LRELVAHWRD 
GYDWRRAEAA INAHEHYRVS VAGVPVHFMR EPGRGPRPIP LILTHGWPWT FWHWSKVIGP
LADPAAFGGD PADAFDVIVP SLPGFGFPGP LTGFPDVNFW KVSDLWHTLM TRTLGYEKYA
AGGCDIGGIV SSQLGHKYAD QLYGVHIGSG LPLDFFNGPR AWDFARNQPL TDDQPADVRA
RIVETDHRSA SHLAVHMLDG ATLAHGLSDS PAGLLAWLLE RWRSWSDNGG DVESVFTKDD
LLTHATIYWA NNSIATSMRY YANANRYPWV PAHDRTPVVQ APVGLTLVTY ENPPGVHTAD
ERVRAFREGP QGAWFNHVNV TAHERGGHFI PWENPDAWVD DLRRTFRGRR P