Gene Ndas_2691 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2691 
Symbol 
ID9246542 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3206309 
End bp3207451 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content69% 
IMG OID 
ProductMicrosomal epoxide hydrolase 
Protein accessionYP_003680612 
Protein GI297561638 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.116315 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGAGG ACAACGCACT CACACCGTTT CGCATCGCCA TCCCGCAGGC CGACATCGAC 
GACCTGCGCG ACCGGCTGGC GCACACGCGC TGGCCGGTCC CGGTGCCGGG CCGAGACGAT
CGCACCGACT TCAGCCGCGG CATCCCGCTG GTGTACCTGA AGGAGCTCGC CGAGTACTGG
CGCGACGGGT TCGACTGGCG TGCGCAGGAG GAGGGGCTCA ACGAGTACGG ACAGTTCACG
ACGGCCGTCG ACGGCCAGAC CTTCCACGTC GTCCACGTGC GATCGACGAA CCCGGAGGCC
GTCCCGCTGA TCCTGAACCA CGGCTGGCCG GGCTCGTTCG TCGAGTACCA GCGGCTCATC
CCGCTGCTGA CCGATGAGTT CCACGTGGTC GTCCCGTCGC TGCCCGGTTT CGGGTTCTCC
ACCCCGCTGT CGGGGACCGG CTGGGAGCTG GCGCGGACGA CGGAGGCCTA CGCCGAGATC
ATGACGCGTC TGGGCTACGA GAGGTTCGCG GCCCACGGCA CCGACATCGG TGCGGGCACC
ACCGGCCGCC TCGCGGCGCT CCACCCGGAG CGCGTCATCG GCACGCACAT CGGCAGCGAC
CCGCGGTGGC TCGGGTTGCT CGGCGACAAG TTCCCCTACC CCGACGGTCT GTCCGATGAC
GAGACCGCCC AGATCGAGGC GGTGCGCGCC GAGGCCGCGG CTGAGCGCGG GTACCTGGCG
ATGCAGGACC ACCGCCCCGA CACGATCGGC GCGGCGCTCA CCGACTCGCC GGTCGGTCAG
CTCGCGTGGA TCGCCGAGAA GTTCAAGACC AGGGCCGATG GCGCCTACCG GACGCCGGAC
GAGACGGTCG ACCGCGACCA GCTCCTCACG AACATCAGCC TGTACTGGTT CACCCGCAGC
GGCGCGTCGA GCGCGCAGTT CTACTACGAG TCCGCGCACT CCGGAATCGA CTTGGTCACG
GCCTCCGACG TGCCGTCCGG ATGGGCCGTG TTCGACACCC ACCCGCTCAT GCGCCGCGCG
GTGGACCCGT GGAAGGCGAT CGGTCACTGG AGCGAGTTCA CCGAGGGCGG TCACTTCCCC
GCGATGGAGG CGACGGAGCT GCTCGCGGAC GACATCCGTG CCTTCTTCCA CGGCGTTTCC
TGA
 
Protein sequence
MNEDNALTPF RIAIPQADID DLRDRLAHTR WPVPVPGRDD RTDFSRGIPL VYLKELAEYW 
RDGFDWRAQE EGLNEYGQFT TAVDGQTFHV VHVRSTNPEA VPLILNHGWP GSFVEYQRLI
PLLTDEFHVV VPSLPGFGFS TPLSGTGWEL ARTTEAYAEI MTRLGYERFA AHGTDIGAGT
TGRLAALHPE RVIGTHIGSD PRWLGLLGDK FPYPDGLSDD ETAQIEAVRA EAAAERGYLA
MQDHRPDTIG AALTDSPVGQ LAWIAEKFKT RADGAYRTPD ETVDRDQLLT NISLYWFTRS
GASSAQFYYE SAHSGIDLVT ASDVPSGWAV FDTHPLMRRA VDPWKAIGHW SEFTEGGHFP
AMEATELLAD DIRAFFHGVS