Gene Ndas_5034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5034 
Symbol 
ID9248923 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp175539 
End bp177164 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content67% 
IMG OID 
ProductCyclohexanone monooxygenase 
Protein accessionYP_003682921 
Protein GI297563948 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.623836 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCAACG CGAAGAACAC CGATCTCGAC GCGATCGTCG TCGGTGCCGG GTTCGCCGGC 
ATCTACGCGC TGCACAAGCT CCGCAACGAA CTGGGCCTGT CCGTCCGGGC CTTCGACAAG
GCGGGCGGGG TCGGGGGCAC CTGGTACTGG AACCGCTACC CCGGCGCCAT GTCCGACAGC
GAGGGCTTCA TCTACCAGTA CTCCTTCGAC CGCGACCTGC TGCGGGAGTG GACCTGGAAG
AAGCGCTACC TGTCCCAGGC GGAGATCCTG GGCTACCTGG AGGCGGTCGT GGAGCGGCAC
GACCTCGCCA GGGACATCCA GCTCAACACC GGCGTCGAGA CGCTGGTCTA CGACGAGGCC
GCCGCCCTGT GGACCGCGAC CACCAGCGAC GGCCAGACCC TCACCGCCCG CTACGTGGTG
ACCGCCCTCG GACCGCTGTC CACCTCCCAC TTCCCCGACT TCAAGGGCCG CGACAGCTTC
CAGGGCCGCC TGGTCCACAC CGGCTCCTGG CCCGACGACC TCGACATCGA GGGCAAGAGG
GTCGGCGTCA TCGGCACCGG CTCCACCGGA ACCCAGTTCA TCTGCGCGGC CTCGAAGGTG
GCCGGGCAGC TCACCGTGTT CCAGCGGACC CCCCAGTACA ACGTGCCCTC GGGCAACGCC
GAGGTGGACG AGGCCTACTT CACCGACCTG CGCGGCCGCT ACGACCAGGT CTGGGAGCAG
GCCAAGAAGT CCCGCGTGGC GTGCGGCTTC GAGGAGAGCG AGATCGCCGC GATGAGCGTC
TCCGAGGAGG AGCGCCGACG CGTCTTCCAG GAGAACTGGG ACCGGGGCAA CGGCTTCCGC
TTCATGTTCG GCACCTTCTC CGACATCATC TTCGACCCCA GGGCCAACGA GGCGGCGGCC
GACTTCATCC GGTCCAAGAT CCGGGAGATC GTCAAGGACC CAGAGACCGC CCGCAAGCTC
CAGCCGACCG ACTACTACGC CAAGCGCCCG GTCTGCAACG AGGACTACTA CGAGTCCTAC
AACCGGGACA ACGTCAGCCT GGTGAGCCTC AAGGAGACCC CGATCCGGGA GTTCACCCCC
ACGGGGATCG TGACCGAGGA CGGGGTGGAG CACGAGCTGG ACGTCGTCGT CTTCGCGACC
GGTTTCGAGG CCGTCGAGGG CAGCTACCGG CAGATGGAGA TCCGCGGCCG GGGCGGCGTG
ACCATCGAGG AGCACTGGGG GGACACGCCC GCCAGCTACC TCGGGGTCAA CGTCTCGGGC
TTCCCCAACA TGTTCATGGT CTACGGCCCC AACAGCGTCT TCAGCAACCT GCCCACGGCC
ATCGAGACCC AGGTCGAGTG GATCACCGAC CTGGTCCGGA TGATGGAGGA GCGCGACCTG
ACCTCCATCG AGCCGACCCC GGAGGCGGAG GAGGGCTGGA CCGAGCTGTG CACGCAGATC
GCCGACCACT CCCTGTTCCC CAAGGTCAAC TCCTGGATCT TCGGGGCCAA CATCCCGGGC
AAGAAGAAGC GGGTCCTGTT CTACTTCGCG GGGCTCGGCA ACTACCGCCA GAAGCTCGGT
GACGTGGCCG CGGCCGACTA CGAGGGCTTC ATGCTCAAGG GCAACCCCTC GGTGGTGACC
GCCTGA
 
Protein sequence
MTNAKNTDLD AIVVGAGFAG IYALHKLRNE LGLSVRAFDK AGGVGGTWYW NRYPGAMSDS 
EGFIYQYSFD RDLLREWTWK KRYLSQAEIL GYLEAVVERH DLARDIQLNT GVETLVYDEA
AALWTATTSD GQTLTARYVV TALGPLSTSH FPDFKGRDSF QGRLVHTGSW PDDLDIEGKR
VGVIGTGSTG TQFICAASKV AGQLTVFQRT PQYNVPSGNA EVDEAYFTDL RGRYDQVWEQ
AKKSRVACGF EESEIAAMSV SEEERRRVFQ ENWDRGNGFR FMFGTFSDII FDPRANEAAA
DFIRSKIREI VKDPETARKL QPTDYYAKRP VCNEDYYESY NRDNVSLVSL KETPIREFTP
TGIVTEDGVE HELDVVVFAT GFEAVEGSYR QMEIRGRGGV TIEEHWGDTP ASYLGVNVSG
FPNMFMVYGP NSVFSNLPTA IETQVEWITD LVRMMEERDL TSIEPTPEAE EGWTELCTQI
ADHSLFPKVN SWIFGANIPG KKKRVLFYFA GLGNYRQKLG DVAAADYEGF MLKGNPSVVT
A