Gene Ndas_1037 details


Gene Information

Locus tag: Ndas_1037
Symbol:
ID: 9244883
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1278330
End bp: 1279559
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: cytochrome P450
Protein accession: YP_003678986
Protein GI: 297560012
COG category:
COG ID:
TIGRFAM ID:
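
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range, so add 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1   # the last codon is the stop codon
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(1278330, 1279559)
print(gene_len, protein_len)  # 1230 409
```

Under these assumptions the result agrees with the Gene Length (1230 bp) and Protein Length (409 aa) fields of this record.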


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.066208
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0434723
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGCAT CCGCGCCCGA GGAACACGGT TCCGCCGCCC CGCCCCCTGC GGAGTTCCCC 
CTGGCCCGCT CCTGCCCCTT CTCCCCGCCC GAGGCCTACG AGGGGATGCG CGCCGCCGCC
CCCCTGACCC GGGTCACGAT CCCCAGCGGC AAGCAGGCCT GGCTCCTCAC CCGCTACGAC
GACGTGCGGA CCGCGCTGTC CCACCCCGCC TCCAGCTCCG ACGCCCGCCA CCCCGCCTTC
CCCGCCCTGG GCAGGGGCGA ACAGGCGGTC GCCGCCGACC TGCGCCCCTT CATCCGCATG
GACCCGCCCG ACCACACCCG TGTCCGCCGC ATGCTCGTGG GCGAGTTCAC CGTGCGCCGG
GTGCGCGAGA TGCGCCCGGA GATCGAGCGG ATCTGCGCCG AGCAGGCCCG CCACCTGCTC
TCCCTGCCGA CCCCCGCCGA CCTGGTCGGC GAGTACGCCA ACCGGGTGTC CACGACCGTC
ATCTGCCGCC TGCTCGGCGT CCCCCTGGAG GACCTGGAGT TCTTCCGGCG CATCACCACG
GTCTCGGGCG GGCGCGACAG CACCGAGGAG GAGGTGGGCG CCGCGCTCGC CGACCTGTTC
GGCCTGATCA ACCGGCTCAT CGACGAGAAG GCCGACGACC CGCGGGACGA CCTCATCAGC
CGCCTGGTCA CCGGCCCCCT GGCGCGCGGG GAGATCACCC GCCAGTCCCT GCTCTCCCAG
ATCGGCATCA CGCTCAACGC CGGGCACGAG ACCTCGCGCA ACATGATCCC GCTCGCGGCG
CTGGCCCTGC TGGACCACCC CGACCAGCTG GCCCTGCTGC GCGAGGACCC CTCGCTGTGG
CCGGGCGCGG TGGACGAACT CCTGCGCTAC CTGTCCGTGG CCGACGTCAT CACCCTGCGC
GTGGCCACCG AGGACATCCC CCTGGACGGG GCCACCATCC CGGCGGGCGA CGGATACATC
GCGCTGCTCG GCGCCGCCAA CCGCGACCCC GCGGCCTTCC CCGAGCCCGG GAAACTGGAC
GTGACCCGCT CGCAGCGGGG CCACGTCGCC TTCGGCTACG GGACGCACCA GTGCGTGGGG
CAGAACCTGG GGCGCCTGGA GATCGAGGTG GCCCTGCGGA CGCTGTTCGA GCAGGTGCCC
ACCCTGCGCC GGGCGGTGCC GCTGGAGGAG CTGCCCCAGC GTTCGGCCTC GGCGATCTTC
GGGCTGGAGG AGGTGCCGGT GTCGTGGTGA
 
Protein sequence
MTASAPEEHG SAAPPPAEFP LARSCPFSPP EAYEGMRAAA PLTRVTIPSG KQAWLLTRYD 
DVRTALSHPA SSSDARHPAF PALGRGEQAV AADLRPFIRM DPPDHTRVRR MLVGEFTVRR
VREMRPEIER ICAEQARHLL SLPTPADLVG EYANRVSTTV ICRLLGVPLE DLEFFRRITT
VSGGRDSTEE EVGAALADLF GLINRLIDEK ADDPRDDLIS RLVTGPLARG EITRQSLLSQ
IGITLNAGHE TSRNMIPLAA LALLDHPDQL ALLREDPSLW PGAVDELLRY LSVADVITLR
VATEDIPLDG ATIPAGDGYI ALLGAANRDP AAFPEPGKLD VTRSQRGHVA FGYGTHQCVG
QNLGRLEIEV ALRTLFEQVP TLRRAVPLEE LPQRSASAIF GLEEVPVSW
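
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short, dependency-free sketch. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the tables differ in which start codons are permitted). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and the sketch assumes the sequence is given in the coding orientation, as it is shown in this record.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence in this record.
first_bases = "ATGACCGCAT CCGCGCCCGA GGAACACGGT"
print(translate(first_bases))  # MTASAPEEHG
```

The translated fragment matches the first ten residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.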