Gene Ndas_4556 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4556 
Symbol 
ID9248437 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5398271 
End bp5399599 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content72% 
IMG OID 
Productputative cytochrome P450 
Protein accessionYP_003682449 
Protein GI297563475 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.833609 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCTTCCC GCTGCCCCTA CGCCGAACTC CACGCCCCCG AGTTCCACGC CGACCCGCAC 
GACGTCTACG CGCGGATGAC CAAGGAACAC GGCTCCGTCT TCCCCGCCGA GATCTCCCCA
GGGATCAGCG CCTGGGTGGT CGCCGACTAC GCCCTCATCA CGGCCTGGTG CAGGGACAGC
CGCACCTTCC GGCGCGACTC CCGGCTCTGG CTCGACTGGT CCGAGGGGCG CGTCCCCGAC
GACGCCCCCG TGGCCGCGAT GATGGCCTAC CGCCCCAACC TGCTCTTCGC CGACGACGAC
GCCCACGCCC GCCTCAAGCG GTCGGTGGTC GACTCCCTGG GCCGAGTGCC CGAGAGCCGT
GTCGGCGAGA GCACCCGCCG CCACGCCGAC AGCCTCATCG ACGCCTTCGT CGAACGCGGC
AGCGCGGAGC TGGTGTCCGA GTACGCCCGC CTGCTGCCGC TGCTGGTGCT CTGCGACCTG
TTCGGCTTCT CGCGCGAGGT CGGCGAGCGC GTCCTGACCT CGCTCGTGGA CCTGTGGGAC
GGCAACGACG TCGTGCGCGC CAACGCCGAG TACGAGCGCG CCCTGTCCGA CGCCATCTCG
CACAAGCACG CGTCGCCGGG GGAGGACATC ACCACCTGGC TGCTCCAGCA CCGGGCGGGG
CTCACCGACG AGGAGATGCT GCACCAGCTC GTCGTGACCA TCGGCGCCGG CGCCGAGCCG
ACCGCCAACC TCATCGCGGC CGCCACGCAC ACGATCCTCA CCGACCCCGA GGTGAGCCGG
GCCCTCAACG GCGCCCTCGT CGGGATCGAC GAGGTGGTCG AGCAGGTCCT GTGGCGCGAG
CCGCCCATCA CCAACTACCC GGTCCTCTAC CCGGTGCGCG ACGTGGAGAT CAGCGGCGGA
CGGGTCATCC GCAAGGGCGA GCCCATCCTC CTCGGCTACG CCGCGGCCCA CAGCGCGATG
GCCGAGAACG GACCCGACAT CGAGCACGTC ACCCGCGCCC ACCTGGCCTT CGGGGTCGGT
CCGCACCGCT GCCCGGCCCA CGGGTTCGCC CAGGCCATGG CCCAGGTCGG CGTGGAGGTC
CTCACCAGGC GCCTGCGCGG CCTGCGCGTG ACCGACCCGG CCCCGCGCTG GCGCGTCTCC
CCGTTCGCGC GGGCCCTGGC CGAACTCCCC GTGGAATTTC CGCCCGACTC CCCGAAGGGA
ACACCGCCAT GGCAGCAGAC ATCCCCGTCC TCCACCCCTA CGACCTCCAC CGACAGACAG
CCGGACTCCG CGACCGCGGC CCCGTCGTCG AGGTTGAGAT CACTGGTGGC GTGGTTGTTC
GGGCGCTGA
 
Protein sequence
MASRCPYAEL HAPEFHADPH DVYARMTKEH GSVFPAEISP GISAWVVADY ALITAWCRDS 
RTFRRDSRLW LDWSEGRVPD DAPVAAMMAY RPNLLFADDD AHARLKRSVV DSLGRVPESR
VGESTRRHAD SLIDAFVERG SAELVSEYAR LLPLLVLCDL FGFSREVGER VLTSLVDLWD
GNDVVRANAE YERALSDAIS HKHASPGEDI TTWLLQHRAG LTDEEMLHQL VVTIGAGAEP
TANLIAAATH TILTDPEVSR ALNGALVGID EVVEQVLWRE PPITNYPVLY PVRDVEISGG
RVIRKGEPIL LGYAAAHSAM AENGPDIEHV TRAHLAFGVG PHRCPAHGFA QAMAQVGVEV
LTRRLRGLRV TDPAPRWRVS PFARALAELP VEFPPDSPKG TPPWQQTSPS STPTTSTDRQ
PDSATAAPSS RLRSLVAWLF GR