Gene Ndas_4555 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4555 
Symbol 
ID9248436 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5397189 
End bp5398289 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content74% 
IMG OID 
Productcytochrome P450 
Protein accessionYP_003682448 
Protein GI297563474 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTTGTTC GGGCGCTGAC CGGCCACGAC GTCATCATGG AGGCCCTCGC CCACCCCGGC 
GTGCGCAAGG AGGCGCGGCA CTGGCGGGCG TGGGCGCAGG GCGAGATCCC CATGGACTGG
CCGCTGATCT CCTGGGTGGC CCCGGACAAC ATGCTCACCG CGGACGGCGA CCGCCACCGG
CGCCTGCGGA CCCTGGTCTC ACAGGCGTTC ACCCCGCGCC GGGTGGAGGA GCTGCGCCCG
AGGATCACCG AGATCACCGC CGAACTGCTC GACGGCCTCG CCGCCGCCGG GCCGGGACCG
GTGGACCTCA AGGCGGCGCT GTCGCTGCCG CTGCCCATGA CGGTCATCTC CGAGCTGTTC
GGGGTCGGGG CGGAGCACCG CGGCACCCTC CACACCCTGA TGCACCGGGT CTTCGACGCG
ACCACCACCC CGGAGGTGGC GGCACGGACC AACGCCGACA TGCAGGCCTT CCTGGCGGAG
CTGGCCGAGC GCAAGTCCCG TGAGCCGGGC GACGACCTCA CCAGCGCGCT GCTCCAGGCG
CGGGCGGAGG GCGACGAGCG GCTCTCCCAC ACCGAACTGG TGTGGACGCT GATCCTCATG
ATCGGCGCGG GCTACGAGAC CACGATGAAC CTCATCACCA ACGCCGTGCA CGCGCTGCTC
ACCCACCGGG ACCAGCTGGA GCTGGTGCGG TCCGGCGGCG CCTTCTGGGC CGACGCGGTC
GAGGAGACCC TGCGCTGGGA CGCCAGCATC CAGTACCTGC CGCTGCGCTA CACGGCCGAG
GACGTCACCC TGGCGGGCAC GCGCGTGCCC GCGGGGGAGG CGCTGCTCAT GGGCTTCGGC
GCGGCGGGGC GCGACCCGGG GCGCCACGGC GAGGACGCGC ACGCCTTCGA CCTGCGGAGG
GAGCAGCGGG GGCACCTGGC CTTCTCGCAC GGTCCGCACT TCTGCCTGGG CGCGGGGCTG
GCCCGGCTGG AGGGCGTGAC CGCGCTGGAG GCGCTCTTCA CGCGCTTCCC GGACCTGCGC
CTGGCCGAGG GGGCGGAGGT GGAGCAGGCC CCGTCCATCG TGGCGAGCGG TCGGGCCGCG
CTCCCCGTCG TCTGGGGCTG A
 
Protein sequence
MVVRALTGHD VIMEALAHPG VRKEARHWRA WAQGEIPMDW PLISWVAPDN MLTADGDRHR 
RLRTLVSQAF TPRRVEELRP RITEITAELL DGLAAAGPGP VDLKAALSLP LPMTVISELF
GVGAEHRGTL HTLMHRVFDA TTTPEVAART NADMQAFLAE LAERKSREPG DDLTSALLQA
RAEGDERLSH TELVWTLILM IGAGYETTMN LITNAVHALL THRDQLELVR SGGAFWADAV
EETLRWDASI QYLPLRYTAE DVTLAGTRVP AGEALLMGFG AAGRDPGRHG EDAHAFDLRR
EQRGHLAFSH GPHFCLGAGL ARLEGVTALE ALFTRFPDLR LAEGAEVEQA PSIVASGRAA
LPVVWG