Gene Ndas_2750 details


Gene Information

Locus tag: Ndas_2750
Symbol:
ID: 9246601
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 3292864
End bp: 3294114
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: cytochrome P450
Protein accession: YP_003680669
Protein GI: 297561695
COG category:
COG ID:
TIGRFAM ID:
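
The length fields above follow from the coordinates by simple arithmetic. The sketch below shows that relationship, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3292864, 3294114)
print(length, protein_length_aa(length))  # prints: 1251 416
```

Both values agree with the Gene Length and Protein Length fields of this record.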


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.650686
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCTGCC CCGTCCAGCA CACCCCCGAC GGAGTCCCGC TGGTGCACGC CGTGCCCGAC 
GACGTGCAGG CCGACCGCGA ACGCCTCCAC CGGGCCGGAC CCGTCACCCG GGTCGAACTC
CCCGGCGGGG TCCGCGCCTG GGCCACCACC CACCACGAGG TCAGCCGCGC CACCCTCAAC
GACCCCCGGT TCGTCAAGAG CGTCGACCAC TGGGACGACT ACCAGAGCGG CCGGGTCCCC
GAGGGCTGGC CGCTGATGGG CACGATCCCC ACCGACAGCT CCAACATGCT GGCCCAGGAC
GGCGCCGCCC ACCGGCGCAT GCGCAGACTC ACCGCCAGCC CGTTCTCGGC GCGCCGGGTG
GAGCGCCTGC GCCCGCGCAT CGAGGAGATC ACCGCCCGGG CGCTGGACGC CCTGGAGCCG
CGCGCGCACG AACCCCTGGA CCTGAAGTCG GAGTTCACCT TCCGGGTCCC CATGGGGGTG
ATCGGCGAAC TGTACGGGGT GGCCGAGGCG GAGTACGCCC AGCTCGGCGA GATGTACGCC
AAGCTCTTCT CCGGCACCAC CGAGGAGGGC GAGCACCTGC GGATCTACGG AGCCCTGTTC
CAGTTCTTCG CCGAGATGGT CGCCCGCAAG CGCGCCAGCC TGGACGAGCA CGACGACTTC
ACCGCCGACC TGCTCAGAGC TAGGGAGGAC GGCGACTCCC TCAGCGACAC CGAGGTCACC
ATCACCCTGC TGACGGTGGT GGCGGCCGGG CACGAGACCA CCGTCAACCT GCTCAACAAC
GTGGTGCGCG CCCTGCTCGC CCACCCCGAC CAGTTCGCCC TGCTCAAGGC GGGCAAGGTG
ACGTGGGAGC AGGTCATCGA GGAGACGCTG CGCTACGACC CGCCCAACAA CGTCATGATG
TTCCGCTTCG CCACCGAGGA CGTGGAGGTC GGCGGGCAGA CCATCCGCAA GGGCGAGGCG
CTGATGACGC ACTACGGCGC GGCCACCCGC GACCGCGCGG AGTTCGGTGA GGACCCGGAC
CCGTCCGTCT TCGACCCGCA GCGCACCCAG GGGCGCCACA TCACCTTCGG GTACGGCCCG
CACATCTGCC CCGGGGCGCC GCTGTCGCGG CTGGAGGCCG GGATCATCCT GCCGATGCTC
TTCGAGCGCT TCCCGGACCT GCGGCTGGCC GTCCCCGACG AGGAGCTGCG GGTGCAGTCC
GCGCTCTCGG TCACCAGCCT GAGGGAGTTC CCGGTCGTGC TGCGTCCCTG A
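
The gene sequence above is displayed in space-separated blocks of ten bases. As a rough sketch of how such a listing might be joined into one string and its GC fraction computed, the helpers below use only the Python standard library. The function names are hypothetical, and only the first row of the listing is used as sample input, so the printed fraction describes that row and not the whole gene.

```python
def clean_sequence(grouped: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no argument splits on any run of spaces or newlines.
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence listing, copied as displayed.
first_row = "ATGCCCTGCC CCGTCCAGCA CACCCCCGAC GGAGTCCCGC TGGTGCACGC CGTGCCCGAC"
seq = clean_sequence(first_row)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full listing to `clean_sequence` in place of `first_row` would give a value to compare against the GC content field of the record.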
 
Protein sequence
MPCPVQHTPD GVPLVHAVPD DVQADRERLH RAGPVTRVEL PGGVRAWATT HHEVSRATLN 
DPRFVKSVDH WDDYQSGRVP EGWPLMGTIP TDSSNMLAQD GAAHRRMRRL TASPFSARRV
ERLRPRIEEI TARALDALEP RAHEPLDLKS EFTFRVPMGV IGELYGVAEA EYAQLGEMYA
KLFSGTTEEG EHLRIYGALF QFFAEMVARK RASLDEHDDF TADLLRARED GDSLSDTEVT
ITLLTVVAAG HETTVNLLNN VVRALLAHPD QFALLKAGKV TWEQVIEETL RYDPPNNVMM
FRFATEDVEV GGQTIRKGEA LMTHYGAATR DRAEFGEDPD PSVFDPQRTQ GRHITFGYGP
HICPGAPLSR LEAGIILPML FERFPDLRLA VPDEELRVQS ALSVTSLREF PVVLRP