Gene Ndas_3490 details


Gene Information

Locus tag: Ndas_3490
Symbol:
ID: 9247359
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4182414
End bp: 4183634
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: cytochrome P450
Protein accession: YP_003681397
Protein GI: 297562423
COG category:
COG ID:
TIGRFAM ID:
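
The length fields above follow from the coordinates. As a minimal sketch, assuming Start bp and End bp are 1-based and inclusive and that the CDS ends with a single stop codon, the arithmetic can be checked as follows. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a complete CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(4182414, 4183634)
print(length_bp)                  # 1221
print(protein_length(length_bp))  # 406
```

Both results agree with the Gene Length and Protein Length fields of this record.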


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGTGCCG ACGGGACCGA ATCGCTGATG GACTACCCCT TCCCCGGCCC GGAGGCGCTG 
GAGCCGCCCG CCGAGTGGGC CGAGCTGAGG CGGCGGTGCC CCGTCGCCGG GGTACGGCTG
CCCAGCGGCG ACCGGGCGGA GGTGGTCACC CGCTACGAGG ACGTCAGGCG GGTGCTCTCC
GACCCGCGCT TCACCCGGCG GCTCGACGCC GAGGACGCGG CCCGGGTCTC GGCCAGCGCG
ACGGGCGGCG TGTTCAACAG CGACCTGGCG GCGGCGGTCC CCGACAGCGG GGAGGAGCAC
CGGCAGTGGC GGCGGCTGGT CGGCAAGTGG TTCACCGCCA GGCGCATGAA CGCGCTGCGG
CCCGGCATCG AGGCCATGGC CGACCAGCTC GTCGACGGCA TGGTCGAGCG CGGCCACCCG
GCCGACCTCA GGGCGGACCT GGGCTTCCCC CTGCCGGTGT GGGTCATCTG CGACATGCTC
GGTGTGCCCG ACTCCGACCG CGACCGCTTC TCCCACTGGT CGGACGCGCT GCTCAATCTC
ACCCGCTACA CCCAGGAGGA GATGGACCGG GCCCAGGCCG AGTTCCTCGC GTACATGGGC
GGCCACGTCG AGGCCAAGCG GGCGGAGCCG CGCGAGGACC TGATCAGCTC CCTGATCACC
GAGACCGCCG AGGCCGGGTA CGACCTGCCC ACCTCCGCGC TGGTCGCCAC CAGCATGGGC
CTGCTGGTCG CCGGGCACGA GACCACCGCC AACATGATCG GCAAGATGGT GGCGATGCTG
CTGGACGACC GGAGCCGCTG GGAGAGGCTG CTGGCCGACC GGACGCTGGT GCGCTCCGCC
GTGGAGGAGG CGCTGCGCTT CGACGCCAAC TCCGGGCTCG GCCTGCCGCG CTACATCACC
GAGGAGACCG AGATCGGCGG CGCCACCGTC CCCGCCGGGA CCACCGTCAT GTGCAGCATG
GGCGCCGCCA ACAGGGACGA GAGCGCGTTC GAGAACGCGG ACGAGATGGA CCTGGGGCGC
GTGCCCAACC CGCACCTGGG CTTCGGCGCC GGGGCGCACT CGTGCCTGGG GCAGGCGCTG
GCCCGCACCG AACTCCAGGC CGTGCTCGAC GTGCTGCTGC GCAGACTCCC CACGCTGGAG
CTGGCCGTCC CCGCGGCGGA GCTGCGGCGA CTCGACGGCC TGGTCGTGGG CGGACTGCGC
GAGGTCCCGG TCCGGTGGTG A
 
Protein sequence
MGADGTESLM DYPFPGPEAL EPPAEWAELR RRCPVAGVRL PSGDRAEVVT RYEDVRRVLS 
DPRFTRRLDA EDAARVSASA TGGVFNSDLA AAVPDSGEEH RQWRRLVGKW FTARRMNALR
PGIEAMADQL VDGMVERGHP ADLRADLGFP LPVWVICDML GVPDSDRDRF SHWSDALLNL
TRYTQEEMDR AQAEFLAYMG GHVEAKRAEP REDLISSLIT ETAEAGYDLP TSALVATSMG
LLVAGHETTA NMIGKMVAML LDDRSRWERL LADRTLVRSA VEEALRFDAN SGLGLPRYIT
EETEIGGATV PAGTTVMCSM GAANRDESAF ENADEMDLGR VPNPHLGFGA GAHSCLGQAL
ARTELQAVLD VLLRRLPTLE LAVPAAELRR LDGLVVGGLR EVPVRW
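
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any computation. The sketch below is illustrative only: it strips the whitespace, computes a GC fraction, and translates a CDS with the standard codon assignments, which translation table 11 shares with the standard code. The helper names are hypothetical, and the start-codon set is an assumption limited to the three most common bacterial initiators (ATG, GTG, TTG); table 11 also permits rarer ones.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: only the most common bacterial initiation codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to format the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS, reading an initiation codon as Met and stopping at '*'."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # GTG encodes Val internally but Met at initiation
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("GTGGGTGCCG ACGGGACCGA ATCGCTGATG"))  # MGADGTESLM
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field of this record.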