Gene Ndas_1142 details

Gene Information

Locus tag: Ndas_1142
Symbol:
ID: 9244992
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1396135
End bp: 1397400
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: putative cytochrome P450
Protein accession: YP_003679089
Protein GI: 297560115
COG category:
COG ID:
TIGRFAM ID:
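
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1396135, 1397400)
print(length)                  # 1266
print(protein_length(length))  # 421
```

Under these assumptions, 1397400 - 1396135 + 1 = 1266 bp, and 1266 / 3 = 422 codons, of which one is the stop codon, leaving 421 residues.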


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.15529
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACCACA GCCCGGGTCC GGAGGCCGCG GCACCGGCCA GCAGCGGCCC GCCGCCCCCC 
TCCCCGGACC ACGCGGGCGC CGCCCGCCTG TACGGCCCCG ACATCGCCCG CGACCCGGCC
GGGCTCTACG AGGAGCTGCG CGGACGCTAC GGCCCGGTCG CGCCCGTGCT GCTCGACGGC
GACGTTCCGG CGTGGTTCGT CATGGGCTAC CGGGAGCTGC ACCACGTCAC CAGCAGACCC
GAGTGGTTCG CCCGCGACTG CCGCCGCTGG AACCAGTGGG ACCGCGTCGG CCGGGACTGG
CCGCTCCTGC CCTACGTGAT GTGGACGCCC TCGGTCATGT TCGCCGAGGG GGCCGAACAC
CAGCGGCGCG CGGGCGCGAT CGGCGACGCC CTGGACGCCG TGGACCGCGC CGACCTCAGG
ACCCTGTGCC AGCGGGCCGC CGACGGCCTC GTCGACGCCT TCCGCGCGGA GGGCGAGGCC
GACCTGGTCT CCCAGTACGC GCACCGCATC CCCGCGATGG TGGTCGCCCG GCTCTGCGGC
CTGCCGGAGG CGGAGGTCCC GGCACTGGTG CGCGACGTCG TGCTCTCGCT CGACGTCGCC
GCCGACGCGG GCGGCGCCCA CCGGCGCCTG CACGCCCGCC TGGACGCCCT GGTCGCCGAC
CGCCGCGCCG CGCCGCGCGA CGACGTGCCC TCGCGGCTGC TGCTGCACCC GGCGGGCCTG
AGCGACGCCG AGGCGGTCAT CGACCTGCTG GTGGTCCTCG CCGCGGCCCA GGCGCCCACC
GGCGACTGGA TCGGCAACAC CCTGCTCCTG ATGCTCACCG ACGACCAGTT CTCGCTCACC
CTCCAGGGCG GGCGCGGCAG CGCGGGCGAG GCCCTCAACG AGGTGCTGTG GAAGGACACC
CCCACGCAGA ACTTCATCGG CCGGTGGGCC GTGCAGGCCT GTGAGCTGGG CGGGCGCGGC
ATCCGACGCG GCGACATGCT GGTGCTGGGA CTGTCGGCGG CCAACGCCGA CCCCCGCCTG
CACGCAGGGC TCCCCGACGT CCACAGCGGC AACCGCGCGC AGATGTCCTT CGGCCACGGC
GAGCACGGGT GCCCCTTCCC GGCACCGGAG ATCGCGGAGA CGATCGCCCG CACGGCGGTG
GAGAGCCTGC TGGACCGGCT GCCGGACGTG CGGCTGGCGG TGCCTTCGGA GCGGCTGGAG
TGGCGGCCGT CGCTGTGGAT GCGCGGACTG TTCGAGCTCC CGGCGCGCTT CACGCCGCAA
CGGTAG
 
Protein sequence
MNHSPGPEAA APASSGPPPP SPDHAGAARL YGPDIARDPA GLYEELRGRY GPVAPVLLDG 
DVPAWFVMGY RELHHVTSRP EWFARDCRRW NQWDRVGRDW PLLPYVMWTP SVMFAEGAEH
QRRAGAIGDA LDAVDRADLR TLCQRAADGL VDAFRAEGEA DLVSQYAHRI PAMVVARLCG
LPEAEVPALV RDVVLSLDVA ADAGGAHRRL HARLDALVAD RRAAPRDDVP SRLLLHPAGL
SDAEAVIDLL VVLAAAQAPT GDWIGNTLLL MLTDDQFSLT LQGGRGSAGE ALNEVLWKDT
PTQNFIGRWA VQACELGGRG IRRGDMLVLG LSAANADPRL HAGLPDVHSG NRAQMSFGHG
EHGCPFPAPE IAETIARTAV ESLLDRLPDV RLAVPSERLE WRPSLWMRGL FELPARFTPQ
R
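
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before any computation. The following is a minimal sketch of how one might do that and compute a GC fraction to compare against the reported GC content. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and upper-case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from this record, as displayed.
first_line = "GTGAACCACA GCCCGGGTCC GGAGGCCGCG GCACCGGCCA GCAGCGGCCC GCCGCCCCCC"
seq = clean_sequence(first_line)

print(len(seq))               # 60
print(seq.startswith("GTG"))  # True
print(f"{gc_fraction(seq):.2%}")
```

Running the same two functions over the full gene sequence (all lines pasted into one string) gives the value to compare with the GC content field. The sequence starts with GTG, which translation table 11 accepts as an alternative start codon, consistent with the protein sequence beginning with M.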