Gene Ndas_3579 details


Gene Information

Locus tag: Ndas_3579
Symbol:
ID: 9247448
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4291513
End bp: 4292718
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: Dyp-type peroxidase family
Protein accession: YP_003681486
Protein GI: 297562512
COG category:
COG ID:
TIGRFAM ID:
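
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are hypothetical and not part of the record.

```python
# Values copied from the record; the variable names are illustrative.
start_bp = 4291513
end_bp = 4292718
gene_length_bp = 1206
protein_length_aa = 401

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = span // 3
expected_residues = codons - 1

print(span, codons, expected_residues)  # 1206 402 401
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.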


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.519759
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCGGTG AGTCCCGGCT CAGCCGCAGA GGGCTGCTCC TGGGCGGCGC GGCGGCCGGT 
GTCGGCGCGG CCGGGGGCGC GCTCGCCCAC CGGTGGGCCG CGGAACCTGC GGCGGCCCCC
CAGGCGCCCC CGCTCAACGG CACGCTGACC GTACCGTTCC ACGGCGTGCG GCAGGCGGGC
GTGGAGACCC CGCCGCAGAC GCACGGCACC TTCCTCGCGC TGGACCTGGA ACCGGGGACG
GACGCGGACG GGGTCGGCCG GCTCCTGCGG CTGCTCAGCG ACGACGCGGC CCGCCTGTCC
CGGGGCGAGC CCGCCCTGGC CGACACCGAG CCCGAACTCG CCCTGGTCCC GGCCCGGCTC
ACCACCACGT TCGGCTTCGG GCGGGGTCTG GTGGAGCGGG TGGACCCCGA CGCGGTACCG
GAGTGGCTCG GGCCACTGCC CGAGTTCGGG CACGACCGGC TCGACCCGGC CTGGGGCGGG
GCGGACCTGC TGCTCCAGGT GTGCGCGGAC GACCCCGTCA CCGTCTCCCA CGCGGTGCGG
ATGATGCTCA AGGACGCGCG GGCCTTCGCG CGGGTGCGGT GGACGCAGAG CGGGTTCCGC
CGGGCCCACG GCTCCCAGCC CGAGGGCACC AGCATGCGCA ACCTGATGGG GCAGGTGGAC
GGGACCGTCA ACCCGGCGCC GGGAACCGGG GACTTCGACC GGCTGGTCTG GGGCGGGAAC
CCACCGCGGT GGCTCAGGGG AGGCACGAGC CTGGTGCTGC GCCGTATCGC CACCCACCTG
GACACCTGGG ACGAGCTGGA CCGGCCCGCC CGCGAGGCGG TCATCGGCCG CCGCCTGGAC
AACGGCGCGC CGCTGACCGG TACCGAGGAA CACGACGAGG CGGACCTGGA GGCCACGGAC
GCCTCCGGGC TGACCGTCAT CGCGGACTTC GCGCACATCA GGCGGGCCCG CACCGACGAC
CCGGACCAGC GGATCTTCCG GCGCGCCTAC AACTACGACG AGCGCGGCTC GGGCGGCGAG
GAGGCGGGGC TGCTGTTCGC CTCCTTCCAG GCCGACCCGC TGCGCCAGTT CGTGCCCATC
CAGCGGCGCC TGGACGAGCT GGACCTGCTC AACGAGTGGG TGACCGCCGT GGGCTCGGCG
GTGTTCGCCG TCCCGCCGGG CTGCGAGGAG GGAGGGTACG TGGGGCAGGC CCTGCTGGAG
GGGTGA
 
Protein sequence
MGGESRLSRR GLLLGGAAAG VGAAGGALAH RWAAEPAAAP QAPPLNGTLT VPFHGVRQAG 
VETPPQTHGT FLALDLEPGT DADGVGRLLR LLSDDAARLS RGEPALADTE PELALVPARL
TTTFGFGRGL VERVDPDAVP EWLGPLPEFG HDRLDPAWGG ADLLLQVCAD DPVTVSHAVR
MMLKDARAFA RVRWTQSGFR RAHGSQPEGT SMRNLMGQVD GTVNPAPGTG DFDRLVWGGN
PPRWLRGGTS LVLRRIATHL DTWDELDRPA REAVIGRRLD NGAPLTGTEE HDEADLEATD
ASGLTVIADF AHIRRARTDD PDQRIFRRAY NYDERGSGGE EAGLLFASFQ ADPLRQFVPI
QRRLDELDLL NEWVTAVGSA VFAVPPGCEE GGYVGQALLE G
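
The gene and protein sequences above can be spot-checked against each other by translating a few codons. The following is a minimal sketch, not the database's own method: it builds a codon table from the NCBI ordering of bases (T, C, A, G), relying on the fact that translation table 11 uses the same amino acid assignments as the standard table and differs only in its permitted start codons. The function name `translate` and the two test fragments are chosen here for illustration; the fragments are the first three blocks and the last line of the gene sequence.

```python
from itertools import product

# Amino acid assignments in NCBI order (bases T, C, A, G).
# Tables 1 and 11 share these assignments; only the start codons differ.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

first_blocks = "ATGGGCGGTG AGTCCCGGCT CAGCCGCAGA"  # first 30 nt of the gene
last_line = "GGGTGA"                               # final codon pair, in frame

print(translate(first_blocks))  # MGGESRLSRR
print(translate(last_line))     # G
```

The first ten residues and the final residue agree with the listed protein sequence. Translating the whole gene the same way would require pasting the full sequence into the call.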