Gene Ndas_1777 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1777 
Symbol 
ID9245627 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2175293 
End bp2176552 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content77% 
IMG OID 
ProductDyp-type peroxidase family 
Protein accessionYP_003679711 
Protein GI297560737 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.822896 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.374196 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGGGGG ACGCGCACGA ACCGCCCCGT TTCTCCCGGC GCGGTCTGCT CACCGCCGTG 
GGGGCGGCGG GGATCGCCGG AATCGGCGCC GGAGGGCTGA CGGGGTACGC CTCCGCCGCC
GCGGGCGCGG AGGAACGGGA CGCCCCCGCC CTCGACCCCG CCCGCTCCAG GACCGGCTCG
CGGGAGGGAC GCCCTCCCGC GCTGCTGACG CCCACGCCCG CGCACGTGCG GGTGGTGGCC
GTGGACGTGA ACGCCCAAGA CGCGGCGGAC GTGCGCGTGG CGGCGCGCGA GGTCCTCGGC
GCCTGGAGCC GTGAGGCCCG TTCCCTGCAC GAACGGGGCC CGGCCGCGCT CGGGGAGGGG
GCGCCCTCCC AGGGCCTGCA CCCCGCCTCG CTGGGGGTCA CCCTCGGGCT GGGCCCCTCC
CTGCTGGAAC GCGCGGGGCT GGCCGACCGG CGTCCGCCGC ACATGGAGGA CCTGCCCGCC
TTCGACTCCG ACCGCCTCGA CCCCGCCTGG TGCGGCGGCG ATCTCATGCT GCACGTGGGA
GCCGAGGACC CCCTGGTCCT CAGCTCCGCG GTCGACCACC TGCTGCGCGC GGCCCGGGGC
CGGGTCGGGG TCCGCTGGTC GCTGTCCGCC TTCCAGCGGT CGGCCGCGGC CGCCGCCGAC
CCCGCCGCCA CACCGCGCAA CCTCATGGGC CAGATCGACG GGACGGTCAA CCCGCGCCCC
GACGAGGCCC TGTTCGCCAC CCAGGTCCTG GCCTCCCACA CCGAGCCGTC CCTGGCCTGG
ATGGACGGGG GGTCCTACGT GGTCGTGCGG CGCATCCGCA TGCTGCTGGA CGACTGGTTC
GCCCTGGAGA CCCGACGGCG CGAGGACGTC ATCGGACGCC GCCTGTCCGA CGGCGCGCCC
CTGGGCGGGG ACCGCGAGCA CGACCGGCCC GACCTGTCGG CCAGGGACGG CGCGGGGGAG
CCGGTCATCG CCCGTGACGC CCACATCAGG CTCGCCAGCC CCGAGAGCAC GCTGGGAGCA
CGGATGCTGC GCCGGGGCTT CAGCTACGAC CTGGGCTGGG ACGCCGACGG CCGCAGGCAG
GCGGGCCTGC TCTTCACCGC CTGGCAGGCC GATCCGCGCA CCGGGTTCAC GGCGGTGCAG
CGCAACCTCG ACGAGGGCGG GGACGCGCTC GGCGCCTACG TCAGACACGA GGGCAGCGCA
CTGTTCGCGG CGCCGCCGGT GCGGGAGGGG GAGCCCCGTG TGGCGCACAC CCTGCTGTGA
 
Protein sequence
MTGDAHEPPR FSRRGLLTAV GAAGIAGIGA GGLTGYASAA AGAEERDAPA LDPARSRTGS 
REGRPPALLT PTPAHVRVVA VDVNAQDAAD VRVAAREVLG AWSREARSLH ERGPAALGEG
APSQGLHPAS LGVTLGLGPS LLERAGLADR RPPHMEDLPA FDSDRLDPAW CGGDLMLHVG
AEDPLVLSSA VDHLLRAARG RVGVRWSLSA FQRSAAAAAD PAATPRNLMG QIDGTVNPRP
DEALFATQVL ASHTEPSLAW MDGGSYVVVR RIRMLLDDWF ALETRRREDV IGRRLSDGAP
LGGDREHDRP DLSARDGAGE PVIARDAHIR LASPESTLGA RMLRRGFSYD LGWDADGRRQ
AGLLFTAWQA DPRTGFTAVQ RNLDEGGDAL GAYVRHEGSA LFAAPPVREG EPRVAHTLL