Gene Ndas_4585 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4585 
Symbol 
ID9248466 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5434411 
End bp5436519 
Gene Length2109 bp 
Protein Length702 aa 
Translation table11 
GC content71% 
IMG OID 
ProductCatalase 
Protein accessionYP_003682478 
Protein GI297563504 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCACTG AAGGACGTGG GGGAAGCGAG GGGCGCGACG CCAAGGACCG CCAGCTCGAC 
GCGGTCAGGA CCGACTCCGA CCAGCCCCTG ACCACCGAGA CCGGCGTCCG GGTGGGCGAC
ACCGACAACT CGCTCCGGGC CGGGGCGCGC GGCCCCACAC TGCTGGAGGA CTTCCACGCC
CGGGAGAAGA TCACCCACTT CGACCATGAG CGGATCCCCG AACGGGTGGT GCACGCGCGC
GGCGCGGGCG CCTACGGCCA CTTCCGCCCC TACGAGGGGC TCTCCGAGCT GACCGCGGCC
CACTTCCTGA GCGGCCCCGA CGTCACCACG CCGGTCTTCG TCCGCTTCTC CACGGTCGCC
GGGTCGCGCG GATCCGCCGA CACGGTGCGC GACGTGCGCG GGTTCGCGGT GAAGTTCTAC
ACCGAGGAGG GCAACTACGA CCTCGTCGGC AACAACATGC CGGTCTTCTT CATCCAGGAC
GGCGTCAAGT TCCCCGACTT CGTCCACGCG GTCAAGCCCG AGCCCCACAA CGAGATCCCC
CAGGCGCAGT CCGCGCACGA CACCCTCTGG GACTTCGTGG GCCTGCAGCC CGAGACCACG
CACATGATGA TGTGGCTGAT GTCGGACCGG GCGATCCCGC GCAGCTACCG GACGATGCAG
GGCTTCGGGG TCCACACGTT CCGGTTCGTC AGCGCCGAGG GCCGCGGCAC CTTCGTCAAG
TTCCACTGGA GGCCCCTGCT CGGCACGCAC TCGCTGGTCT GGGACGAGGC GCAGCGCGTC
CAGGGCAAGG ACCCCGACTT CAACCGGCGC GACCTGTGGG AGGCCATCGA ACGCGGCCAG
TACCCCGAGT GGGAGCTGGG CGTGCAGCTG GTGCCGGAGG AGGACGAGCA CGCCTTCGAC
TTCGACCTGC TGGACGCCAC CAAGATCATC CCCGAGGAGC AGGTCCCGGT CCGGCCGATC
GGAAAGCTGA CCCTGAACCG GAACCCGGAC AACTTCTTCG CGGAGACGGA GCAGGTGGCC
TTCCACACCG CCAACGTGGT CCCGGGCATC GACTTCACCA ACGATCCGCT GCTCCAGGCC
CGCAACTTCT CCTACCTGGA CACCCAGCTG CTGCGCCTCG GCGGCTCCAA CTTCCAGCAG
ATCCCGATCA ACCGTCCGGT GGCCCCGGTG AGCAACAACC AGCGGGACGG CTTCGCCCAG
CACCGCGTGC ACCGGGGCGG GACCTCCTAC CACCCCAACA GCCTGGGCGG CGGCTGCCCG
GCCCTGGCGC ACGGGGAGGG CGTCTACCGC CACTACACGG AGAAGGTCGA GGGCGACAAG
ATCCGGGTGC GCAGCGAGAG CTTCGCCGAC CACTACTCCC AGGCCACGCT CTTCTACAAC
AGCCTCGCCG ACTGGGAGCG CGAGCACGTG GTGGAGGCCT TCCGCTTCGA GCTGGGCAAG
TGCGAGGAGC TGGAGGTGCG CCAGCGGGTG GTGGCCAACC TCAACCACGT GGACCACGGG
CTGTCCGTCC GGGTCGCGGA GGGGATCGGC GTCGAGCCGC CGGAGCGGGA GGCCACGCCC
AACCACGGCC GCTCCTCGCC CGCCCTGAGC CAGCTCAACA CCGCGTTCGG CTCCGTCGTC
GGCCGCAAGG TGGCCGTCCT GGTCGCGGAC GGCGTGGACG AGGCCGCCGT CGACGCGTTC
CGCGAGCCCC TGATCGGGGC CGGTGCCGTG GTCGAACTCC TCGCGGCCGC CGACGGCGCG
GTCCGCACCT CGCGGGGCGG CGTCGTCACC GTCGACCGGG CCTACCCGAC CGTCTCCTCG
GTGCTCTACG ACGCGGTCGT GGTGACCGGC GGTTCCGACG CCGCACGGAC GCTGGCGGGC
GACGGGCTCG CGCGGCACTT CGTGCTGGAG GCGTACAAGC ACCACAAGCC GGTGGCCGCG
GTCGGCGCGG GGACGGAGCT GCTCGCCCCG CCGCTGCCGG AGGAGCTGAC CGGGCCCGTG
GACGCCCTGC GGGTGGAACT GGGCGTGGTC GCCTCGCCGG ACGCGGGTGA GGACGGCGTC
GCCGAGTTCA CGAGGCTGCT GGGCGGGCAC CGGTTCTGGG ACCGGCCGAC GGCGTCCGTG
CCCGCCTGA
 
Protein sequence
MSTEGRGGSE GRDAKDRQLD AVRTDSDQPL TTETGVRVGD TDNSLRAGAR GPTLLEDFHA 
REKITHFDHE RIPERVVHAR GAGAYGHFRP YEGLSELTAA HFLSGPDVTT PVFVRFSTVA
GSRGSADTVR DVRGFAVKFY TEEGNYDLVG NNMPVFFIQD GVKFPDFVHA VKPEPHNEIP
QAQSAHDTLW DFVGLQPETT HMMMWLMSDR AIPRSYRTMQ GFGVHTFRFV SAEGRGTFVK
FHWRPLLGTH SLVWDEAQRV QGKDPDFNRR DLWEAIERGQ YPEWELGVQL VPEEDEHAFD
FDLLDATKII PEEQVPVRPI GKLTLNRNPD NFFAETEQVA FHTANVVPGI DFTNDPLLQA
RNFSYLDTQL LRLGGSNFQQ IPINRPVAPV SNNQRDGFAQ HRVHRGGTSY HPNSLGGGCP
ALAHGEGVYR HYTEKVEGDK IRVRSESFAD HYSQATLFYN SLADWEREHV VEAFRFELGK
CEELEVRQRV VANLNHVDHG LSVRVAEGIG VEPPEREATP NHGRSSPALS QLNTAFGSVV
GRKVAVLVAD GVDEAAVDAF REPLIGAGAV VELLAAADGA VRTSRGGVVT VDRAYPTVSS
VLYDAVVVTG GSDAARTLAG DGLARHFVLE AYKHHKPVAA VGAGTELLAP PLPEELTGPV
DALRVELGVV ASPDAGEDGV AEFTRLLGGH RFWDRPTASV PA