Gene Dshi_3801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_3801 
Symbol 
ID5714330 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009956 
Strand
Start bp8358 
End bp10535 
Gene Length2178 bp 
Protein Length725 aa 
Translation table11 
GC content66% 
IMG OID641276716 
Productcatalase/peroxidase HPI 
Protein accessionYP_001542012 
Protein GI159046341 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0376] Catalase (peroxidase I) 
TIGRFAM ID[TIGR00198] catalase/peroxidase HPI 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.409615 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.295897 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGGCA ACGATCTGCG CAATGTCACG GGATGCCCCG TCATGCATGG CGGCAACACC 
GGCATGAACA CCAGCCCCGT GCGCTGGTGG CCCAACGCGC TCAACTTGGA CATCCTGCAC
CAGCATGGCG CCCGGGTCAG CCCGATGGAC CCCGATTACG ACCACCGCGA GGCGGTGAAG
GCTCTCGATT TCCAGGGCGT CTGGGATGAC GTGGACAAGC TGCTGACCGA CAGCCAGGAC
TGGTGGCCGG CCGACTGGGG GCACTATGGT GGCCTCTTCA TTCGCCTGTC GTGGCATGCC
GCGGGCTCCT ACCGTCTGGG CGACGGCCGT GGCGGGGCCG GCACCGGCAA CCTGCGGTTC
GAGCCGCTGA ACTCCTGGCC CGACAATGCC AGCCTCGAGA AAGCCCGCCG CCTGCTCTGG
CCGGTGAAGA AGAAATACGG CAACGCGCTG TCCTGGGCGG ACCTTCTGGT GCTCGCCGGG
ACCGTGGCCT ACTCCAACAT GGGGCTCAAG ACCTTCGGCT TCGCCTTTGG CCGCAAGGAC
ATCTGGGGCC CCGAGATCGA CATCAACTGG GGCAGCGACA GCGAGATCCT CGCCCCCACG
GACGAGCGTG TGACCGATGT GGCCGACGCC AACTCCATGG CCAATCCCCT GGCCGCGTCC
CATATGGGCC TGATCTACGT GAACCCCGAG GGGGTGAACG GCACGCCCGA TCCGGCCCAG
ACCGCGAAAT ACGTGCGCAT GACCTTCGCG CGCATGGCGA TGAATGACGA GGAAACCGCC
GCCCTGACCG TGGGCGGGCA CACCGTGGGC AAGGCCCATG GCGGCACGAT GGCCGACAAG
GTCGGCGCGG ATCCCGCCGG TTGCCCGGTG CACATGCAGG GCTTCGGCTG GGAAAACCCG
GGCTTCGACG GCAACGCCAA CACCGCCCAT ACCTCGGGGC TCGAAGGTGC GTGGACCTCC
AACCCGACCC AATGGGACAA CGGCTATCTA GAGTTGCTGT TCAAGTATGA CTGGGAAGTC
ACGAAATCCC GCGCAGGCGC CTTCCAGTGG GAGCCGGTCA ACATCGCCGA GGAGGACATG
GTCCCCGACG CCACCGACCC CTCGATCAAG CACAATCCGA TCATGACCGA CGCCGACATG
GCGATGAAGG TCGATCCGAT CTATCGCGAG ATCTGCGAGC GGTTCCACAA GGATCCGGAC
TACCTCGCCG ACACCTTCGC GCGCGCCTGG TTCAAGCTGA CCCACCGCGA CATGGGGCCC
AAGGCCAACT ACTACGGCCC CTTCGTGCCG CAAGAGGACC TGATCTGGCA GGATCCGATC
CCCGAAGGCC CCACCGGCTA TGACGTGGAC GCGCTCAAGG CCAAGATCGC CGAAAGCGGG
CTGAGCGCCG CCGAAATGAT CGCCACCGCC TGGGACAGCG CGCGGACCTT CCGCGGCTCT
GACATGCGCG GCGGAGCCAA CGGCGCCCGC ATCCGCCTCG TGCCCCAAAA GGACTGGGCT
GGCAACGAGC CGGAGCGTCT GGCCAAGGTC CTCGGCATTC TGGAGCCGCT GGCAGCGGCC
GCCGGAGCCT CCATCGCGGA TACCATCGTG CTGGCGGGCA ATGTCGGCAT CGAGATGGCG
ATCAAGGCCG CGGGCCAGGA TGTGCCGGTG CCCTTCGCAC CGGGCCGGGG CGACGCCACG
GATGCCATGA CCGATGCCGA AAGCTTCGAG GTGATGGAAC CCTTCGCCGA CGGCTTCCGC
AACTGGTCGA AAGAGCGGTA CTCCGTCAGC CCCGAAGAGC TGATGCTCGA CCGCGCGCAG
CTGCTGGGCC TGACGGCGGC GGAAATGGCC GTGCTCGTCG GCGGGCTCCG GGTGCTCGGG
GCCAACCATG GCGGCAGCGC CCATGGCGTC TTCACCGACC GGGTCGGCAC GCTGACCACG
GATTTCTTCC AGACCATCAC CGACATGGCC TATAAATGGG TCCCGCTCGA CGATGGCACC
TACGAGATCC GCGACCGCAA GTCCGGCGAC ACCGTCTACA CCGCGACCAG CGCGGACCTG
GTCTTCGGCT CCAACTCGCA ACTGCGGGCC ATTGCCGAGG TCTACGCCCA GGACGACAAC
GAGACGAAAT TCGTCCGTGA TTTCGTCGCC GCCTGGACCA AGGTGATGAA CGCCGACCGG
TTCGACCTGC TGGCCTGA
 
Protein sequence
MDGNDLRNVT GCPVMHGGNT GMNTSPVRWW PNALNLDILH QHGARVSPMD PDYDHREAVK 
ALDFQGVWDD VDKLLTDSQD WWPADWGHYG GLFIRLSWHA AGSYRLGDGR GGAGTGNLRF
EPLNSWPDNA SLEKARRLLW PVKKKYGNAL SWADLLVLAG TVAYSNMGLK TFGFAFGRKD
IWGPEIDINW GSDSEILAPT DERVTDVADA NSMANPLAAS HMGLIYVNPE GVNGTPDPAQ
TAKYVRMTFA RMAMNDEETA ALTVGGHTVG KAHGGTMADK VGADPAGCPV HMQGFGWENP
GFDGNANTAH TSGLEGAWTS NPTQWDNGYL ELLFKYDWEV TKSRAGAFQW EPVNIAEEDM
VPDATDPSIK HNPIMTDADM AMKVDPIYRE ICERFHKDPD YLADTFARAW FKLTHRDMGP
KANYYGPFVP QEDLIWQDPI PEGPTGYDVD ALKAKIAESG LSAAEMIATA WDSARTFRGS
DMRGGANGAR IRLVPQKDWA GNEPERLAKV LGILEPLAAA AGASIADTIV LAGNVGIEMA
IKAAGQDVPV PFAPGRGDAT DAMTDAESFE VMEPFADGFR NWSKERYSVS PEELMLDRAQ
LLGLTAAEMA VLVGGLRVLG ANHGGSAHGV FTDRVGTLTT DFFQTITDMA YKWVPLDDGT
YEIRDRKSGD TVYTATSADL VFGSNSQLRA IAEVYAQDDN ETKFVRDFVA AWTKVMNADR
FDLLA