Gene Daro_3836 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3836 
Symbol 
ID3568195 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp4126504 
End bp4128714 
Gene Length2211 bp 
Protein Length736 aa 
Translation table11 
GC content62% 
IMG OID637682310 
Productheme catalase/peroxidase 
Protein accessionYP_287034 
Protein GI71909447 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0376] Catalase (peroxidase I) 
TIGRFAM ID[TIGR00198] catalase/peroxidase HPI 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value0.141049 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAACG AACAGAAATG CCCATTTTCT GGAACACATG GTGCCCGCAC GACGGTTGGC 
ACCCAATCCA ATCGTGACTG GTGGCCTAAG GTGTTGAACC TCAACATCCT CCACCAGCAC
GCACCTGCCG CCAATCCGAT GGATGCGGAT TTCGACTATT CCGAAACCTT CAAGACGCTC
GATTTCGGCG CCCTGAAGCA GGATCTCTAC GCGTTGATGA CCACGTCCCA GGATTGGTGG
CCAGCCGACT GGGGCCACTA TGGCGGACTT TTCATTCGCA TGGCCTGGCA TAGTGCCGGC
ACTTATCGCA CTGGCGATGG GCGCGGTGGT GCGGGGACCG GCAACCAGCG TTTCGCCCCG
ATCAACAGCT GGCCGGACAA TGGCAACCTG GACAAGGCGC GCCGCCTGCT CTGGCCAATC
AAGCAGAAGT ACGGCAACAA GATTTCCTGG GCTGACCTGA TGATCCTCGC CGGCAACTGT
GCGCTTGAAT CGATGGGCTT CAAGACCTTC GGCTTCGGTG GTGGCCGTGT CGATATCTGG
CAACCGGAAG AAGACATCTA CTGGGGCGCC GAGCGGGAAT GGCTGGCAAC CAGCGACAAA
CCCAACAGTC GCTACTCTGG TGAGCGCAAT CTGGACAATC CGCTGGCTGC CGTGCAGATG
GGCCTCATCT ACGTCAATCC GGAAGGCCCG GATGGCAATC CCGACCCGGT CGCTTCCGGG
CGCGACATTC GTGAAACCTT TGCCCGCATG GCCATGAACG ATGAAGAAAC GGTCGCCTTG
ACCGCCGGTG GCCACACCTT CGGCAAGGCC CACGGCGCTG GCGACCCTGC CCTGGTCGGT
CCTGAGCCGG AGGCCGCGCC AATCGAGGAA CAGGGCTTGG GCTGGATCAA CAAGTTCGGT
TCCGGCAAGG GGATACATGC GACCACCAGC GGCATCGAAG GTGCCTGGAA GCCAAACCCG
ACGAAATGGG ATAACGGCTA CTTCGACATG TTGTTCGGTT ACGAGTGGGA GCTGACCCGG
AGCCCGGCTG GCGCCAAGCA GTGGGTGGCC AAGGACTGCA AGCCGGAACA CCTGATTCCG
GATGCTCACG ACCCGAGCAA GAAACATCCG CCAATGATGA CGACCGCTGA CCTGGCGATG
CGCTTTGACC CGATTTACGG GCCTATCTCG CGCCGCTTTC ACCAGGATCC GGCTGCCTTC
GCCGATGCAT TCGCCCGTGC CTGGTTCAAG CTCACCCATC GTGACCTAGG ACCCAAGGCA
CGCTACCTCG GACCGGAAGT CCCGGCCGAA GACCTGGTCT GGCAGGACCC GATCCCGGCA
GTCGATCATC CGTTGATTGA GGTGACCGAT GTCGCCAGCC TCAAGGCCAA GTTGCTCGCC
TCCGGCCTGT CGACCGCCGA ACTGGTATCG ACGGCCTGGG CCTCGGCGTC GACCTTCCGC
GGTTCCGACA AACGGGGTGG CGCCAATGGC GCCCGTATCC GCCTTGCGCC GCAGAAGGAT
TGGGCGGCCA ACCAGCCAGC CCAACTGGCC AAGGTGCTGG GCGTGCTCGA AGGTATCCAG
CAGGCGTTCA ATAGCGCGCA AACCGGCGGC AAGAAAGTCT CGCTGGCCGA CCTCATCGTG
CTGGGTGGCT GTGCTGCCGT CGAGGCGGCG GCCAAGGCAG CCGGCTTTGC GGTCGCGGTG
CCTTTCACTC CGGGCCGAAC CGATGCATCG CAAGAACAGA CCGATGCCGA GTCCATCGCA
GTGCTGGAGC CGGAAGCCGA CGGTTTCCGC AACTACCAGA AGAAAACCTA TTCGGTATCG
GCCGAGGAAA TGCTGGTGGA CAAGGCACAA CTGCTCACCC TCAGCGCCCC GGAAATGACG
GTGCTGGTCG GAGGCCTGCG CGTCCTGGGC GGCAATGTCG GCGGTTCGTC GGATGGCGTC
TTCACGACAA CGCCAGGCAC ATTGAGCAAC GACTTCTTCG TCAATCTGCT CGATATGGGA
ACGGTCTGGA AACCGGCTGC CGAATCCGCC GGCCGCTATG AGGGACGTGA CCGCCAGACG
GGCGTGGCCA AATGGACTGC GAGCCGCGTC GATTTGATCT TCGGATCGAA TTCCCAGCTC
CGTGCCCTGG CTGAGGTCTA TGCCCAGAAC GATGCGCAGG AGAAATTCGT GCGGGACTTC
ATCGCCGCCT GGAGCAAAGT GATGGAACTG GATCGCTTCG ATCTGAAGTA A
 
Protein sequence
MSNEQKCPFS GTHGARTTVG TQSNRDWWPK VLNLNILHQH APAANPMDAD FDYSETFKTL 
DFGALKQDLY ALMTTSQDWW PADWGHYGGL FIRMAWHSAG TYRTGDGRGG AGTGNQRFAP
INSWPDNGNL DKARRLLWPI KQKYGNKISW ADLMILAGNC ALESMGFKTF GFGGGRVDIW
QPEEDIYWGA EREWLATSDK PNSRYSGERN LDNPLAAVQM GLIYVNPEGP DGNPDPVASG
RDIRETFARM AMNDEETVAL TAGGHTFGKA HGAGDPALVG PEPEAAPIEE QGLGWINKFG
SGKGIHATTS GIEGAWKPNP TKWDNGYFDM LFGYEWELTR SPAGAKQWVA KDCKPEHLIP
DAHDPSKKHP PMMTTADLAM RFDPIYGPIS RRFHQDPAAF ADAFARAWFK LTHRDLGPKA
RYLGPEVPAE DLVWQDPIPA VDHPLIEVTD VASLKAKLLA SGLSTAELVS TAWASASTFR
GSDKRGGANG ARIRLAPQKD WAANQPAQLA KVLGVLEGIQ QAFNSAQTGG KKVSLADLIV
LGGCAAVEAA AKAAGFAVAV PFTPGRTDAS QEQTDAESIA VLEPEADGFR NYQKKTYSVS
AEEMLVDKAQ LLTLSAPEMT VLVGGLRVLG GNVGGSSDGV FTTTPGTLSN DFFVNLLDMG
TVWKPAAESA GRYEGRDRQT GVAKWTASRV DLIFGSNSQL RALAEVYAQN DAQEKFVRDF
IAAWSKVMEL DRFDLK