Gene Cag_0386 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0386 
Symbol 
ID3747554 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp445775 
End bp447976 
Gene Length2202 bp 
Protein Length733 aa 
Translation table11 
GC content47% 
IMG OID637772914 
Producthaem catalase/peroxidase 
Protein accessionYP_378702 
Protein GI78188364 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0376] Catalase (peroxidase I) 
TIGRFAM ID[TIGR00198] catalase/peroxidase HPI 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGAGG AACGTAAGTG CCCGATAACG GGAGCAACCC ACAAACCTTC TGCTGAAAAG 
GGGAGATCGA ACCACGATTG GTGGCCTAAC CAGTTAAACC TGAAAATTCT TCATCAACAC
TCCGCATTGA GCAATCCGAT GGATAAGGAT TTCAACTATG CAGAGGAATT TAAAAAGCTT
GATTTAGCAG CAATTAAACA AGACCTTTAT GCCTTAATGA CCGACTCGCA AGAGTGGTGG
CCTGCTGATT ATGGGCATTA TGGTCCGCTT TTTATTCGTA TGGCTTGGCA CAGTGCAGGA
ACGTACCGTA CCAGCGATGG GCGTGGTGGT GCTGGAACTG GCAGTCAACG GTTTGCTCCG
CTGAATAGTT GGCCTGATAA CGCAAACCTC GACAAAGCTC GCCGATTACT TTGGCCTATT
AAGCAGAAAT ATGGTCGCCA AATATCGTGG GCTGACTTGA TGATACTTAC TGGTAACTGT
GCGCTTGAAT CTATGGGATT GAAAACGTTT GGTTTTGCAG GAGGGCGCGA AGATATTTGG
GAACCAGAAG AGGATATTTA TTGGGGAACT GAGGGTGAAT GGCTTGCCGA TAAACGTTAT
TCGGGTGAGC GGGAACTTGA GAAACCCCTT GCGGCAGTAC AAATGGGGCT TATCTACGTG
AATCCTGAAG GTCCAAACGG TAAACCTGAT CCATTGGCAG CAGCCAAAGA TATTCGCGAA
ACCTTTGCTC GCATGGCAAT GAATGATGAA GAGACGGTTG CACTCATTGC TGGTGGACAC
ACCTTTGGTA AAACGCACGG TGCTGGTGAT GCCTCGCAGG TTGGTCCTGA ACCTGAAGCG
GCTGGTATTG AAGAACAAGG TTTAGGGTGG AAAAACCAAT ATGGTACAGG AAAAGGCAAA
GATACCATTA CCAGTGGCTT GGAAGTGATT TGGACAACAA CCCCAACAAA GTGGAGCAAC
AACTTTTTCT GGAATCTTTT TGGATACGAG TGGGAACTCA CCAAAAGCCC AGCCGGCGCC
TATCAATGGA CACCAAAGTA TGGCGTTGGA GCCAATACCG TGCCCGATGC GCACGACCCC
TCAAAGCGCC ACGCACCTGC TATGATGACC ACCGACCTTG CTTTGCGTTT TGATCCTGAT
TATGAAAAGA TTGCACGTCG GTATTACGAA AATCCTGATC AGTTTGCAGA TGCTTTTGCG
CGTGCATGGT TTAAGCTCAC CCACCGCGAT ATGGGTCCAC GCTCACGCTA TCGTGGTGCA
GAGGTTCCAG TTGAGGAGCT TATTTGGCAG GATACTATTC CTGCTCTTGA TCATGAACTT
ATTGGAGCTG ATGAGATTGC AGCCTTAAAA GCAACAATTC TTGCTTCAGA ACTCTCCATT
GCACAACTTA TTTCAACAGC TTGGGCATCA GCCGCAACCT TCCGCAATTC TGATAAGCGT
GGTGGAGCAA ATGGTGCGCG CCTGCGCCTT GCGCCTCAAA AAGATTGGGA AGTCAATCAA
CCTGATGAGT TGCAGAAAGT GTTACAAGTT CTCGAAACCA TACAAACAGA GTTTAATGCC
TCTCGAAATG ATGGTAAAAA AGTCTCTCTT GCTGACCTGA TTGTGTTGGG TGGATGTGCT
GCCATTGAAG CCGCCGCCGA AAAAGCGGGC TACAAGGTTA CCGTACCTTT TACTCCGGGA
CGTATGGATG CAACTCAAGA GGAAACCGAT GCACACTCTT TTGCGGTGCT TGAACCCGTT
GCTGACGGTT TTCGTAACTA TCTCAAAGCG AAATACTCTT TTTCAGTCGA AGAAATGCTC
ATTGATAAAG CACAGCTTTT AACGCTTACG GCACCAGAAA TGACCGTTCT TATTGGTGGA
ATGCGTGTAT TAAACACAAA CGCAGGGCAC ACAACGCATG GTGTGTTTAC TAAGCGCCCT
GAAACATTAA GCAACGATTT CTTTGTTAAT TTACTCGATA TGGGTACCGT ATGGAAAGCA
ACATCCGAAG CAAGTGATAT TTTTGAGGGA CGCAATCGCT CTACAGGTGA ACTGCAATGG
ACAGCCACAC GCGTTGATTT AGTGTTTGGT TCAAATTCGC AACTCCGTGC ATTAGTAGAA
GTGTATGGCT GCAAGGATTC TCAAGAGAAG TTTTTGAATG ACTTTATAGC CGCATGGAAT
AAGGTGATGA ATCTTGATCG TTTTGATCTT TCGGGTTTAT AA
 
Protein sequence
MNEERKCPIT GATHKPSAEK GRSNHDWWPN QLNLKILHQH SALSNPMDKD FNYAEEFKKL 
DLAAIKQDLY ALMTDSQEWW PADYGHYGPL FIRMAWHSAG TYRTSDGRGG AGTGSQRFAP
LNSWPDNANL DKARRLLWPI KQKYGRQISW ADLMILTGNC ALESMGLKTF GFAGGREDIW
EPEEDIYWGT EGEWLADKRY SGERELEKPL AAVQMGLIYV NPEGPNGKPD PLAAAKDIRE
TFARMAMNDE ETVALIAGGH TFGKTHGAGD ASQVGPEPEA AGIEEQGLGW KNQYGTGKGK
DTITSGLEVI WTTTPTKWSN NFFWNLFGYE WELTKSPAGA YQWTPKYGVG ANTVPDAHDP
SKRHAPAMMT TDLALRFDPD YEKIARRYYE NPDQFADAFA RAWFKLTHRD MGPRSRYRGA
EVPVEELIWQ DTIPALDHEL IGADEIAALK ATILASELSI AQLISTAWAS AATFRNSDKR
GGANGARLRL APQKDWEVNQ PDELQKVLQV LETIQTEFNA SRNDGKKVSL ADLIVLGGCA
AIEAAAEKAG YKVTVPFTPG RMDATQEETD AHSFAVLEPV ADGFRNYLKA KYSFSVEEML
IDKAQLLTLT APEMTVLIGG MRVLNTNAGH TTHGVFTKRP ETLSNDFFVN LLDMGTVWKA
TSEASDIFEG RNRSTGELQW TATRVDLVFG SNSQLRALVE VYGCKDSQEK FLNDFIAAWN
KVMNLDRFDL SGL