Gene Clim_1952 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_1952 
Symbol 
ID6355007 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp2168642 
End bp2170063 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content53% 
IMG OID642669550 
Productcoproporphyrinogen III oxidase 
Protein accessionYP_001943963 
Protein GI189347434 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases 
TIGRFAM ID[TIGR00538] oxygen-independent coproporphyrinogen III oxidase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0253288 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAACCA ATCTCGCAGT AAAGTACAGC AATCCCGGTC CGCGTTATAC AAGCTATCCC 
ACCATTCCAT CCTGGAGCAG CGACGGCGTA ACCCAGGAGC AGTGGAAAGA GGCTATGGTG
AAAGGTTTCA ACGACAGTAA CGAGACCACC GGCATAAGCA TGTACATCCA TATTCCTTAC
TGCGAGAACT ACTGTTATTT CTGCGGATGC AATGCTCATC GCACTCAGGA TCACTCGTTC
GAGGAACCCT ATCTCGAAGC GCTCCAGAAA GAGTGGCAGA TGTACCTCGA TGTATTTCCC
GGAACTCTCA ATGTAAAGGA ACTGCATATC GGCGGCGGTA CTCCGACCTT TTTCAGCCCG
GAAAACCTCG CCCGTCTTGT CGAGAATCTC TTCAGGACCG TCAATCGCAT GGACAACTAC
ATGTTCAGTT TCGAAACCAA TCCCCGTTCC ACCTCCAAAG AGCACCTTGA AGCGCTCTAC
CGTGTCGGTT TCCGGAGAAT GAGTTTCGGC ATCCAGGACT TCGATCCGGT TGTGCAGCAG
GAGATCAATC GTCCCCAGTC CTTCGAACTG GTCAAGGAGA AGGTTGACCT TGCCCGTCAG
ATCGGCTTTA CCTCTGTGAA CTTCGATCTT GTGTACGGTC TTCCCAAGCA GACGCTTGCC
ACCATAACCG ACACCATCGA CAAGGTCATG CAGCTCCGGC CCGACCGCCT CGCGTTCTAT
GCCTACGGTC ACAACCCTCA CATGTACGAA GGACAGCGCA AATTCAAGGA AGAGGACCTT
CCGGTCGGCG ACGTCAAACA GGAGCTTTAC GACAAGGGGC GTGCCATGCT TGAGTCCATC
GGCTATCATG AGATAGGCAT GGATCATTTC GCGCTTGAGG GCGATTCGCT CTACCTTGCA
GCAAAAGACG GCTCCCTGCA CCGGAACTTC ATGGGCTATA CCGAAAACAC AACGCAGATG
ATGATCGCCC TCGGCGCATC GTCGATCAGC GATACCTGGT ACGCCTTTGC GCAGAACGAA
CGCGGTGATC AGAAATATAT CGAAGAGATC AATAAAGGCC GTTTCCCGAT TCTTCGCGGC
CATCTGCTTA CCGATGAAGA TCTTGTGCTC AGACGCCACA TCCTCAATCT CATGTGCCGT
CAGGAGACGT CATGGGAAGA TCCGAAAATG TACACCGAAG AGCTCGACAT CGCGCGTTAC
CGCCTTGAGG ATATGGAGAA CGACGGTTTG GTGGTGCTTC TGGAAAACGG CGTGCGGGTA
ACCGAAATCG GCATTCCTTT TCTCCGGAAC ATCTGTATGG CTTTCGATGC GCGCCTCTGG
CGTTCCGACA GCCTTTCGAA AGCGTACAAC GTATCCAGGG ATATACAGAA AGAGTATATC
GAAAAGGCAC GTCAGGCCAA AGCCCAGCAG CAGGTCAGTT GA
 
Protein sequence
MATNLAVKYS NPGPRYTSYP TIPSWSSDGV TQEQWKEAMV KGFNDSNETT GISMYIHIPY 
CENYCYFCGC NAHRTQDHSF EEPYLEALQK EWQMYLDVFP GTLNVKELHI GGGTPTFFSP
ENLARLVENL FRTVNRMDNY MFSFETNPRS TSKEHLEALY RVGFRRMSFG IQDFDPVVQQ
EINRPQSFEL VKEKVDLARQ IGFTSVNFDL VYGLPKQTLA TITDTIDKVM QLRPDRLAFY
AYGHNPHMYE GQRKFKEEDL PVGDVKQELY DKGRAMLESI GYHEIGMDHF ALEGDSLYLA
AKDGSLHRNF MGYTENTTQM MIALGASSIS DTWYAFAQNE RGDQKYIEEI NKGRFPILRG
HLLTDEDLVL RRHILNLMCR QETSWEDPKM YTEELDIARY RLEDMENDGL VVLLENGVRV
TEIGIPFLRN ICMAFDARLW RSDSLSKAYN VSRDIQKEYI EKARQAKAQQ QVS