Gene Synpcc7942_2382 details

Gene Information

Locus tag: Synpcc7942_2382
Symbol:
ID: 3774666
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Synechococcus elongatus PCC 7942
Kingdom: Bacteria
Replicon accession: NC_007604
Strand:
Start bp: 2448879
End bp: 2450069
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 58%
IMG OID: 637800830
Product: coproporphyrinogen III oxidase
Protein accession: YP_401399
Protein GI: 81301191
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases
TIGRFAM ID: [TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase
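
The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal consistency check, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2448879
END_BP = 2450069
GENE_LENGTH_BP = 1191
PROTEIN_LENGTH_AA = 396


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the span 2448879 to 2450069 covers 1191 bp, which is 397 codons, or 396 amino acids once the stop codon is excluded.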


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.0342073
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCAATT TGCCGATTGC TGCTTACCTA CACATCCCCT TTTGTCGTCG TCGCTGCTTT 
TACTGTGACT TCCCGATTAG CGTCGTGGGC GATCGCCTGC GCGGGGATCA GTCCGGCATG
ATCCGCCAGT ATGTGGACGC GATCTGCCAA GAAATTGCTG CGACGCCTGT GCTTGGGCCA
GCACTGCAAA CGGTCTTTTT CGGTGGGGGA ACGCCCTCCC TGCTGAGTCC CAATCAGCTC
GATCGCATTC TGCAGACCCT CGATCGCCGT TTTGGGATTG CAGCGACGGC TGAAATTTCA
ATCGAAATGG ATCCCGCTAG CTTTGACCGA TCGCAAGCGA TCGCAACGCG ACAGCTAGGC
TTTAACCGCG TCAGTATTGG CGCTCAGGCG TTTCAAGATG GCCTGTTGGA ACGGTGTGGT
CGTACCCATC GGCGGGCCGA TATCGATCGC GCTGTGGCGG ATTTGCGGGC TGCTGGTTTC
GAGAATCTCA GTCTGGATTT GATCTCGGGT CTACCGGAAC AGACGCTGGC GGATTGGCAG
GCTTCATTAG AAGCCGCGAT CGCCCTAGAA CCGACTCACC TCTCGGCCTA CGATCTGGTG
CTAGAACCAG AGACGGTGTT TGGCAAGCGC TATCAACCGG GCGATCGCCC CTTGCCAGCC
GATGAGCAAA CCGCTGTCAT GTATCGACTG GCGCATAGGA CCTTGGAAGC GGCGGGCTTT
GAGCACTATG AAATCTCGAA CTATGCGCGA TCGGGATTTC AGTGTCGCCA CAATCGCGTC
TACTGGCAGG ATCAGTCCTT CTATGGCTTT GGGGTCGGTG CGACGAGTGC CCTGCAAGGC
CAGCGGTTTG GCCGTCCGCG CCGCCGAGCT GATTACTTTA TCTGGCTCGA GACGCCGGGT
GCGATCGCGG CAGCTTGTGC ACCGGTTACC TCTGATCCGG CTGATGCCTT AGCGGAAACG
CTGATGTTGG GGCTGCGGCT GGCGGAAGGG TTGGATTGGA CCGCATTAGA GCAGCCGTTT
GGGGCAGAAA TCCTGCGATC GCTGCAACCT GTGATTCAGC GCTACCAACA GGCGGGCTGG
TTGCAATGGC AGGGCGATCG CCTCAGCTTG ACCCAGCCGG AGGGCATGCT CTTCTCGAAT
CAGGTCTTAG CCAGTTTGTT TGAACGACTC GAGGCGATCG CGACTGTCTA G
 
Protein sequence
MINLPIAAYL HIPFCRRRCF YCDFPISVVG DRLRGDQSGM IRQYVDAICQ EIAATPVLGP 
ALQTVFFGGG TPSLLSPNQL DRILQTLDRR FGIAATAEIS IEMDPASFDR SQAIATRQLG
FNRVSIGAQA FQDGLLERCG RTHRRADIDR AVADLRAAGF ENLSLDLISG LPEQTLADWQ
ASLEAAIALE PTHLSAYDLV LEPETVFGKR YQPGDRPLPA DEQTAVMYRL AHRTLEAAGF
EHYEISNYAR SGFQCRHNRV YWQDQSFYGF GVGATSALQG QRFGRPRRRA DYFIWLETPG
AIAAACAPVT SDPADALAET LMLGLRLAEG LDWTALEQPF GAEILRSLQP VIQRYQQAGW
LQWQGDRLSL TQPEGMLFSN QVLASLFERL EAIATV