Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_2601 |
Symbol | |
ID | 7103593 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | + |
Start bp | 2692767 |
End bp | 2694011 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 643475640 |
Product | coproporphyrinogen III oxidase |
Protein accession | YP_002372761 |
Protein GI | 218247390 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases |
TIGRFAM ID | [TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTCAAC TTTTCCCTAA CATTGGCTTC AATCAATTGC CTAAATCAGC TTATATTCAT ATTCCTTTTT GTCGTCGTCG TTGCTATTAT TGTGATTTTC CTATTTCCGT TTTAGGAGAT AAAACAGATA TTTATACTGC CTCTTCTATT ACCCATTATG TAGAGGTTTT ATGTCAAGAA ATTCTGGTTA CTCCGTTTCA AGGGAATGGG TTAGAAACCG TCTTTTTTGG CGGAGGAACT CCCTCATTAT TACCTCCTAA CCATCTTGAA ACTATTCTGA GTACCTTAGA TCAAAAATTT GGGATCTTTG CTAATGCCGA AATTTCTCTA GAAATAGACC CTGCAACGTT TACACTAGAA CAATTACAAC GTTATCATGA CTTAGGGATC AATCGAGTTA GTTTAGGGGG ACAAGCATTT CAGGACAATT TATTAGAAAC TTCTGGACGA TTACACCGAG TTAAAGATAT TTTTGAAGCC ATTGACTATA TTCATAAAGT AGGGATAAAC AATTTTAGTC TAGATTTAAT TTCTGGATTG CCTAATCAAA GTATAGAAGA TTGGATTTTT TCCTTAGAAA CTGCTATTAA AATCTCTCCA AGTCATCTTT CTTGTTATGA TTTAGTTCTT GAACCAGTAA CTGCATTTGG CAAGCAATAT AAACCTGGAA AAAAACCTTT ACCAAGCGAT GAAATAACCG CTAAAATGTA TCGAATATCT CAAAAAATAC TGACGGATGC AGGGTATGAC CATTATGAAA TTTCTAATTA TGCAAAACCG GGTTATCAAT GTCTACATAA TCGAGTCTAT TGGGAAAATA AACCCTACTA TGGTTTTGGA ATGGGAGCAG CTAGTTATAC CAATCAACAA CGATTTACTC GACCTCGTAC TCGAAAAGAT TATTATGAAT GGGTCAATAA GTTATCTAAA ACACAAGGAT TAATCGATTG TGAAGTATCA TCAAAAACAG ATTTTTTGCT AGAAACATTA ATGCTTGGTT TGCGCCTTAA AGAAGGGATT AAATTATCAT TTATTAGCGA TGTTTTTGGC AAGCAAATCT CTCAAAAAAT TTTAGATATT TTAGCACCTT TTATTAAGCA AAGTTGGATA GAGTTACCTC CTAACAGTAG TTTAATTCAC TTAGAGAATA TTGATCGTAT TTCTTTAAGT GATCCCGAAG GATTTTTATT CTCTAATACC ATTTTATCGA CTTTATTTGA GAAATTAGAA GAATGTGGGA ACTAA
|
Protein sequence | MTQLFPNIGF NQLPKSAYIH IPFCRRRCYY CDFPISVLGD KTDIYTASSI THYVEVLCQE ILVTPFQGNG LETVFFGGGT PSLLPPNHLE TILSTLDQKF GIFANAEISL EIDPATFTLE QLQRYHDLGI NRVSLGGQAF QDNLLETSGR LHRVKDIFEA IDYIHKVGIN NFSLDLISGL PNQSIEDWIF SLETAIKISP SHLSCYDLVL EPVTAFGKQY KPGKKPLPSD EITAKMYRIS QKILTDAGYD HYEISNYAKP GYQCLHNRVY WENKPYYGFG MGAASYTNQQ RFTRPRTRKD YYEWVNKLSK TQGLIDCEVS SKTDFLLETL MLGLRLKEGI KLSFISDVFG KQISQKILDI LAPFIKQSWI ELPPNSSLIH LENIDRISLS DPEGFLFSNT ILSTLFEKLE ECGN
|
| |