Gene Information |
Locus tag | Tery_1194 |
Symbol | |
ID | 4242537 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 1867440 |
End bp | 1868705 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 638106412 |
Product | coproporphyrinogen III oxidase |
Protein accession | YP_721024 |
Protein GI | 113474963 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases |
TIGRFAM ID | [TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase |
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0134211 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGAAAAGGC CAGTCGGAAA TTATTGTATT CCCAGTTCGG CTTATGTCCA TATCCCTTTC TGCCGACGGC GATGCTATTA CTGTGATTTC CCAATCTCTG TAGTAGGTGA TGGTAAGAAA GGTGATAATT TTTCTCCAAT TCAAGAATAT GTAGAGGTAA TTTGTCAGGA AATAAGCACT ACAAAGTCTT TTGATCAACC CTTAAAAACA ATTTTTTTTG GTGGTGGTAC TCCTTCCCTA TTGTCAGTGG GTCAATTAAG TCGGATTTTA GATGCCCTAG AACAAAAATT TGGAATTGTG GCCAATGCTG AAATTTCTAT AGAAATGGAC CCCGGAACTT TTGACTTAGA ACAAGTGCAA GGATATAAAT TCCTAGGAGT AAATCGAGTC AGCCTTGGAG TACAAGCATT TCAAGATGAT TTATTACAGG TTTGTGGGCG ATTACACAAT GTCTCAGATA TTTACAAAGC AGTAAATACA TTGCATCAAG CAGGAATTAT TAACTTCAGT ATTGATTTAA TTTCAGGACT GCCCCACCAA ACTTTAGAAC AATGGCAAAT TTCTTTGTTA AGTGGAATAG CTATTTCTCC AACTCATATA TCCAGCTATG ACCTTGTACT AGAAAAAGTT ACAGCTTTTG GACATTACTA TAAACCAGGT CATGCTCCTT TACCTACAGA TGAAACAGCA GCTGAAATGT ATCGAATTGC ACAGCAACTG ATATCAATTT CGGGATATGA ACATTATGAG ATATCTAATT ATGCCAAGCA AGGCTATCAG TGTAGCCACA ATCGAGTTTA CTGGGAAAAT CATCCTTATT ATGGCTTTGG CATGGGTGCA GCCAGCTATT TAGAAGGACA AAGATTTACC AGGCCGCGTA CGCGGAAAAA ATATTATCAA TGGGTGCTAT CGTTTCAAGA TCATAGTTTA GAAAGTCAGG GTATTAATGC ATCAAATCAG GATTTTTTGT TAGAAACACT GATGTTAGGA TTTCGCTTGG CACAAGGGAT AAATGTCTTA ACATTATCCC AGCAGTTTGG TCAAAAAACT GTAGAAAAAT TATTAATCTA TCTACAACCC TATCAAAAGT TAGGGTGGGT AGAGTTTATC AATCAAAAAG GTGTAGCAAC CCCTTTCTCT GATAACCAGA AACTTCCGAT AGAGGGACAT CTGAGATTGA CTGATCCTGA AGGTTTTTTG TTTTCTAATA CTGTTTTATC AACATTATTT AGTAAGATTA GTAATTCTAT CAGTATGAAA TGCTAA
Protein sequence | MKRPVGNYCI PSSAYVHIPF CRRRCYYCDF PISVVGDGKK GDNFSPIQEY VEVICQEIST TKSFDQPLKT IFFGGGTPSL LSVGQLSRIL DALEQKFGIV ANAEISIEMD PGTFDLEQVQ GYKFLGVNRV SLGVQAFQDD LLQVCGRLHN VSDIYKAVNT LHQAGIINFS IDLISGLPHQ TLEQWQISLL SGIAISPTHI SSYDLVLEKV TAFGHYYKPG HAPLPTDETA AEMYRIAQQL ISISGYEHYE ISNYAKQGYQ CSHNRVYWEN HPYYGFGMGA ASYLEGQRFT RPRTRKKYYQ WVLSFQDHSL ESQGINASNQ DFLLETLMLG FRLAQGINVL TLSQQFGQKT VEKLLIYLQP YQKLGWVEFI NQKGVATPFS DNQKLPIEGH LRLTDPEGFL FSNTVLSTLF SKISNSISMK C
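
The length fields in this record can be cross-checked against the coordinates with simple arithmetic. The sketch below is illustrative only and not part of the source database. It assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence ends in TAA, and the protein ends in C, encoded by the preceding TGC). The function names are hypothetical helpers introduced for this example.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1867440
END_BP = 1868705
GENE_LENGTH_BP = 1266
PROTEIN_LENGTH_AA = 421


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; spaces between 10-base blocks are ignored."""
    bases = sequence.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # Both checks pass for the values listed in this record.
    assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    print("coordinates, gene length and protein length are consistent")
```

Passing the Gene sequence string from this record to `gc_percent` gives a value that can be compared with the GC content field. The gene begins with GTG while the protein begins with M: under translation table 11, GTG is an accepted start codon and is read as methionine at the initiation position.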