Gene Tery_1194 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1194 
Symbol 
ID4242537 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp1867440 
End bp1868705 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content37% 
IMG OID638106412 
Productcoproporphyrinogen III oxidase 
Protein accessionYP_721024 
Protein GI113474963 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases 
TIGRFAM ID[TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0134211 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAAAGGC CAGTCGGAAA TTATTGTATT CCCAGTTCGG CTTATGTCCA TATCCCTTTC 
TGCCGACGGC GATGCTATTA CTGTGATTTC CCAATCTCTG TAGTAGGTGA TGGTAAGAAA
GGTGATAATT TTTCTCCAAT TCAAGAATAT GTAGAGGTAA TTTGTCAGGA AATAAGCACT
ACAAAGTCTT TTGATCAACC CTTAAAAACA ATTTTTTTTG GTGGTGGTAC TCCTTCCCTA
TTGTCAGTGG GTCAATTAAG TCGGATTTTA GATGCCCTAG AACAAAAATT TGGAATTGTG
GCCAATGCTG AAATTTCTAT AGAAATGGAC CCCGGAACTT TTGACTTAGA ACAAGTGCAA
GGATATAAAT TCCTAGGAGT AAATCGAGTC AGCCTTGGAG TACAAGCATT TCAAGATGAT
TTATTACAGG TTTGTGGGCG ATTACACAAT GTCTCAGATA TTTACAAAGC AGTAAATACA
TTGCATCAAG CAGGAATTAT TAACTTCAGT ATTGATTTAA TTTCAGGACT GCCCCACCAA
ACTTTAGAAC AATGGCAAAT TTCTTTGTTA AGTGGAATAG CTATTTCTCC AACTCATATA
TCCAGCTATG ACCTTGTACT AGAAAAAGTT ACAGCTTTTG GACATTACTA TAAACCAGGT
CATGCTCCTT TACCTACAGA TGAAACAGCA GCTGAAATGT ATCGAATTGC ACAGCAACTG
ATATCAATTT CGGGATATGA ACATTATGAG ATATCTAATT ATGCCAAGCA AGGCTATCAG
TGTAGCCACA ATCGAGTTTA CTGGGAAAAT CATCCTTATT ATGGCTTTGG CATGGGTGCA
GCCAGCTATT TAGAAGGACA AAGATTTACC AGGCCGCGTA CGCGGAAAAA ATATTATCAA
TGGGTGCTAT CGTTTCAAGA TCATAGTTTA GAAAGTCAGG GTATTAATGC ATCAAATCAG
GATTTTTTGT TAGAAACACT GATGTTAGGA TTTCGCTTGG CACAAGGGAT AAATGTCTTA
ACATTATCCC AGCAGTTTGG TCAAAAAACT GTAGAAAAAT TATTAATCTA TCTACAACCC
TATCAAAAGT TAGGGTGGGT AGAGTTTATC AATCAAAAAG GTGTAGCAAC CCCTTTCTCT
GATAACCAGA AACTTCCGAT AGAGGGACAT CTGAGATTGA CTGATCCTGA AGGTTTTTTG
TTTTCTAATA CTGTTTTATC AACATTATTT AGTAAGATTA GTAATTCTAT CAGTATGAAA
TGCTAA
 
Protein sequence
MKRPVGNYCI PSSAYVHIPF CRRRCYYCDF PISVVGDGKK GDNFSPIQEY VEVICQEIST 
TKSFDQPLKT IFFGGGTPSL LSVGQLSRIL DALEQKFGIV ANAEISIEMD PGTFDLEQVQ
GYKFLGVNRV SLGVQAFQDD LLQVCGRLHN VSDIYKAVNT LHQAGIINFS IDLISGLPHQ
TLEQWQISLL SGIAISPTHI SSYDLVLEKV TAFGHYYKPG HAPLPTDETA AEMYRIAQQL
ISISGYEHYE ISNYAKQGYQ CSHNRVYWEN HPYYGFGMGA ASYLEGQRFT RPRTRKKYYQ
WVLSFQDHSL ESQGINASNQ DFLLETLMLG FRLAQGINVL TLSQQFGQKT VEKLLIYLQP
YQKLGWVEFI NQKGVATPFS DNQKLPIEGH LRLTDPEGFL FSNTVLSTLF SKISNSISMK
C