Gene Information |
Locus tag | Tery_3793 |
Symbol | |
ID | 4243741 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 5828485 |
End bp | 5829816 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 638108728 |
Product | pheophorbide a oxygenase |
Protein accession | YP_723312 |
Protein GI | 113477251 |
COG category | [P] Inorganic ion transport and metabolism; [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
|
|
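The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 5828485
end_bp = 5829816
reported_gene_length = 1332    # bp
reported_protein_length = 443  # aa

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which encodes no amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints "1332 0 443"

# Both derived values agree with the reported fields.
assert gene_length == reported_gene_length
assert protein_length == reported_protein_length
```

A remainder of 0 means the gene length is a whole number of codons, which is consistent with the "Is gene spliced | No" and "Is pseudo gene | No" fields.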
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.726 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAACCG AGTTTAACTT TTTTCAGCAT TGGTATCCCC TTTCTCCTGT TGAAGACCTT GACCCAGAAA TTCCTGTTTC TATAACTTTG TTGGGAATGG GTTTAGTAAT TTGGAAGCCA AGATATTCTC AGAATTACCA GGTATTTTTA GATGTTTGTC CTCATCGTCT CGCACCTTTA AGCGAGGGTC GTGTGGATGA AAAAACTGGG AATTTAATGT GTAGTTATCA TGGTTGGCAA TTTAATTCTG AAGGTATCTG TACTGATATT CCCCAAGCTG AAAATCAGGA AATTATAGAA AAAAATCGGC AAAATTTCTG TGTAACTTCG TTGCCAGTAC GTCAAGAAAA TGATTTACTA TGGGTTTGGC CTGATAGGAA AACAGCAGAC TTAGCTGCTA GTACACCCCT ACCATTATCA CCACAAATAG ATGCTGAGAA AGGTTTTGTG TGGTCTTCTT ATGTTCGCGA TCTAGAATAT GATTGGGAAA CTCTAGTAGA AAATGTCGCA GATCCGAGTC ATGTTCCTTT TTCTCATCAC GGAGTACAAG GTAATAGAAA CAAAGCTAGA CCAATAGTTA TGGAGATTTT ACAATCAACG ATCAATGTAA TTAAAGTTTC TATTTCGAGA GGTTTACCTA CAATAATTAC TTTTGAACCA CCTTGTCGTT TAGAATATGA AATTAGTATT GGTAAGGATG GCAAAAAAAT AGGACTAGTT ACTTATTGTT TGCCAGTATC TCCTGGAAAA TCTAGAATTG TGGCTCAATT TCCTAGGAAC TTTGCGAAAA TAATTCATCG GTTGACACCC CGTTGGTGGG ACCACATTCA AAATCGCAAT TTGGTTCTTG ATGGAGATAT GGTTTTACTA AATCAGCAAG AGTATTTACT GCAGCAAAAA CAATCAAGTA AAAGTTGGAA AACAGCTTAT CAATTACCTA CAAGTGCAGA CCGTTTAGTA ATTGAGTTTA GAAATTGGTT TGATAAATAT TCTCATGGTC AATTACCTTG GAGTGAAGTT GGAGTTAGTG ATTCAAATAA TTCTTTAGTA AATAATAATC ATGGCTCACT ATTAAATCGT TATCAACAAC ATACTCAGCA TTGTAGTAGT TGTCGGCAGG CATTAAAAAA TCTAAAAAGG TTACAAATTT TCCTGATAAC TTATTTCGTA GTTACTGTTT CTGGAGTTGC TATATTGCCT GATAATTTAC GGATTAAAAT AGGTTTACCT TTAATAATTA CAGCAGTTTT AGGAATGGGA ATTTATGCTG GATTGCAATT TTGGTTGATT CCTCAATTTT ACTTTGTTGA TTATATCCAT GCTCGAAAAT GA
|
Protein sequence | MQTEFNFFQH WYPLSPVEDL DPEIPVSITL LGMGLVIWKP RYSQNYQVFL DVCPHRLAPL SEGRVDEKTG NLMCSYHGWQ FNSEGICTDI PQAENQEIIE KNRQNFCVTS LPVRQENDLL WVWPDRKTAD LAASTPLPLS PQIDAEKGFV WSSYVRDLEY DWETLVENVA DPSHVPFSHH GVQGNRNKAR PIVMEILQST INVIKVSISR GLPTIITFEP PCRLEYEISI GKDGKKIGLV TYCLPVSPGK SRIVAQFPRN FAKIIHRLTP RWWDHIQNRN LVLDGDMVLL NQQEYLLQQK QSSKSWKTAY QLPTSADRLV IEFRNWFDKY SHGQLPWSEV GVSDSNNSLV NNNHGSLLNR YQQHTQHCSS CRQALKNLKR LQIFLITYFV VTVSGVAILP DNLRIKIGLP LIITAVLGMG IYAGLQFWLI PQFYFVDYIH ARK
|
| |
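The sequences above are printed in blocks of ten characters separated by spaces. The sketch below shows one way to strip that formatting, compute GC content, and translate the coding sequence. It assumes the amino acid assignments of translation table 11, which match the standard genetic code; the alternative start codons that table 11 allows are not handled, which does not matter here because the gene begins with ATG. The helper names `clean`, `gc_content`, and `translate` are hypothetical, and only the first 30 bases of the gene are used as input to keep the example short.

```python
# Codon table built from the conventional TCAG ordering.
# Assumption: table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between ten-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
gene_prefix = "ATGCAAACCG AGTTTAACTT TTTTCAGCAT"
print(translate(gene_prefix))  # prints "MQTEFNFFQH"
```

The result matches the first block of the protein sequence in the record. Passing the full gene sequence to `gc_content` and `translate` would allow the same comparison against the GC content and protein sequence fields.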