Gene Information |
Locus tag | Tery_4346 |
Symbol | |
ID | 4245998 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 6699593 |
End bp | 6700990 |
Gene Length | 1398 bp |
Protein Length | 465 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 638109233 |
Product | pheophorbide a oxygenase |
Protein accession | YP_723811 |
Protein GI | 113477750 |
COG category | [P] Inorganic ion transport and metabolism; [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
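The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is a minimal illustration, assuming that Start bp and End bp are 1-based inclusive positions and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 6699593
END_BP = 6700990


def gene_length(start_bp, end_bp):
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_bp):
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1398
print(protein_length(length_bp))  # 465
```

Both results agree with the Gene Length (1398 bp) and Protein Length (465 aa) rows of the record.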
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.18517 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGATTGCTA ATACCATCAA AAAATCCGAC AATATTCCTG TAAGTGCAGG TGGAACAGAC CCAGAATATT TTGATTGGCA AGAAGTTTGG TATCCAATTT ATTATATAGA AGACTTAGAG AAAAATAAAC CTACAACATT TACATTATTA GAACAAGATC TTGTGATTTG GTGGGAAGAA AAAAACAATC AATGGCGGGT ATTTGAAGAT CAATGTCCCC ATCGTTTAGC ACCCCTTTCT CAAGGTAGAA TTAATGAGGC TGGATGCTTA GAATGCCCCT ATCATGGTTG GGCATTTTCA GGGGGAGGAA ATTGTGAAAT TATTCCTCAA CAAATTGCAG GAGGAAAGGC AGAAAAATCA TCAAGAGCAA AAGTTAAATC TCTACCTACA AAAGTTTGTC AAGGTCTTTT ATTTGTCTAT GCAGGAAAAA TCGAAAATGC TGCTCAAACA CCTATTCCTA AAGTTGATGT TTTAGATGAA AATTCTAATG AATGGATTTG TTTAAATACT TTTCGAGATG TACCTTATGA TGGTTTAACA TTAATGGAAA ATGTCTTAGA TGCTAGTCAT ATTCCTTATA CTCATCATCG TACTGTGGGA AACCGCGCTA ATGTTGCTCC TGTAGAATTA GAAGTTTTAG AATCTGGAAA ATGGGGTTTT AAAGGGGTCT GGGAAGAAGG TCCAAGAAAA GGAACTTTAG GAAGACAAGA GACTACTTTT ATTGCTCCTG GAATGATGTG GCATGACCTT ACTTCTAAAC AATTTGGTCG TACATTAACA GTTGTATATG CAACACCTAT TCGTAAAGGA GAATGTCGTT TATTTGCCCG TTTCCCTTTT AAGTTTTCTT CTCCCTTACC GAAATTTTTT ATTCAGTTAA GCCCCCGTTG GTATTCTCAT ATTGGGCAAA ATGGAGTATT GGAAGATGAC CAAATATTTT TACATTATCA AGAACGATAT TTAGAAGCAA AGGGTGGTAG TGCTAATTTT TCTAAGGCTT TTTATTTACC TACAAAAGCA GATTTATTTG TATTTGAATT GCGTTCTTGG GTGAATAAAT ATAATGCTCA ATTATTTCCT AATGCAACTT TAAGTTCTGC TCTGAACTCA GAAATATTGT TAGATAGGTA TCATTCTCAT ACTAAAAAAT GTAGTAGTTG TCGCAGAGCT TTAAAAAATT TACAACGAAT AAAGGTTGGA GTCGTGCTTG TGACTTCATT TATCTGGGCA AGTATTTTTT TTATGCTGTT AATATTAGAT GATTTTAATA TGACTTTGAT GACTTTTTTG ATTTTAAGTT TGCCTGTTGG AGTTGTTTTT TGGTTACTGT TAAGTAAATT AGAAAAACAG TTTTATCAAG GACGAGAAAT ACCTCCAAGA AATTTATTAA ATAATTGA
Protein sequence | MIANTIKKSD NIPVSAGGTD PEYFDWQEVW YPIYYIEDLE KNKPTTFTLL EQDLVIWWEE KNNQWRVFED QCPHRLAPLS QGRINEAGCL ECPYHGWAFS GGGNCEIIPQ QIAGGKAEKS SRAKVKSLPT KVCQGLLFVY AGKIENAAQT PIPKVDVLDE NSNEWICLNT FRDVPYDGLT LMENVLDASH IPYTHHRTVG NRANVAPVEL EVLESGKWGF KGVWEEGPRK GTLGRQETTF IAPGMMWHDL TSKQFGRTLT VVYATPIRKG ECRLFARFPF KFSSPLPKFF IQLSPRWYSH IGQNGVLEDD QIFLHYQERY LEAKGGSANF SKAFYLPTKA DLFVFELRSW VNKYNAQLFP NATLSSALNS EILLDRYHSH TKKCSSCRRA LKNLQRIKVG VVLVTSFIWA SIFFMLLILD DFNMTLMTFL ILSLPVGVVF WLLLSKLEKQ FYQGREIPPR NLLNN
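The sequences above are printed in space-separated blocks of ten characters, and the gene sequence is given in coding orientation (it starts with ATG and ends with TGA). The sketch below shows one way to strip the spacing, estimate GC content, and translate the gene. It is a minimal illustration under two assumptions: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the amino acid assignments of translation table 11 are taken to be those of the standard code, from which table 11 differs only in its permitted start codons. For brevity, only the first 30 bases of the gene are used as sample input.

```python
# Compact codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, or T.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence above.
sample = "ATGATTGCTA ATACCATCAA AAAATCCGAC"
print(translate(sample))  # MIANTIKKSD
```

The output matches the first block of the protein sequence. Passing the full gene sequence to the same functions should allow the Protein sequence and GC content rows to be checked in the same way.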