Gene Information |
Locus tag | Tery_3212 |
Symbol | |
ID | 4243807 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 4910934 |
End bp | 4912403 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 638108213 |
Product | carotenoid oxygenase |
Protein accession | YP_722804 |
Protein GI | 113476743 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes |
TIGRFAM ID | |
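The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS is unspliced (as the record states). The function name `feature_lengths` is hypothetical and used only for illustration.

```python
def feature_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based, inclusive coordinates and a single terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range, hence the +1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the stop codon encodes no residue
    return gene_len, protein_len


# Values taken from the Start bp / End bp fields of this record.
gene_len, protein_len = feature_lengths(4910934, 4912403)
print(gene_len, protein_len)  # 1470 489
```

The result matches the Gene Length (1470 bp) and Protein Length (489 aa) fields.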
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.415925 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.375816 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACAAATT TACAAATTCA ACAACCTAAA TCTTATACTA GTAAAGATTG GCAACAAGGA TATAAATCTC AACCACAAGA ATATAACTAT TGGATTGATG ATATAGAAGG TGAAATACCA GAAGATTTAA ACGGTACTTT TTTCCGTAAC GGACCAGGTT TATTAGACAT TAATGGTCAA CTTATTGCTC ATCCTTTTGA CGGAGATGGA ATGGTTTGTG CAATTAGTTT TAAAAACCGT CGCGCCCACT TCCAAAATAG ATTTGTTAGA ACAGAAGGTT ATGTGGCAGA AAAAGCGGCA GGAAAAATTC TCTATAGAGG TGTTTTTGGT ACTCAAAAAA CAGGTGGTTG GTTAGCTAAT CTTTTTGATT TTAAACTTAA AAATATTGCC AATACTGGTA TTATTTATTG GGGTGATAAA CTTTTGGCAT TGTGGGAAGG AGGGCAACCT CATCGTTTAA ATCCCCAGAA TTTAGAAACC ATTGGCCTTA ACGATTTAGA TGGGCTTTTA CAACCAGGTC AAGCTTTTTC TGCTCATCCC AGAATTGATA AAGGAAAGGA TGGAAAAGGA GATGTTTTAG TTAATTTTTC TGTCAAACCT GGTTTATCAA GTACCATTAC TATTTTTGAA TTTAATAGTC AGGGAAAATT ACTCAAACGT TACTCTAATT CTATTCCTGG TTTTGCCTTT TTACACGATA TGGTAATTAC ACCAAATTAC TGTATTTTTT TTCAAAATCC TGTTGCTTTT AATCCTTTTC CTTTATTACT AGGGTTACGA ACTCCAGGTC AATGTTTAGA GTTTTTACCT AATAATTCAA CACAAGTTAT TTTAATTCCT CGTGATGGTA GTAAAGCTAT AAAAATTTTG AAAACGAAAC CTTGTTTTGT ATTTCATCAT GCTAATGCTT GGGAAAAGGA CGGGGAAATT TATGTAGATT CTATTTGTTA TGAATCTGTC TCACAAACTG ACCTAGGTGA TAATTTTCTG GAGGTGGATT TTGACTCAAT GACAGAAGGT AAGTTATGGC GATTTAAGAT TAATTTATCA GAGAATAATG TGGAACATAA ATTGCTTGAA AGTCGTTGTT GTGAGTTTCC GACTTTAAAT CCGAATAATG TAGGAAAAGC TTATCGATAT TTATTTATTG GAGCAGCAGA TAAGCCTAGT GGAAATGCTC CTTTACAAGC AATATTAAAA ATTGATTTGC ATACAGGAAA ACGTCAAACT TTTAGTGTCG CACCGCGAGG TTTTGCAGGA GAACCTTTAT TTGTTCCTTT TCCAAATGGG GTGAATGAGG ATGATGGTTG GTTATTAATG TTGATGTATG ATGCAGCAGA ACATCGGTCG GATATTGTGA TTTTGGATGC TCGTGATTTG AATAAAAAAC CTGTGGCAAG ATTACATTTA AAGCATCATA TTCCTTATGG TTTACATGGT AGTTTTACCC CTAATTATTT TCAAGAGTAA
Protein sequence | MTNLQIQQPK SYTSKDWQQG YKSQPQEYNY WIDDIEGEIP EDLNGTFFRN GPGLLDINGQ LIAHPFDGDG MVCAISFKNR RAHFQNRFVR TEGYVAEKAA GKILYRGVFG TQKTGGWLAN LFDFKLKNIA NTGIIYWGDK LLALWEGGQP HRLNPQNLET IGLNDLDGLL QPGQAFSAHP RIDKGKDGKG DVLVNFSVKP GLSSTITIFE FNSQGKLLKR YSNSIPGFAF LHDMVITPNY CIFFQNPVAF NPFPLLLGLR TPGQCLEFLP NNSTQVILIP RDGSKAIKIL KTKPCFVFHH ANAWEKDGEI YVDSICYESV SQTDLGDNFL EVDFDSMTEG KLWRFKINLS ENNVEHKLLE SRCCEFPTLN PNNVGKAYRY LFIGAADKPS GNAPLQAILK IDLHTGKRQT FSVAPRGFAG EPLFVPFPNG VNEDDGWLLM LMYDAAEHRS DIVILDARDL NKKPVARLHL KHHIPYGLHG SFTPNYFQE
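The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch under stated assumptions, not the tool that produced the record: it uses the amino acid assignments of NCBI translation table 11 (which are the same as the standard code), ignores the table's alternative start codons because this gene begins with ATG, and stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G at each position), paired with
# the amino acid string for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon terminates translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGACAAATT TACAAATTCA ACAACCTAAA"))  # MTNLQIQQPK
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence in the same way should yield the complete protein.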