Gene Information |
Locus tag | Tery_0513 |
Symbol | |
ID | 4242361 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 816445 |
End bp | 817824 |
Gene Length | 1380 bp |
Protein Length | 459 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 638105827 |
Product | photosystem II 44 kDa subunit reaction center protein |
Protein accession | YP_720441 |
Protein GI | 113474380 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01153] photosystem II 44 kDa subunit reaction center protein (also called P6 protein, CP43), bacterial and chloroplast; [TIGR03041] chlorophyll a/b binding light-harvesting protein |
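
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


# Values taken from the record for Tery_0513.
length = gene_length(816445, 817824)
print(length)                  # 1380, matches "Gene Length"
print(protein_length(length))  # 459, matches "Protein Length"
```

Under these assumptions the reported Gene Length (1380 bp) and Protein Length (459 aa) agree with the Start bp and End bp values.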
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.50837 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTAACGC TGTCTAATTC TATAATAGGG GGTCGTGACC AACAATCCAC CGGTTTTGCT TGGTGGTCTG GAAACGCCCG TCTAATCAAT CTATCTGGTA AACTGCTTGG TGCTCACGTA GCCCACGCAG GTTTAATTGT ATTTTGGGCC GGAGCAATGA CTTTGTTTGA AGTTGCTCAC TTTGTCCCAG AAAAGCCAAT GTATGAACAA GGCTTAATTC TAATGCCCCA CGTTGCCACT ATAGGCTGGG GTGTTGGTCC TGGTGGTGAA ATAGTTGACA TTTTTCCATT CTTTGTAGTA GGCGTTTTAC ACCTAATTTC ATCTGCTGTT CTGGGTTTGG GCGGAATTTA TCATGCCGTT CGTGGCCCAG AAACTTTGGA AGACTATTCT TCTTTCTTTG GATATGACTG GAAAGATAAG AACCAGATGA CTAATATTAT TGGCTACCAC TTAATTATAT TAGGTTTGGG CGCATTCTTA TTAGTAATCA AAGCTGTGTT CTTGGGTGGT GTCTATGATA CTTGGGCACC AGGTGGCGGT GATGTACGAG TAATTACTAA CCCTACTTTG AATCCTGCTG TTATATTTGG TTATCTACTT AAAGCGCCTT TTGGTGGTGA AGGCTGGATC ATTGGCGTTA ATAACATGGA AGATATTATC GGTGGTCATA TCTGGATTGG CCTAATTTGT ATCTCCGGTG GTATTTGGCA TATTCTAACT AAGCCTTTTG GTTGGGCACG TCGCGCTTTC ATCTGGTCTG GAGAAGCTTA TCTATCCTAC AGTTTGGGCG CCCTGTCTTT GATGGGTTTA ATTGCTGCCG CTTTCGTATG GTTTAACAAC ACTGCTTATC CTAGCGAATT CTATGGTCCT ACTAATGCTG AAGCTTCTCA AGCTCAGTCT TTTGTGTTCT TAGTCCGTGA CCAAAAATTA GGTGCTAATA TTGGTTCTGC TCAAGGTCCT ACTGGTCTTG GTAAGTACCT AATGCGCTCT CCCACTGGTG AGATCATATT CGGTGGTGAA ACAATGCGTT TTTGGGACTT TCGTGGTCCT TGGTTAGAGC CTCTTCGTGG TCCTAACGGT TTAGACTTGA GTAAACTGAA GAACGATATT CAGCCTTGGC AAGTTCGTCG TGCTGCTGAG TACATGACTC ATGCTCCTAA TGGTTCTATC AACTCTGTAG GTGGTATTAT TACAGATATT AACGGTTTCA ATTATGTAAA CCCTCGTGCT TGGTTAGCTG CCGCTCACTT TATTCTTGGT TTCTTCTTCT TAATTGGTCA CTTGTGGCAT GCTGGTCGCG CTCGTGCTGC TGAGGGTGGT TTTGAGAAGG GTCTGGACCG TCAAACTGAG CCAGTACTAT CTATGCCTAA CCTTGACTAA
|
Protein sequence | MVTLSNSIIG GRDQQSTGFA WWSGNARLIN LSGKLLGAHV AHAGLIVFWA GAMTLFEVAH FVPEKPMYEQ GLILMPHVAT IGWGVGPGGE IVDIFPFFVV GVLHLISSAV LGLGGIYHAV RGPETLEDYS SFFGYDWKDK NQMTNIIGYH LIILGLGAFL LVIKAVFLGG VYDTWAPGGG DVRVITNPTL NPAVIFGYLL KAPFGGEGWI IGVNNMEDII GGHIWIGLIC ISGGIWHILT KPFGWARRAF IWSGEAYLSY SLGALSLMGL IAAAFVWFNN TAYPSEFYGP TNAEASQAQS FVFLVRDQKL GANIGSAQGP TGLGKYLMRS PTGEIIFGGE TMRFWDFRGP WLEPLRGPNG LDLSKLKNDI QPWQVRRAAE YMTHAPNGSI NSVGGIITDI NGFNYVNPRA WLAAAHFILG FFFLIGHLWH AGRARAAEGG FEKGLDRQTE PVLSMPNLD
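
The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched in a few lines. This is a minimal example under stated assumptions, not the pipeline used to build the record: the helper names (`clean`, `gc_percent`, `translate_cds`) are hypothetical, the codon table is built by hand from the standard codon ordering (translation table 11 uses the same codon-to-amino-acid assignments as the standard code), and the first codon is assumed to be an initiator and is therefore read as methionine, which is how the leading GTG corresponds to M.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon.

    Assumption: the first codon is a start codon, so it is emitted as 'M'
    even when it is an alternative start such as GTG.
    """
    s = clean(seq)
    usable = len(s) - len(s) % 3
    residues = [CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3)]
    if residues:
        residues[0] = "M"
    return "".join(residues).split("*")[0]


# First 30 bases of the gene sequence in this record.
prefix = "GTGGTAACGC TGTCTAATTC TATAATAGGG"
print(translate_cds(prefix))  # MVTLSNSIIG, the first ten residues of the protein
```

Passing the full Gene sequence field to `translate_cds` and `gc_percent` would allow a comparison with the Protein sequence and GC content fields of the record.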