Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_2485 |
Symbol | |
ID | 4245254 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 3828573 |
End bp | 3830141 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 638107567 |
Product | photosystem antenna protein-like |
Protein accession | YP_722166 |
Protein GI | 113476105 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03041] chlorophyll a/b binding light-harvesting protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.90279 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGTAA CTCAATCAAC AAATCCATTA AGGTTTTTAG GATTTTCCAC AGACAATAAT GTTGATGTTA CGAACTATGT TGGTAATTCC CAAGAACCCT ATTCCTGGTG GGCAGGTAAT TTTCGCTTTG TTGACCTATC TGGGAAATTA TTAGGCGCTC ACATTGCCCA TGCCGGCTTA ATTGTACTCT GGGCCGGTGC TATGACTCTA TTTGAGTTAT CCAGATTCGA CCCTAGTTTA CCTATGTACA ACCAGGGGCT AATTTTGCTT CCCCACATGG CTACTCTGGG TTTAGGAGTT GGAGCAAATG GGGAAATTAT CGATACCTAT CCTTATTTTG CTATTGGTGT TGTTCACCTA GTTAGTTCAG CAATTTTAGG AGCTGGTGGG ATTTACCATG CCGTACTTGG ACCTGAGAAA CTCGACGAAA AAGGATTTGG CTATCAGTGG AATGATGGTA ACAAAATGAC TACTATTCTT GGCATTCATC TAGTACTACT AGGGATTGGA GCATTGATGC TGGTAGTCAA AGCCGTTTAT GGTAGTGGTT TGTACGATCC GGCGATCGCC AATGTACGAG TGATTACTGA ACCAACTCTG AATCCACAAA CTATCTTCGG GTACCTGGTA GGTATCACTC CTGATGGCTG GACATTGAAG GGAATGGCTG CAGTCAATTC TTTAGAAGAT GTAGTAGGCG GTCATATATG GATTGGCGTC ATCTGCATTT TAGGAGGTTT GTGGCATATT AATACTGCCC CTACTAAATG GGCTAAAGGG TTATTTGTTT GGTCCGGAGA AGCATATCTT GCCTACAGTC AGGCAGCTCT AGCTTACATG GGCTTTTTCG CAGCTTACTT TGTCTGGGTA AATGATACAG TTTATCCCTC AGTATTTTAT GGTCCTGTGG GAGTGACTAA CGTTGATGGA ACTATTACTC CTCGTACCTG GTTAATGTTG TTCCAACTTA TTTTTGCCTG TCTGCTACTG GCGGGTCACT TTTGGCATGG CCTCAGATCC AGAGCGATCG CTAGTGGATT TGTCTTCAGT AGTCTGAAAT TCAATCCCGG TGCTCTCTAT GGAGACGCTC AATTCAATAA CGAATCCCTA GTTACAGGTA TTGTGCAGCC GTACCAAAAC AACCCCCAGT TGGGTAACCT GGCAACCCCA ATTAACTCCA GTCAGCTAAC TCTAACTTGG GTCAGAAACT TGCCAATTTA TCGCAATGGT TTATCTCCCA TTGCACGAGG GTTAGAAATT GGCATGGCTC ATGGATACCT GTTATTAGGA CCGTTCTTGA AACTCGGACC ATTGCGCAAT ACTGACCAAG CTCTTTTAGC TGGAGGTGGT AGTGCATCAG GATTAGTAGT AATTTTAAGT ATCTGTCTGT TCGTTTATGG GATGGCAGTG TTTCAAGGTT CAAGTAAGCC AGTGGGTGTT TTACCAGAAA ACCTACAAAC TTATGGGCAT TGGAGCATGT TTACATCTGG TTTCTTGATA GGCGGAATAG GTGGTGTAAT TTTTGCCTGC TTTATTCTGT TAGAAATTGG CCGAGCAGGA ATTGTTTGA
|
Protein sequence | MSVTQSTNPL RFLGFSTDNN VDVTNYVGNS QEPYSWWAGN FRFVDLSGKL LGAHIAHAGL IVLWAGAMTL FELSRFDPSL PMYNQGLILL PHMATLGLGV GANGEIIDTY PYFAIGVVHL VSSAILGAGG IYHAVLGPEK LDEKGFGYQW NDGNKMTTIL GIHLVLLGIG ALMLVVKAVY GSGLYDPAIA NVRVITEPTL NPQTIFGYLV GITPDGWTLK GMAAVNSLED VVGGHIWIGV ICILGGLWHI NTAPTKWAKG LFVWSGEAYL AYSQAALAYM GFFAAYFVWV NDTVYPSVFY GPVGVTNVDG TITPRTWLML FQLIFACLLL AGHFWHGLRS RAIASGFVFS SLKFNPGALY GDAQFNNESL VTGIVQPYQN NPQLGNLATP INSSQLTLTW VRNLPIYRNG LSPIARGLEI GMAHGYLLLG PFLKLGPLRN TDQALLAGGG SASGLVVILS ICLFVYGMAV FQGSSKPVGV LPENLQTYGH WSMFTSGFLI GGIGGVIFAC FILLEIGRAG IV
|
| |