Gene Tery_0513 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_0513 
Symbol 
ID4242361 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp816445 
End bp817824 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content44% 
IMG OID638105827 
Productphotosystem II 44 kDa subunit reaction center protein 
Protein accessionYP_720441 
Protein GI113474380 
COG category 
COG ID 
TIGRFAM ID[TIGR01153] photosystem II 44 kDa subunit reaction center protein (also called P6 protein, CP43), bacterial and chloroplast
[TIGR03041] chlorophyll a/b binding light-harvesting protein 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.50837 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTAACGC TGTCTAATTC TATAATAGGG GGTCGTGACC AACAATCCAC CGGTTTTGCT 
TGGTGGTCTG GAAACGCCCG TCTAATCAAT CTATCTGGTA AACTGCTTGG TGCTCACGTA
GCCCACGCAG GTTTAATTGT ATTTTGGGCC GGAGCAATGA CTTTGTTTGA AGTTGCTCAC
TTTGTCCCAG AAAAGCCAAT GTATGAACAA GGCTTAATTC TAATGCCCCA CGTTGCCACT
ATAGGCTGGG GTGTTGGTCC TGGTGGTGAA ATAGTTGACA TTTTTCCATT CTTTGTAGTA
GGCGTTTTAC ACCTAATTTC ATCTGCTGTT CTGGGTTTGG GCGGAATTTA TCATGCCGTT
CGTGGCCCAG AAACTTTGGA AGACTATTCT TCTTTCTTTG GATATGACTG GAAAGATAAG
AACCAGATGA CTAATATTAT TGGCTACCAC TTAATTATAT TAGGTTTGGG CGCATTCTTA
TTAGTAATCA AAGCTGTGTT CTTGGGTGGT GTCTATGATA CTTGGGCACC AGGTGGCGGT
GATGTACGAG TAATTACTAA CCCTACTTTG AATCCTGCTG TTATATTTGG TTATCTACTT
AAAGCGCCTT TTGGTGGTGA AGGCTGGATC ATTGGCGTTA ATAACATGGA AGATATTATC
GGTGGTCATA TCTGGATTGG CCTAATTTGT ATCTCCGGTG GTATTTGGCA TATTCTAACT
AAGCCTTTTG GTTGGGCACG TCGCGCTTTC ATCTGGTCTG GAGAAGCTTA TCTATCCTAC
AGTTTGGGCG CCCTGTCTTT GATGGGTTTA ATTGCTGCCG CTTTCGTATG GTTTAACAAC
ACTGCTTATC CTAGCGAATT CTATGGTCCT ACTAATGCTG AAGCTTCTCA AGCTCAGTCT
TTTGTGTTCT TAGTCCGTGA CCAAAAATTA GGTGCTAATA TTGGTTCTGC TCAAGGTCCT
ACTGGTCTTG GTAAGTACCT AATGCGCTCT CCCACTGGTG AGATCATATT CGGTGGTGAA
ACAATGCGTT TTTGGGACTT TCGTGGTCCT TGGTTAGAGC CTCTTCGTGG TCCTAACGGT
TTAGACTTGA GTAAACTGAA GAACGATATT CAGCCTTGGC AAGTTCGTCG TGCTGCTGAG
TACATGACTC ATGCTCCTAA TGGTTCTATC AACTCTGTAG GTGGTATTAT TACAGATATT
AACGGTTTCA ATTATGTAAA CCCTCGTGCT TGGTTAGCTG CCGCTCACTT TATTCTTGGT
TTCTTCTTCT TAATTGGTCA CTTGTGGCAT GCTGGTCGCG CTCGTGCTGC TGAGGGTGGT
TTTGAGAAGG GTCTGGACCG TCAAACTGAG CCAGTACTAT CTATGCCTAA CCTTGACTAA
 
Protein sequence
MVTLSNSIIG GRDQQSTGFA WWSGNARLIN LSGKLLGAHV AHAGLIVFWA GAMTLFEVAH 
FVPEKPMYEQ GLILMPHVAT IGWGVGPGGE IVDIFPFFVV GVLHLISSAV LGLGGIYHAV
RGPETLEDYS SFFGYDWKDK NQMTNIIGYH LIILGLGAFL LVIKAVFLGG VYDTWAPGGG
DVRVITNPTL NPAVIFGYLL KAPFGGEGWI IGVNNMEDII GGHIWIGLIC ISGGIWHILT
KPFGWARRAF IWSGEAYLSY SLGALSLMGL IAAAFVWFNN TAYPSEFYGP TNAEASQAQS
FVFLVRDQKL GANIGSAQGP TGLGKYLMRS PTGEIIFGGE TMRFWDFRGP WLEPLRGPNG
LDLSKLKNDI QPWQVRRAAE YMTHAPNGSI NSVGGIITDI NGFNYVNPRA WLAAAHFILG
FFFLIGHLWH AGRARAAEGG FEKGLDRQTE PVLSMPNLD