Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_0744 |
Symbol | |
ID | 4242476 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 1203288 |
End bp | 1204619 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 638106034 |
Product | sun protein |
Protein accession | YP_720647 |
Protein GI | 113474586 |
COG category | [J] Translation, ribosomal structure and biogenesis [K] Transcription |
COG ID | [COG0144] tRNA and rRNA cytosine-C5-methylases [COG0781] Transcription termination factor |
TIGRFAM ID | [TIGR00446] NOL1/NOP2/sun family putative RNA methylase [TIGR00563] ribosomal RNA small subunit methyltransferase RsmB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.323768 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAATAATA ATCCTCGTCA ACTAGCTTTT ATTATCCTCC AAGAAATATA TCGAAAACAA GTTTTTACTG ATGTTGCTCT AGATAGACAT CTGAAAAAAA ATGACTTAAT AGATGCTAAC CGCAGATTAG TTACAGAATT AGTTTATGGT TGTGTGAGAA GGCAGCGATC GCTTGATGCT ATTATCGACC AATTAGCAAA AAAGAAATCT CCCCAACAAC ACCCATATTT ACGGATAATT CTCCATATTG GTTTATATCA ATTATCTTAT TTAGAACAAA TTCCAGAATC AGCAGCAGTT GATACAACAG TTGAGCTAGC TAAACAAAAT AAATTTGCTA AATTAGCTGG TTTTGTTAAT GGTTTACTCC GAGAATATAT TCGGCAGAAC TTAACTATAA ATCTCCCAGA AAATCCTGTT CAAAAATTAG GAATATCTTA TAGTTTTCCT AACTGGATAG TTAAATATTG GATAGAAGAA TTAGGTTTAA CTGAAGCTGA AAAATTATGC TATTGGTTCA ATCTATCCCC TAGTATTGAT TTAAGAATTA ATCCACTCAA AACTTCTGTT GAAGAAGTAG AAATAGCTAT GAAAAATATA GGTATTTCTG TTAGTAGAAT TTTGCAAGTT CCCCAAGCTT TAAGATTAAA TGGGGCAGTG GGGCAAATTC AAAAATTACC TGGTTATAAT GAAGGTTGGT GGTCAATTCA AGATAGTAGC GCTCAGTTAG TTTGTTATTT ATTAAATCCT CAACCAGGAG AAATAATAAT TGATGCTTGT GCTGCACCTG GAGGTAAGAC AACTCATATA GGGGAATTAA TGGGAGATAA TGGTAAAATT TTTGCTATTG ATATGACTGC TTCTAGGTTG AAAAAATTAG AATCAAATAC TGAAAGGCTA CAGTTAAAAT CTATCTCTAT TTCTAGAGGT GATAGTCGAA ATTTAACTGA GTTTATTAAT CAAGCTGACC GGGTTTTATT AGATGTACCT TGTTCTGGTT TAGGTACTTT ACATCGTAGG GCAGATGCAC GGTGGAGAAA AACTTTAGAG AATATTGGAG AATTGGCTAA ACTTCAGGGT GAGTTGCTAG AAAATGCTGC TAAATGGGTG AAGCCTGGGG GTGTCTTAGT ATATGCTACT TGCACAATTT ATCCCTTAGA AAATGAGGGA GTTATTGAGA AATTTTTAAC TAATAATTAT GAGTGGGAAA TTGAAGCACC AACTGTAGAT TTTATGGTTT CACCTTGTAG GGAAGGATGG ATAAAAATTT GGCCTCATAG AGAACAAATG GATGGATTTT TTATGGTTAA ATTAAGACGC AAGGTTATTT AG
|
Protein sequence | MNNNPRQLAF IILQEIYRKQ VFTDVALDRH LKKNDLIDAN RRLVTELVYG CVRRQRSLDA IIDQLAKKKS PQQHPYLRII LHIGLYQLSY LEQIPESAAV DTTVELAKQN KFAKLAGFVN GLLREYIRQN LTINLPENPV QKLGISYSFP NWIVKYWIEE LGLTEAEKLC YWFNLSPSID LRINPLKTSV EEVEIAMKNI GISVSRILQV PQALRLNGAV GQIQKLPGYN EGWWSIQDSS AQLVCYLLNP QPGEIIIDAC AAPGGKTTHI GELMGDNGKI FAIDMTASRL KKLESNTERL QLKSISISRG DSRNLTEFIN QADRVLLDVP CSGLGTLHRR ADARWRKTLE NIGELAKLQG ELLENAAKWV KPGGVLVYAT CTIYPLENEG VIEKFLTNNY EWEIEAPTVD FMVSPCREGW IKIWPHREQM DGFFMVKLRR KVI
|
| |