Gene Tery_0744 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_0744 
Symbol 
ID4242476 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp1203288 
End bp1204619 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content34% 
IMG OID638106034 
Productsun protein 
Protein accessionYP_720647 
Protein GI113474586 
COG category[J] Translation, ribosomal structure and biogenesis
[K] Transcription 
COG ID[COG0144] tRNA and rRNA cytosine-C5-methylases
[COG0781] Transcription termination factor 
TIGRFAM ID[TIGR00446] NOL1/NOP2/sun family putative RNA methylase
[TIGR00563] ribosomal RNA small subunit methyltransferase RsmB 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.323768 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATAATA ATCCTCGTCA ACTAGCTTTT ATTATCCTCC AAGAAATATA TCGAAAACAA 
GTTTTTACTG ATGTTGCTCT AGATAGACAT CTGAAAAAAA ATGACTTAAT AGATGCTAAC
CGCAGATTAG TTACAGAATT AGTTTATGGT TGTGTGAGAA GGCAGCGATC GCTTGATGCT
ATTATCGACC AATTAGCAAA AAAGAAATCT CCCCAACAAC ACCCATATTT ACGGATAATT
CTCCATATTG GTTTATATCA ATTATCTTAT TTAGAACAAA TTCCAGAATC AGCAGCAGTT
GATACAACAG TTGAGCTAGC TAAACAAAAT AAATTTGCTA AATTAGCTGG TTTTGTTAAT
GGTTTACTCC GAGAATATAT TCGGCAGAAC TTAACTATAA ATCTCCCAGA AAATCCTGTT
CAAAAATTAG GAATATCTTA TAGTTTTCCT AACTGGATAG TTAAATATTG GATAGAAGAA
TTAGGTTTAA CTGAAGCTGA AAAATTATGC TATTGGTTCA ATCTATCCCC TAGTATTGAT
TTAAGAATTA ATCCACTCAA AACTTCTGTT GAAGAAGTAG AAATAGCTAT GAAAAATATA
GGTATTTCTG TTAGTAGAAT TTTGCAAGTT CCCCAAGCTT TAAGATTAAA TGGGGCAGTG
GGGCAAATTC AAAAATTACC TGGTTATAAT GAAGGTTGGT GGTCAATTCA AGATAGTAGC
GCTCAGTTAG TTTGTTATTT ATTAAATCCT CAACCAGGAG AAATAATAAT TGATGCTTGT
GCTGCACCTG GAGGTAAGAC AACTCATATA GGGGAATTAA TGGGAGATAA TGGTAAAATT
TTTGCTATTG ATATGACTGC TTCTAGGTTG AAAAAATTAG AATCAAATAC TGAAAGGCTA
CAGTTAAAAT CTATCTCTAT TTCTAGAGGT GATAGTCGAA ATTTAACTGA GTTTATTAAT
CAAGCTGACC GGGTTTTATT AGATGTACCT TGTTCTGGTT TAGGTACTTT ACATCGTAGG
GCAGATGCAC GGTGGAGAAA AACTTTAGAG AATATTGGAG AATTGGCTAA ACTTCAGGGT
GAGTTGCTAG AAAATGCTGC TAAATGGGTG AAGCCTGGGG GTGTCTTAGT ATATGCTACT
TGCACAATTT ATCCCTTAGA AAATGAGGGA GTTATTGAGA AATTTTTAAC TAATAATTAT
GAGTGGGAAA TTGAAGCACC AACTGTAGAT TTTATGGTTT CACCTTGTAG GGAAGGATGG
ATAAAAATTT GGCCTCATAG AGAACAAATG GATGGATTTT TTATGGTTAA ATTAAGACGC
AAGGTTATTT AG
 
Protein sequence
MNNNPRQLAF IILQEIYRKQ VFTDVALDRH LKKNDLIDAN RRLVTELVYG CVRRQRSLDA 
IIDQLAKKKS PQQHPYLRII LHIGLYQLSY LEQIPESAAV DTTVELAKQN KFAKLAGFVN
GLLREYIRQN LTINLPENPV QKLGISYSFP NWIVKYWIEE LGLTEAEKLC YWFNLSPSID
LRINPLKTSV EEVEIAMKNI GISVSRILQV PQALRLNGAV GQIQKLPGYN EGWWSIQDSS
AQLVCYLLNP QPGEIIIDAC AAPGGKTTHI GELMGDNGKI FAIDMTASRL KKLESNTERL
QLKSISISRG DSRNLTEFIN QADRVLLDVP CSGLGTLHRR ADARWRKTLE NIGELAKLQG
ELLENAAKWV KPGGVLVYAT CTIYPLENEG VIEKFLTNNY EWEIEAPTVD FMVSPCREGW
IKIWPHREQM DGFFMVKLRR KVI