Gene Tery_1214 details


Gene Information

Locus tag: Tery_1214
Symbol:
ID: 4241787
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 1886638
End bp: 1888128
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 33%
IMG OID: 638106430
Product: RNA polymerase, sigma 28 subunit
Protein accession: YP_721042
Protein GI: 113474981
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02937] RNA polymerase sigma factor, sigma-70 family
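
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon not counted in the protein length.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the Gene Information table.
start_bp, end_bp = 1886638, 1888128

gene_length = span_length(start_bp, end_bp)
print(gene_length)                           # 1491
print(expected_protein_length(gene_length))  # 496
```

Both results agree with the Gene Length (1491 bp) and Protein Length (496 aa) fields.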


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.672098
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATAATGA TAGGGCATAC ATTAGAACTA GACGATCTAA AAAAAATTAG TAGCCCGAAA 
AAGATAGTAG CTTTGTTTGA AAAACTTGGC TTCAATTTCC AAATTGAACA TTTAGATATT
GAACAACTAG AACTACCAAG TTATTGCAGA GAAACGATTG ATAATACTTA TAAAATTTAT
AATCCGAACC AAGAGGAATT ACAAGTATTT CTTTTAGAAT TAAGACAAGA AAAATTTAAA
ACAAATTATA AAGAGCAAGA AAGAATAATA ACTATTTCTC AGGGAATCTT GAGAAACAAA
AATAATTTCT TGCTAATTGC CACAAAAAAA TATAATAAGT TAATAATAGT TGCTCCTCAT
AGAAGAGAAA ATCAAAATAT TAAAATTTCA ATTAAATGGT GGTTAATTGA TTGTCAAAAT
CCTAATTATT ATGATACTTA CTGGCTAGAA AATGTAGCTA TAAATAATTG TGAACCTACA
GAAGAATTGA CAAATAAACA TAATAGTGAA GGTAGCTACG ACCCAGACAT AGAACCGCCT
ACTGGAACAT TTACTGAAGA CTCAATTCAA CTTTATTATA AGCAAGTTAA TAGCTTCGCC
CGCTTGCACC CTGAGGAAGA AATTCATTTA GCACATCAAA TTGATGGGCT ATTAGCTTTA
GAGAAAATTT ATGAAAAATT AGAGGAAAAG CTCAACCGTT CGCCTAACGA AATAGAATGG
GCAACAGAAG CCAAAATGAG CAAATGGGAA CTGTGTGATC GCCTCGAGCA AGGTAGACGA
GCTAAAAATC AAATGGTGAC AGCTAACTTG CGACTGGTGA CATCAGTTGC CAAACGATAC
CAAAACCGTG GTCTAGACTT CCAAGATTTA ATTCAAGAGG GAGTTATAGG ATTACTTCGT
GCCACAGAAA AATTTGATCA TACAAAAGGT TATAAGTTTT CTACTTATGG TATTTGGTGG
ATTCGACAAG CTATTACCAG AGCTATTTGT GACTATTCTC GGATCGTTCG GTTGCCAGTT
TATCTCTACG AAACTATTTC CCAAATTAAG AAAATAGTTA AGCAAATTTC AATAGAATGT
TCTCCCCTAA CTGCAGAAGA AGTTGCTACA CGCATGGAAA TGACAACGGA GAAGTTGCAA
TTTATTATAG AATGCGCTCA AGAAACTTTA TCTCTAGACT ACCCTATAAA TAAAGGGGAA
AACATTATCT ACAAGGAAAT GATTAAATTT AATGGGGAAA CACCAACAAA ATATGTCTTT
GAAAATTGCT TGCAAGAAGA TGTAGAAAGT GTTTTGAAAA CTCTAACTGA ACGAGAAAGA
AATGTGTTAC AAATGCGATT CGGTTTGGAT GATGGTCAAG AAAAGACCCT CAAAGAAATT
GGTGATACTT TTAATTTAAC GCGGGAAAGA ATTAGGCAAA TAGAGGCTAA AGCATTGAGA
AAGTTAGAAG ATCCAAAGCG AAACCATATT CTCAGGGAAT ATATTCATTA G
 
Protein sequence
MIMIGHTLEL DDLKKISSPK KIVALFEKLG FNFQIEHLDI EQLELPSYCR ETIDNTYKIY 
NPNQEELQVF LLELRQEKFK TNYKEQERII TISQGILRNK NNFLLIATKK YNKLIIVAPH
RRENQNIKIS IKWWLIDCQN PNYYDTYWLE NVAINNCEPT EELTNKHNSE GSYDPDIEPP
TGTFTEDSIQ LYYKQVNSFA RLHPEEEIHL AHQIDGLLAL EKIYEKLEEK LNRSPNEIEW
ATEAKMSKWE LCDRLEQGRR AKNQMVTANL RLVTSVAKRY QNRGLDFQDL IQEGVIGLLR
ATEKFDHTKG YKFSTYGIWW IRQAITRAIC DYSRIVRLPV YLYETISQIK KIVKQISIEC
SPLTAEEVAT RMEMTTEKLQ FIIECAQETL SLDYPINKGE NIIYKEMIKF NGETPTKYVF
ENCLQEDVES VLKTLTERER NVLQMRFGLD DGQEKTLKEI GDTFNLTRER IRQIEAKALR
KLEDPKRNHI LREYIH
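
The relationship between the gene sequence and the protein sequence can be checked by translating the DNA codon by codon. The following is a minimal sketch, not the database's own pipeline: the helper names are hypothetical, the whitespace-grouped layout is assumed to match the blocks above, and the codon table uses the standard amino-acid assignments, which translation table 11 shares (table 11 differs in its permitted start codons; this gene starts with ATG, so that difference does not matter here).

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence listed above.
first_line = "ATGATAATGA TAGGGCATAC ATTAGAACTA GACGATCTAA AAAAAATTAG TAGCCCGAAA"
print(translate(first_line))  # MIMIGHTLELDDLKKISSPK
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the GC content field.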