Gene Tery_5068 details

Gene Information

Locus tag: Tery_5068
Symbol:
ID: 4246723
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 7732475
End bp: 7733635
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 39%
IMG OID: 638109869
Product: RNA polymerase sigma factor RpoD
Protein accession: YP_724445
Protein GI: 113478384
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
            [TIGR02937] RNA polymerase sigma factor, sigma-70 family
            [TIGR02997] RNA polymerase sigma factor, cyanobacterial RpoD-like family
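
The start and end coordinates, gene length, and protein length in this record are mutually consistent, and the arithmetic can be checked directly. The sketch below assumes 1-based inclusive coordinates (the convention under which the listed values agree) and a CDS that ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(7732475, 7733635)
print(length, protein_length(length))  # prints: 1161 386
```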


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.651182
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.814117
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCAGA CTGACAAAGT AATTGAAACC ACTATTCAGC CTCAGCTAGA ATCTAGTGAG 
TTATTTCAGC CAACCCCTAC CAGACAAGTA AACGATGAGT TAGAGATTTT AATTGGAGAT
AGAGAAGAAT ATATAGATGC TCAGTCTGAT GAGGACGATC TAAAGTCTGG TAAAGTTGCT
AAATCTCGTA CTCGGACTGC GGGCAAAAAA AAGCATTATA CAGAAGACTC AATTCGCCTT
TATCTACAAG AAATAGGAAG AATCAGACTA TTACGAGCTG ATGAAGAAAT TGAATTAGCC
CGTAAGATTG CTGACTTACT AGAATTGGAA CGAATTCGAG AAGAGTTAAT TTATCACTTA
GATCGAGAAC CCCAAGTGAG TGAGTGGGCA AATGCAGTAG ATATGGAATT GCCAAAGTTT
AAGCGTCGCT TAATACTTGG GCGTAGAGCT AAAGAAAAGA TGGTACAGTC TAACCTGCGT
TTGGTGGTTT CGATCGCCAA AAAGTACATG AACCGAGGTT TATCATTCCA GGACTTGATT
CAAGAAGGTA GTTTGGGGTT AATTCGAGCA GCAGAAAAAT TTGATCATGA AAAGGGGTAT
AAATTTAGTA CTTATGCAAC TTGGTGGATT CGTCAAGCTA TTACTAGAGC TATAGCTGAT
CAGTCTCGTA CTATCCGTCT ACCAGTTCAT CTATACGAAA CAATATCCCG AATCAAGAAA
ACTACTAAGC TTCTTTCCCA AGAAATGGGT CGTAAACCCA CAGAAGAGGA AATAGCAACT
AGCATGGAAA TGACTATCGA AAAGTTGCGT TTCATTGCTA AATCTGCTCA ACTTCCCATA
TCTTTAGAAA CTCCCATTGG AAAAGAAGAA GACTCTCGAC TTGGAGATTT TATTGAGTCA
GATGGGGAGA CTCCTGAAGA TGAAGTATCC AAAAATCTAT TGCGAGAAGA TTTAGAAAGT
GTTTTAAATA GTTTGAGTCC CCGTGAACGG GATGTATTAC GGTTAAGGTA TGGCTTGGAT
GACGGTCGAA TGAAGACTTT AGAAGAAATT GGGCAAATAT TTAATGTGAC TCGTGAGCGA
ATTCGACAAA TTGAGGCAAA AGCCCTTAGA AAGTTACGAC ATCCAAACCG AAACAGTATT
CTCAAAGAAT ATATCCGCTA G
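
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before any length or composition check. The following is a minimal sketch of that cleanup together with a GC percentage calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used here as sample input.
first_line = "ATGATCCAGA CTGACAAAGT AATTGAAACC ACTATTCAGC CTCAGCTAGA ATCTAGTGAG"
seq = clean_sequence(first_line)
print(len(seq))  # prints: 60
```

Run on the full sequence, the same two helpers give values that can be compared with the gene length and GC content listed under Gene Information.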
 
Protein sequence
MIQTDKVIET TIQPQLESSE LFQPTPTRQV NDELEILIGD REEYIDAQSD EDDLKSGKVA 
KSRTRTAGKK KHYTEDSIRL YLQEIGRIRL LRADEEIELA RKIADLLELE RIREELIYHL
DREPQVSEWA NAVDMELPKF KRRLILGRRA KEKMVQSNLR LVVSIAKKYM NRGLSFQDLI
QEGSLGLIRA AEKFDHEKGY KFSTYATWWI RQAITRAIAD QSRTIRLPVH LYETISRIKK
TTKLLSQEMG RKPTEEEIAT SMEMTIEKLR FIAKSAQLPI SLETPIGKEE DSRLGDFIES
DGETPEDEVS KNLLREDLES VLNSLSPRER DVLRLRYGLD DGRMKTLEEI GQIFNVTRER
IRQIEAKALR KLRHPNRNSI LKEYIR
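
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is a simplified illustration, not the database's own pipeline. It uses the standard codon assignments, which translation table 11 shares for elongation (table 11 differs in which codons may act as start codons, and this sketch does not handle alternative starts). The name `translate` is hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order, with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGATCCAGA CTGACAAAGT AATTGAAACC"))  # prints: MIQTDKVIET
```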