Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_5068 |
Symbol | |
ID | 4246723 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 7732475 |
End bp | 7733635 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 638109869 |
Product | RNA polymerase sigma factor RpoD |
Protein accession | YP_724445 |
Protein GI | 113478384 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain [TIGR02937] RNA polymerase sigma factor, sigma-70 family [TIGR02997] RNA polymerase sigma factor, cyanobacterial RpoD-like family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.651182 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.814117 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCAGA CTGACAAAGT AATTGAAACC ACTATTCAGC CTCAGCTAGA ATCTAGTGAG TTATTTCAGC CAACCCCTAC CAGACAAGTA AACGATGAGT TAGAGATTTT AATTGGAGAT AGAGAAGAAT ATATAGATGC TCAGTCTGAT GAGGACGATC TAAAGTCTGG TAAAGTTGCT AAATCTCGTA CTCGGACTGC GGGCAAAAAA AAGCATTATA CAGAAGACTC AATTCGCCTT TATCTACAAG AAATAGGAAG AATCAGACTA TTACGAGCTG ATGAAGAAAT TGAATTAGCC CGTAAGATTG CTGACTTACT AGAATTGGAA CGAATTCGAG AAGAGTTAAT TTATCACTTA GATCGAGAAC CCCAAGTGAG TGAGTGGGCA AATGCAGTAG ATATGGAATT GCCAAAGTTT AAGCGTCGCT TAATACTTGG GCGTAGAGCT AAAGAAAAGA TGGTACAGTC TAACCTGCGT TTGGTGGTTT CGATCGCCAA AAAGTACATG AACCGAGGTT TATCATTCCA GGACTTGATT CAAGAAGGTA GTTTGGGGTT AATTCGAGCA GCAGAAAAAT TTGATCATGA AAAGGGGTAT AAATTTAGTA CTTATGCAAC TTGGTGGATT CGTCAAGCTA TTACTAGAGC TATAGCTGAT CAGTCTCGTA CTATCCGTCT ACCAGTTCAT CTATACGAAA CAATATCCCG AATCAAGAAA ACTACTAAGC TTCTTTCCCA AGAAATGGGT CGTAAACCCA CAGAAGAGGA AATAGCAACT AGCATGGAAA TGACTATCGA AAAGTTGCGT TTCATTGCTA AATCTGCTCA ACTTCCCATA TCTTTAGAAA CTCCCATTGG AAAAGAAGAA GACTCTCGAC TTGGAGATTT TATTGAGTCA GATGGGGAGA CTCCTGAAGA TGAAGTATCC AAAAATCTAT TGCGAGAAGA TTTAGAAAGT GTTTTAAATA GTTTGAGTCC CCGTGAACGG GATGTATTAC GGTTAAGGTA TGGCTTGGAT GACGGTCGAA TGAAGACTTT AGAAGAAATT GGGCAAATAT TTAATGTGAC TCGTGAGCGA ATTCGACAAA TTGAGGCAAA AGCCCTTAGA AAGTTACGAC ATCCAAACCG AAACAGTATT CTCAAAGAAT ATATCCGCTA G
|
Protein sequence | MIQTDKVIET TIQPQLESSE LFQPTPTRQV NDELEILIGD REEYIDAQSD EDDLKSGKVA KSRTRTAGKK KHYTEDSIRL YLQEIGRIRL LRADEEIELA RKIADLLELE RIREELIYHL DREPQVSEWA NAVDMELPKF KRRLILGRRA KEKMVQSNLR LVVSIAKKYM NRGLSFQDLI QEGSLGLIRA AEKFDHEKGY KFSTYATWWI RQAITRAIAD QSRTIRLPVH LYETISRIKK TTKLLSQEMG RKPTEEEIAT SMEMTIEKLR FIAKSAQLPI SLETPIGKEE DSRLGDFIES DGETPEDEVS KNLLREDLES VLNSLSPRER DVLRLRYGLD DGRMKTLEEI GQIFNVTRER IRQIEAKALR KLRHPNRNSI LKEYIR
|
| |