Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_0784 |
Symbol | |
ID | 4243195 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 1251949 |
End bp | 1252959 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 638106065 |
Product | RNA polymerase sigma factor |
Protein accession | YP_720677 |
Protein GI | 113474616 |
COG category | [K] Transcription |
COG ID | [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family [TIGR02997] RNA polymerase sigma factor, cyanobacterial RpoD-like family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.532885 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAAAGC TAAATCAAGA AAATAAAAAT CAAAAGAAAA CTCAAACTAA ATTTGGTCCT ACTACTTACT CTTCAATGGA CACAGTTCGT ACCTACTTAC ATGAGATTGG TCGTGTACCA CTTTTAACTC GTGAGGAAGA GATTGTTTAT GGTAAACAAG TGCAGCAAAT GATGCAATTC CTAGAATTAA AAGAAAAATT AGAGGAAAAA CTTAATAGAA CAGCAACTTT ACTAGAATGG GCAGAACATT TACTAGTAAG TGAAAAAGTA CTTAAGCAAG CAATTAAGCA AGGTCAACGT GCCAAGCAAA AAATGATTGA AGCTAATCTT CGCTTAGTAG TAGCGATCGC TAAAAAGTAT CAAAAGCGAA ATATGGAGTT CTTAGATTTA ATTCAAGAAG GTACTCTTGG TTTAGAAAGA GGGGTGGAAA AATTTGATCC TACACGGGGA TACAAATTTT CTACTTATGC TTACTGGTGG ATTCGTCAGG CAATTACTAG AGCGATCGCA CAACAAGCTC GTACTATTCG CCTACCTATT CATATTACTG AAAAGCTCAA CAAAATCAAA AGAGTTCAGA GAGAACTAAT TCAGAGTTTG GGTCGTAGTC CTACTCCTTC TGAGATTGCC CAAGCTTTAG AATTAGAACC TTCTCAGATT AGAGAATATC TTTTGATGGC CCGTCATCCT ATTTCCTTAG ATTTAAGAGT TGGAGATAAT CAGGATACTG AACTGCAAGA ACTTTTGGAG GATGAAACAT CTTCTCCGGA TGACTACATA ACTGGTGAGT TATTGCGTCA AGATATAAAT ACCTTACTGG CAGAATTGAG TGAGCAACAA CGTCGTGTAT TGGTATTGCG TTTTGGTCTT GAGGATGGTA AGGAGATGTC TTTAGCAAAA GTGGGAGATA AACTTCAACT TAGTCGTGAG CGAGTTCGCC AACTAGAACA TCAAGCTTTG GCCTTTTTAC GCCGTCGTCA GGCCAAGGTG CGGGAGTATG TAGCTAGCTA A
|
Protein sequence | MSKLNQENKN QKKTQTKFGP TTYSSMDTVR TYLHEIGRVP LLTREEEIVY GKQVQQMMQF LELKEKLEEK LNRTATLLEW AEHLLVSEKV LKQAIKQGQR AKQKMIEANL RLVVAIAKKY QKRNMEFLDL IQEGTLGLER GVEKFDPTRG YKFSTYAYWW IRQAITRAIA QQARTIRLPI HITEKLNKIK RVQRELIQSL GRSPTPSEIA QALELEPSQI REYLLMARHP ISLDLRVGDN QDTELQELLE DETSSPDDYI TGELLRQDIN TLLAELSEQQ RRVLVLRFGL EDGKEMSLAK VGDKLQLSRE RVRQLEHQAL AFLRRRQAKV REYVAS
|
| |