Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2074 |
Symbol | |
ID | 8416392 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | + |
Start bp | 2442049 |
End bp | 2442924 |
Gene Length | 876 bp |
Protein Length | 291 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 645025057 |
Product | RNA polymerase, sigma-24 subunit, ECF subfamily |
Protein accession | YP_003182426 |
Protein GI | 257791820 |
COG category | [K] Transcription |
COG ID | [COG1595] DNA-directed RNA polymerase specialized sigma subunit, sigma24 homolog |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.558135 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.000000114613 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCATCGCG AAGAATTATC GAAAGAAGCG CAGCTCGACC TGGTGGACAA GGCCATCGTA GGCGACGGCC AGGCGTTGGA GGAACTGCTG CGGTCCACGA GCGGGCTCGT GTTCAACCTT GCGCTCAGGT TCCTGGGCAC CGTCCACGAC GCCGAGGACG CGAGCCAGGA GATCGCGGTC AAGATCATGA CCCGGCTCTC GACGTTTCGC AAGGAGAGCG CGTTCTCCAC ATGGGTGTAC CGCATCGCCG TGAACCACCT CAAGGACTGC CGCACGCATC AGTTCGCGAA CGCCCCGTTC AGCTTCGAGA TGTACGGGGC CGACATCGTC GACGAGCGCG CGAAGGACGT CCCCGACCTG TCCGAAGGGG TCGACCGCGG CATGCTGGCG CGTGAGCTGA AGCTGTCGTG CACGAACGTG ATGCTGCAAT GCCTCGACGC CGACAGCCGC TGCGCCTACG TGCTGGGCAC CATGTTCAAG GTGGACAGCT CCACCGCGGC CGACGTGCTG GGCATCACGC CCGAGGCGTA CCGCCAGCGC CTGTCGCGCG CCCGCAAGAC GGTCGCCGAG TTTTTGGGCG CGTACTGCCA GCACGGCGGC GCGGAGACGT GCTCGTGCGA GCGGCGCGTG AACTTCGCCA TCGCCACGCA TCGGCTGGCC CCGCACAACC TGGAGTACCT CGAGCTCACC GAGGAGGAGC GCGCGCAGGA CGCCTTCGTC GACGCCATGG ACGAGATGGA CGGCTACGCC AGCCTGTTCG ACCGGCTTCC CTCCTACCGC CCCACGCCGC GCGCCAAGGA GCTGCTGGAA GCGTGCATGG GCACCGCCAG CTTCGACGCC GTGGTGAACG CCGGCAAGGA GGCCTCCCAT GCCTAA
|
Protein sequence | MHREELSKEA QLDLVDKAIV GDGQALEELL RSTSGLVFNL ALRFLGTVHD AEDASQEIAV KIMTRLSTFR KESAFSTWVY RIAVNHLKDC RTHQFANAPF SFEMYGADIV DERAKDVPDL SEGVDRGMLA RELKLSCTNV MLQCLDADSR CAYVLGTMFK VDSSTAADVL GITPEAYRQR LSRARKTVAE FLGAYCQHGG AETCSCERRV NFAIATHRLA PHNLEYLELT EEERAQDAFV DAMDEMDGYA SLFDRLPSYR PTPRAKELLE ACMGTASFDA VVNAGKEASH A
|
| |