Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Elen_2804 |
Symbol | |
ID | 8417130 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Eggerthella lenta DSM 2243 |
Kingdom | Bacteria |
Replicon accession | NC_013204 |
Strand | - |
Start bp | 3253606 |
End bp | 3254556 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 645025779 |
Product | DNA-directed RNA polymerase, alpha subunit |
Protein accession | YP_003183140 |
Protein GI | 257792534 |
COG category | [K] Transcription |
COG ID | [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit |
TIGRFAM ID | [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000656675 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.000000144009 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACAGAGT TCATGAGGCC TACGGTAACA ACGGAAGAAG TCAACGACAC TGTCGCACGT TTCATAGTCG AGCCGCTCGA GCGTGGCTAC GGCTACACGC TGGGCAACTG CATGCGCCGC GTTCTGCTCT CCTCGCTGGA TGGTGCGAAA GCCACCGCCA TCCAGATCGA GGGTGTGCAG CATGAGTTCA CGACGGCCGA AGGCGTCATC GAGGATATCA CCGATATCGT CCTGAATGTC AAGGGTCTTG TGTTCTCCGC ACTGAACGAT GATATCGAGG AAGCCACGGC GCATGTGTCG GCGGAGGGTC CTTGCACGGT GACGGGTGCC GATCTCGACA TTCCCACCGA GTTCACGCTG GTCAACCCGG AGCACGTCAT CGCTACGGTC GCCGACGGCG GTCAGCTGGA CATGACGGTT CGCATCGGCG TGGGCCGCGG TTACGTGTCG GCCGAGCGCA ACAAGCGCAC GGAGGATCCC ATCGGGGTCA TCCATGTGGA CTCGCTGTTC TCGCCGGTTC GCCGCTGCAC GCTCAACGTC ACCGACACCC GCGTGGGTCA GCGCACCGAC TACGACAAGC TCGTGCTGGA AGTTGAGACT GACGGCAGCA TCACGCCCAC CGAGGCCGTG TGCCGCGCGT CCAACATCAT CAACCAGTAC ATGGGCGCGT TTTTGAGCCT GTCCGACGTT GTCGACGAGG AGGAGGGCGA AATCCCGTCC ATCTTCGCGC CGGAGGGCCA GGAGTCCAAC GCCGAGCTGG ACAAGCAGAT CGAGGATCTC GACCTGTCCG TCCGCTCGTA CAACTGCCTG AAGCGTGCCG GAATCCACTC GGTGCGCCAG CTCGTTGAGT TCTCCGAAAA CGACCTGCTG AACATCAGAA ACTTTGGTGC GAAGTCCATT GAAGAAGTGA AGGACAAGCT CATTTCCATG GACCTCAATT TGAAGCTATA G
|
Protein sequence | MTEFMRPTVT TEEVNDTVAR FIVEPLERGY GYTLGNCMRR VLLSSLDGAK ATAIQIEGVQ HEFTTAEGVI EDITDIVLNV KGLVFSALND DIEEATAHVS AEGPCTVTGA DLDIPTEFTL VNPEHVIATV ADGGQLDMTV RIGVGRGYVS AERNKRTEDP IGVIHVDSLF SPVRRCTLNV TDTRVGQRTD YDKLVLEVET DGSITPTEAV CRASNIINQY MGAFLSLSDV VDEEEGEIPS IFAPEGQESN AELDKQIEDL DLSVRSYNCL KRAGIHSVRQ LVEFSENDLL NIRNFGAKSI EEVKDKLISM DLNLKL
|
| |