Gene Elen_2804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagElen_2804 
Symbol 
ID8417130 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEggerthella lenta DSM 2243 
KingdomBacteria 
Replicon accessionNC_013204 
Strand
Start bp3253606 
End bp3254556 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content60% 
IMG OID645025779 
ProductDNA-directed RNA polymerase, alpha subunit 
Protein accessionYP_003183140 
Protein GI257792534 
COG category[K] Transcription 
COG ID[COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit 
TIGRFAM ID[TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000656675 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000144009 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACAGAGT TCATGAGGCC TACGGTAACA ACGGAAGAAG TCAACGACAC TGTCGCACGT 
TTCATAGTCG AGCCGCTCGA GCGTGGCTAC GGCTACACGC TGGGCAACTG CATGCGCCGC
GTTCTGCTCT CCTCGCTGGA TGGTGCGAAA GCCACCGCCA TCCAGATCGA GGGTGTGCAG
CATGAGTTCA CGACGGCCGA AGGCGTCATC GAGGATATCA CCGATATCGT CCTGAATGTC
AAGGGTCTTG TGTTCTCCGC ACTGAACGAT GATATCGAGG AAGCCACGGC GCATGTGTCG
GCGGAGGGTC CTTGCACGGT GACGGGTGCC GATCTCGACA TTCCCACCGA GTTCACGCTG
GTCAACCCGG AGCACGTCAT CGCTACGGTC GCCGACGGCG GTCAGCTGGA CATGACGGTT
CGCATCGGCG TGGGCCGCGG TTACGTGTCG GCCGAGCGCA ACAAGCGCAC GGAGGATCCC
ATCGGGGTCA TCCATGTGGA CTCGCTGTTC TCGCCGGTTC GCCGCTGCAC GCTCAACGTC
ACCGACACCC GCGTGGGTCA GCGCACCGAC TACGACAAGC TCGTGCTGGA AGTTGAGACT
GACGGCAGCA TCACGCCCAC CGAGGCCGTG TGCCGCGCGT CCAACATCAT CAACCAGTAC
ATGGGCGCGT TTTTGAGCCT GTCCGACGTT GTCGACGAGG AGGAGGGCGA AATCCCGTCC
ATCTTCGCGC CGGAGGGCCA GGAGTCCAAC GCCGAGCTGG ACAAGCAGAT CGAGGATCTC
GACCTGTCCG TCCGCTCGTA CAACTGCCTG AAGCGTGCCG GAATCCACTC GGTGCGCCAG
CTCGTTGAGT TCTCCGAAAA CGACCTGCTG AACATCAGAA ACTTTGGTGC GAAGTCCATT
GAAGAAGTGA AGGACAAGCT CATTTCCATG GACCTCAATT TGAAGCTATA G
 
Protein sequence
MTEFMRPTVT TEEVNDTVAR FIVEPLERGY GYTLGNCMRR VLLSSLDGAK ATAIQIEGVQ 
HEFTTAEGVI EDITDIVLNV KGLVFSALND DIEEATAHVS AEGPCTVTGA DLDIPTEFTL
VNPEHVIATV ADGGQLDMTV RIGVGRGYVS AERNKRTEDP IGVIHVDSLF SPVRRCTLNV
TDTRVGQRTD YDKLVLEVET DGSITPTEAV CRASNIINQY MGAFLSLSDV VDEEEGEIPS
IFAPEGQESN AELDKQIEDL DLSVRSYNCL KRAGIHSVRQ LVEFSENDLL NIRNFGAKSI
EEVKDKLISM DLNLKL