Gene Elen_1311 details

Gene Information

Locus tag: Elen_1311
Symbol:
ID: 8415606
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Eggerthella lenta DSM 2243
Kingdom: Bacteria
Replicon accession: NC_013204
Strand:
Start bp: 1575542
End bp: 1576771
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 65%
IMG OID: 645024277
Product: RNA polymerase, sigma 70 subunit, RpoD subfamily
Protein accession: YP_003181669
Protein GI: 257791063
COG category: [K] Transcription
COG ID: [COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32)
TIGRFAM ID: [TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
            [TIGR02937] RNA polymerase sigma factor, sigma-70 family
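
The start, end, gene length and protein length fields listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length_bp = gene_length(1575542, 1576771)
    print(length_bp)                  # gene length in bp
    print(protein_length(length_bp))  # protein length in aa
```

Under these assumptions the results agree with the listed Gene Length (1230 bp) and Protein Length (409 aa).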


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.046516
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCCCAAG CCTCAAGCAA GCAAGCCCAA GAGCGGAACA ACGCGGCGGT TGCCTCGATC 
GAAGCCGAAG ACATCCTGGA GGAAGACGCC CTCGACGACG AGCCCGATGT GGTCGACGCC
GGCGACGGAC TCGACGACGA TAAGCTTGAA AGCCCCCTGT CCGATGACAG CGACGACGAA
GACCTGCTCG AAGGCATTCC TGAAGAGGAG CTTAAGGCGA CGGTCGAGGT TCAGCTGCCC
AAGGTGGCGG GCAAGAGCAA GGTGCGCTCC GTGCGCAAGC GCAATGCCGA CGCCAGCGTG
ACCATGCTCA CGGGCGACCC CGTCCGCATG TACCTCAAGG AGATCGGCAA GGTCCCGCTG
CTCACGGCCG CCGAAGAGAT CGACCTCGCC ATGAAGATCG AGGCCGGCGT GGCCGCCATG
GAGGAGCTTG AGAAGGCCGA GGACGAGGGC ATCGAGCTCG AACGCCGCGA GAAGCGCCGC
CTCGGCCGCA TCGAGCAGGT GGGCATCGAC GCGAAGCAGC AGCTTATCGA GGCGAACCTG
CGTCTCGTCG TGTCCATCGC CAAGCGCTAC GTAGGACGCG GCATGCTGTT CCTCGACCTT
ATCCAGGAGG GCAACCTCGG CCTCATCCGC GCCGTCGAGA AGTTCGACTA CACGAAGGGC
TTCAAGTTCT CGACGTACGC CACCTGGTGG ATCCGCCAGG CCATCACGCG CGCCATCGCC
GATCAGGCCC GCACCATCCG CATTCCCGTG CACATGGTGG AGACCATCAA CAAGCTCGTG
CGCATCCAGC GCCAGCTGTT GCAGGAGCTC GGCCGCGAGC CCAGCCCCGA GGAGATCGGC
AAGGAGATGG GTCTGCCCGC CGAGCGCGTG CGCGAGATCC AGAAGATCTC GCAGGAGCCC
GTGTCGCTGG AAACGCCTAT CGGCGAGGAG GAGGACTCCC AGCTGGGCGA CTTCATCGAG
GACGACGCCG CCGTGGTGCC GCCTGACGCC GCCTCGTTCA GCATGCTGCA AGAGCAGCTG
TCGAAGGTGC TCGACGGCCT GGCCGAACGC GAGCGCAAGG TGATCAGCCT GCGCTTCGGC
CTGGAGGACG GCCATCCCCG CACGCTCGAG GAGGTCGGAC GCGAGTTCGG CGTCACGCGC
GAGCGCATCC GCCAGATCGA GAGCAAGACG CTGGCGAAGC TGCGCCACCC GTCCCGCTCG
AGCAAGCTGA AAGACTACCT GGAAGATTAA
 
Protein sequence
MAQASSKQAQ ERNNAAVASI EAEDILEEDA LDDEPDVVDA GDGLDDDKLE SPLSDDSDDE 
DLLEGIPEEE LKATVEVQLP KVAGKSKVRS VRKRNADASV TMLTGDPVRM YLKEIGKVPL
LTAAEEIDLA MKIEAGVAAM EELEKAEDEG IELERREKRR LGRIEQVGID AKQQLIEANL
RLVVSIAKRY VGRGMLFLDL IQEGNLGLIR AVEKFDYTKG FKFSTYATWW IRQAITRAIA
DQARTIRIPV HMVETINKLV RIQRQLLQEL GREPSPEEIG KEMGLPAERV REIQKISQEP
VSLETPIGEE EDSQLGDFIE DDAAVVPPDA ASFSMLQEQL SKVLDGLAER ERKVISLRFG
LEDGHPRTLE EVGREFGVTR ERIRQIESKT LAKLRHPSRS SKLKDYLED