Gene Emin_0312 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0312 
Symbol 
ID6263690 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp332638 
End bp334353 
Gene Length1716 bp 
Protein Length571 aa 
Translation table11 
GC content41% 
IMG OID642610777 
ProductRpoD family RNA polymerase sigma factor 
Protein accessionYP_001875209 
Protein GI187250727 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.593238 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.00000519497 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGTAAACG TTGCAAATAA AGCTTTTAAA GATTTAGTTG AAGTTGGTAA AGAAAACGGT 
TTTCTTACCT TGGACGAAAT TAACCGTTCG CTTACATCTT CCTCAATGAG CGCGGAAGAT
ATTGACGGTT TAATGGGCAC ATTGGAAGAT TTGGGCATAC AGGTAGTTGA CCGTAAAAAA
TATAAATCCG TAGCCATGGC GGAAAAAGAA GCCTATACCG AAGAGTTTAT GGCTAACCCC
GATATTTCTA ACTCTATCCG CATGTACTTG TCTGAAATGG GCAAAGTGCC GCTGCTTTCC
AGAGACGAGG AAATAACCTT GGCCCGTAAC GTCAGGGAAA GAGAAAAAGA ATTGCGCAAA
CTTGTGTTGG AATCCCCCAT CACAATGCGC GAAATTTGCA ACTGGGAAGA ACTTGTAGAC
CAAGAAGAAA TGACTACAAA AGAATTGATG CCCAGAGGCA AAAGAACAAC GCGCGAACTT
AACAACATGC GTAAAAAACT TAAAGAGGCC GCCAAGTTTA TCGGTAAACG GGAAGGTGAA
ATTACCGAAT TAATGAAAAA ATTGCGCGAG CCCAGCATTT CAGACAAAAT GCTTAACAAA
TATACCGAAG CTCTTGAGAA AAAGCAAAAG GAAGTTGTTG ACAGCATTAT AAAACTTAAT
TTAAACCAAA ATAAAATTAA GCGTTTAACA AACAGAATCA AAAGTTTGGC GCAGAGAATA
GCCGAGTCAA GAAACGACCT TAACAGGTTT GACGAATATT TTGGCCAGTA CGCCGAAGTA
AAAAAACTTT TTACCCAGGC TTCCAAAGGA AAAATTTCTA AAGCCGAACT TAAAAAGAAA
ACCCGCTTTA CTTATGAAGA ACTTGAAACC GCCATCAACA ATATCGATAT GATACGTTCA
AGACATGAGA AGCTTATAAA CACTCTTCCC ATGAGCGAAA AAGAGTTTTT GGCTTTTAAC
GACCGCATTG TGTTTTTTGA AGACATGATT TTGCAAGACA AACTTAAACT TATTAAAGCG
AACTTAAGGC TGGTTGTGTC TATCGCCAAA AAGCACGTAA ACTCAAATTT AGAACTTTCC
GACTTAATCC AGGAAGGCGG TTTGGGCCTT ATGAAAGCCG TGGAAAAGTT TGAATATAAA
AGAGGTTTTA AATTCTCAAC CTACGCAACA TGGTGGATAA GGCAGTCCAT TAACCGCGCC
ATCGCGGACC AGGCCAACAC CATACGCATA CCAGTGCATA TGAAGGAGCT TGTTTCCAAA
CTTACAAAAG TTACAAATAA GTTCAGACAG GAACACGGCA GGGAACCCAG CTTGGAAGAT
TATTCAAAAT CACTGCGCCT TTCTATGGAA AAAGTTAAAG GCGTGCTTAA AATAATGCAA
GACCCTATAT CTTTATCAAC ACCCGTAGGC GAGGATGAAG ATTCCAACCT GGAAGACTTT
ATTGAGGATA AAGCGGGCGC TAACCCCACG GTAACCGCGT CTGACTTTTT AAGAAAGCAA
GAAGTATCCG AAGTGCTTAA CACACTTTCA GAACGTGAGG CAAAAATTAT AAGGCTGCGC
TTTGGCATTG ACTCGGGCTA TCCCAGAACA TTAGAAGAAG TGGGTAAAAT GTTTAACGTA
ACACGTGAGC GCGTAAGACA AATTGAGGCA AAAGCCATAC GAAAACTGCG CCACCCGAGC
AGAACCAAAA TGCTTAAAGA TTATTCCGAC GAATAG
 
Protein sequence
MVNVANKAFK DLVEVGKENG FLTLDEINRS LTSSSMSAED IDGLMGTLED LGIQVVDRKK 
YKSVAMAEKE AYTEEFMANP DISNSIRMYL SEMGKVPLLS RDEEITLARN VREREKELRK
LVLESPITMR EICNWEELVD QEEMTTKELM PRGKRTTREL NNMRKKLKEA AKFIGKREGE
ITELMKKLRE PSISDKMLNK YTEALEKKQK EVVDSIIKLN LNQNKIKRLT NRIKSLAQRI
AESRNDLNRF DEYFGQYAEV KKLFTQASKG KISKAELKKK TRFTYEELET AINNIDMIRS
RHEKLINTLP MSEKEFLAFN DRIVFFEDMI LQDKLKLIKA NLRLVVSIAK KHVNSNLELS
DLIQEGGLGL MKAVEKFEYK RGFKFSTYAT WWIRQSINRA IADQANTIRI PVHMKELVSK
LTKVTNKFRQ EHGREPSLED YSKSLRLSME KVKGVLKIMQ DPISLSTPVG EDEDSNLEDF
IEDKAGANPT VTASDFLRKQ EVSEVLNTLS EREAKIIRLR FGIDSGYPRT LEEVGKMFNV
TRERVRQIEA KAIRKLRHPS RTKMLKDYSD E