Gene Emin_0924 details


Gene Information

Locus tag: Emin_0924
Symbol:
ID: 6262628
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 1024431
End bp: 1026059
Gene Length: 1629 bp
Protein Length: 542 aa
Translation table: 11
GC content: 44%
IMG OID: 642611403
Product: chaperonin GroEL
Protein accession: YP_001875814
Protein GI: 187251332
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02348] chaperonin GroL
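
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a single terminal stop codon; neither assumption is stated in the record, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 1024431
END_BP = 1026059
GENE_LENGTH_BP = 1629
PROTEIN_LENGTH_AA = 542


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print("span length (bp):", span_length(START_BP, END_BP))
    print("coding codons:", coding_codons(GENE_LENGTH_BP))
```

Under these assumptions, the span computed from the coordinates agrees with the listed gene length, and the codon count agrees with the listed protein length.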


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00446655
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 9.722580000000001e-19
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGGCAAAAC AGATTGTATA TGGTGACGAA GCCAGGGCTA AAATGAAAGC CGGCATTGAA 
AAAGTTGCAA AAGCTGTAAG CGTTACGTTA GGGCCTAAAG GAAGAAGCGT TGTATTGGAA
AAGAAGTTTG GCTCGCCTTT GATTATTGAC GACGGCGTAA CAATAGCTAA AGATATTGAA
CTTGAAGATA AATTTGAAAA CATGGGCGCG CAGTTAATTC GCGAAGTCGC CTCAAAAACC
AATGATATTG CGGGTGACGG CACCACAACG GCAACCGTTT TAACGCACGC AATTTTAACG
GAAGGTATTA AAAATATTAC AGCGGGTGCA AACCCTACTT TAGTTAAAAA AGGTATTGAA
ATGGCTGTGG AAACAGTTAA AGAAGAACTT AAAAAAATGC AGCGCCCCGT AGAAACAAAA
GAAGAAAAAG CACAAATCGC CACAATTTCA GCCAATGACC GTATGGTAGG TGAACTTATT
GCCGAAGCTA TGGAAAAAGT GGGCCACGAA GGCGTTATCA CCGTTGAAGA AGGCAAAACA
GCAACAACTG AACTTCAGGT TGTTGAAGGT ATGCAGTTTG ACCGCGGTTA TATCTCACCT
TATTTTGTAA CCGATTCCGA AAGAATGGAA TGCGTGTTGG AAGACTGTCA AATTATTTTA
GCCGATAAAA AAGTTTCTTC AATGAACGAA CTTTTACCCT TACTTGAAGG CATTGTTAAA
AACGGCCGCA ACTTCTTAAT AATAGCCGAA GACGTTGACG GCGAAGCCCT TGCCACATTA
GTTGTTAACA GGCTTAGAGG CACATTAAAA GGTTGCGCGG TTAAAGCCCC CGGCTTTGGA
GACAGACGCA AAGAAATGCT TGAAGATATA GCCATTTTAA CAGGCGGCCA GGTAATCGCT
GAAGAACGCG GCATGAAGCT TGAAACAGCC ACTTTAGATA TGCTCGGTTC AGCAAAAAGA
GTTGTTATCG ATAAAGAAAA CGCCACAATC GTAAGCGGCG AAGGCGACAA GAAAAAAATT
GAAGCGAGAG CCGAACAAAT AAGAAAACAA ATCGAAAACT CAACCTCAGA TTACGATAAG
GAAAAATTAC AGGAACGCCT TGCAAAACTT TCCGGCGGCG TAGCTGTTAT CAGTGTAGGC
GCGGCTACAG AAACGGAAAT GAAAGCCAAA AAAGCTAAAG TTGAAGACGC TAAAAACGCC
ACAAAAGCGG GTGTTGAAGA AGGCTTAATC CCAGGCGGCG GCGTGGCTTT AACAAGATGC
GAAGGCGCGG TCGGCAAATT AAAAGCCGAT AACGAAGATG TACAGACAGG TATTAACATC
GTTAAGAAAG CTCTTACCGC TCCGTTATAT CAAATTGCGT TTAACGCCGG CTTGGATGGT
TCCGTAGTTG TTGAAAATGT ACGCAACGCT AAAGGAAACC AAGGTTTTGA CGCTGACACC
GGCGAATATG TTGACATGAT TAAAGCCGGC GTTGTTGACG CTGTTAAAGT TGTCCGAATA
GGGCTTGAAA ACGCGGCCTC AATAGCCGCG ACAGTGCTTT TAACTGAAGC GCTTGTAGCC
GACATTCCTG AGGAAAAGGG CGCGGCCCCC ATGGGGCACC CCGGTATGGG CGGTATGGGC
ATGATGTAA
 
Protein sequence
MAKQIVYGDE ARAKMKAGIE KVAKAVSVTL GPKGRSVVLE KKFGSPLIID DGVTIAKDIE 
LEDKFENMGA QLIREVASKT NDIAGDGTTT ATVLTHAILT EGIKNITAGA NPTLVKKGIE
MAVETVKEEL KKMQRPVETK EEKAQIATIS ANDRMVGELI AEAMEKVGHE GVITVEEGKT
ATTELQVVEG MQFDRGYISP YFVTDSERME CVLEDCQIIL ADKKVSSMNE LLPLLEGIVK
NGRNFLIIAE DVDGEALATL VVNRLRGTLK GCAVKAPGFG DRRKEMLEDI AILTGGQVIA
EERGMKLETA TLDMLGSAKR VVIDKENATI VSGEGDKKKI EARAEQIRKQ IENSTSDYDK
EKLQERLAKL SGGVAVISVG AATETEMKAK KAKVEDAKNA TKAGVEEGLI PGGGVALTRC
EGAVGKLKAD NEDVQTGINI VKKALTAPLY QIAFNAGLDG SVVVENVRNA KGNQGFDADT
GEYVDMIKAG VVDAVKVVRI GLENAASIAA TVLLTEALVA DIPEEKGAAP MGHPGMGGMG
MM
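
The gene and protein sequences above are related by translation, and the GC content is a simple base count. The sketch below shows one way to recompute both from the nucleotide text in plain Python. It assumes the whitespace between 10-base blocks is only formatting, and it uses the standard codon assignments, which translation table 11 shares for elongation codons (table 11 differs mainly in permitted start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence above.
    first_blocks = "ATGGCAAAAC AGATTGTATA TGGTGACGAA"
    print(translate(first_blocks))  # MAKQIVYGDE, the first protein block
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the listed protein sequence and the 44% GC content field.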