Gene YpsIP31758_3675 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_3675 
SymbolgroEL 
ID5387916 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp4142801 
End bp4144447 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content51% 
IMG OID640866698 
Productchaperonin GroEL 
Protein accessionYP_001402629 
Protein GI153950695 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGCTA AAGACGTAAA ATTCGGTAAC GACGCTCGCA TTAAAATGCT ACGCGGCGTA 
AACATCCTTG CTGATGCAGT GAAAGTGACT CTGGGCCCTA AAGGCCGTAA CGTAGTTCTG
GATAAGTCTT TCGGTTCTCC AACGATCACT AAAGACGGTG TTTCTGTTGC ACGTGAAATC
GAACTGGAAG ACAAGTTCGA GAACATGGGC GCACAGATGG TTAAAGAAGT TGCCTCTAAA
GCGAATGACG CTGCGGGTGA CGGTACCACG ACTGCAACAG TATTGGCTCA ATCCATCATC
ACTGAAGGCC TGAAAGCAGT TGCCGCAGGC ATGAACCCAA TGGATCTGAA GCGCGGCATC
GACAAAGCCG TTATCGCAGC GGTAGAAGAG CTGAAAAAAC TGTCTGTACC TTGCTCTGAT
TCCAAAGCGA TTGCTCAGGT CGGTACCATC TCTGCAAACT CCGACTCAAC CGTTGGCGAA
CTGATCGCTC AAGCGATGGA AAAAGTCGGT AAAGAAGGCG TTATCACCGT TGAAGAAGGT
TCAGGCCTGC AAGACGAGTT GGACGTTGTA GAAGGTATGC AGTTCGATCG CGGCTACCTG
TCTCCTTACT TCATCAATAA ACCAGAAACT GGCTCTATTG AACTTGAAAG CCCATTCATT
CTGTTGGCTG ACAAGAAAAT CTCTAACATC CGTGAAATGC TGCCAGTTCT GGAAGCCGTA
GCGAAAGCCG GCAAGCCACT GCTGATCATT GCAGAAGACG TTGAAGGCGA AGCCCTGGCG
ACTCTGGTAG TGAACACCAT GCGCGGTATC GTTAAAGTTG CAGCGGTTAA AGCACCTGGC
TTCGGCGACC GTCGTAAAGC GATGCTGCAA GACATCGCGA CCCTGACTGC GGGTACTGTT
ATCTCTGAAG AGATCGGTCT GGAACTGGAA AAAACCACTC TGGAAGATCT GGGTCAGGCG
AAACGTGTTG TTATCAACAA AGACACCACC ATCATTATTG ATGGCGTAGG CGATGAAGCG
GCAATCCAAG GCCGTGTGGC TCAGATTCGT CAGCAGATTG AAGATGCCAC TTCTGACTAC
GACAAAGAAA AACTGCAAGA GCGTGTTGCT AAACTGGCTG GCGGCGTTGC CGTTATCAAA
GTGGGCGCCG CAACTGAAGT TGAAATGAAA GAGAAGAAAG CACGCGTTGA AGATGCTCTG
CACGCAACTC GTGCCGCAGT AGAAGAAGGC GTCGTAGCGG GTGGTGGTGT TGCACTGATT
CGTGCAGCTC ACGCTATCGC TGGCCTGAAA GGCGATAACG AAGACCAGAA CGTGGGTATT
AAAGTGGCTC TGCGTGCGAT GGAATCTCCA CTGCGTCAGA TCGTGGTTAA CGCGGGTGAA
GAAGCCTCTG TTATCGCGAA CAAAGTTAAA GCGGGTGAAG GTAGCTTCGG TTACAACGCT
TATACTGAAG AATACGGCGA CATGATCGCG ATGGGTATCT TGGATCCAAC TAAAGTGACG
CGTTCTGCTC TGCAGTACGC AGCCTCTATT GCTGGTCTGA TGATTACCAC CGAGTGCATG
GTTACTGACC TGCCACGTGA TGACAAAGGT GCTGATATGG GCGCTGGCGG CATGGGTGGT
ATGGGCGGCA TGGGCGGCAT GATGTAA
 
Protein sequence
MAAKDVKFGN DARIKMLRGV NILADAVKVT LGPKGRNVVL DKSFGSPTIT KDGVSVAREI 
ELEDKFENMG AQMVKEVASK ANDAAGDGTT TATVLAQSII TEGLKAVAAG MNPMDLKRGI
DKAVIAAVEE LKKLSVPCSD SKAIAQVGTI SANSDSTVGE LIAQAMEKVG KEGVITVEEG
SGLQDELDVV EGMQFDRGYL SPYFINKPET GSIELESPFI LLADKKISNI REMLPVLEAV
AKAGKPLLII AEDVEGEALA TLVVNTMRGI VKVAAVKAPG FGDRRKAMLQ DIATLTAGTV
ISEEIGLELE KTTLEDLGQA KRVVINKDTT IIIDGVGDEA AIQGRVAQIR QQIEDATSDY
DKEKLQERVA KLAGGVAVIK VGAATEVEMK EKKARVEDAL HATRAAVEEG VVAGGGVALI
RAAHAIAGLK GDNEDQNVGI KVALRAMESP LRQIVVNAGE EASVIANKVK AGEGSFGYNA
YTEEYGDMIA MGILDPTKVT RSALQYAASI AGLMITTECM VTDLPRDDKG ADMGAGGMGG
MGGMGGMM