Gene Cphy_3289 details


Gene Information

Locus tag: Cphy_3289
Symbol:
ID: 5741568
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 4005483
End bp: 4007108
Gene Length: 1626 bp
Protein Length: 541 aa
Translation table: 11
GC content: 40%
IMG OID: 641294390
Product: chaperonin GroEL
Protein accession: YP_001560382
Protein GI: 160881414
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02348] chaperonin GroL
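
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. Both are assumptions about how this record is laid out, not something the record states; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 4005483
end_bp = 4007108

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and adds no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1626, matching "Gene Length"
print(protein_length)   # 541, matching "Protein Length"
```

Under these assumptions the computed values agree with the listed gene length of 1626 bp and protein length of 541 aa.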


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00111576
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTAAGG ATATTAAATA TAGTGCAGAT GCCAGAGTAG CGATGGAAGC TGGTGTAAAC 
AAGTTAGCAA ATACAGTAAG AGTAACTCTA GGACCAAAGG GAAGAAATGT AGTTCTTGAT
AAGTCCTTCG GTGCACCATT AATTACAAAT GATGGTGTTA CTATTGCAAA AGAAATTGAG
TTAGAAGATT CCTTTGAGAA TATGGGCGCT CAGCTTGTAA AAGAAGTTGC TACAAAGACA
AATGATGTAG CAGGTGATGG TACAACAACA GCTACTGTTC TTGCTCAAGC TATGATTAAC
GAAGGTATGA AAAATCTTGC AGCAGGTGCT AATCCGATCA TCTTAAGAAG AGGTATGAAG
AAAGCTACTG ATTGCGCTGT TGAAGCAATT AGCAGTATGA GCTCAGCAAT TAATGGTAAA
GATCAGATTG CTAAGGTTGC TGCAATCTCT GCTGGCGATG ATTCCGTAGG TGAGATGGTT
GCGGATGCTA TGGATAAAGT TAGCAAAGAT GGTGTTATCA CCATTGAAGA GTCTAAGACT
ATGCAGACTG AGCTTGACTT AGTAGAAGGT ATGCAATTTG ACCGTGGATA TGTTTCCGCA
TATATGGCGA CTGATATGGA TAAGATGGAA GCAAATCTAG ATAATCCATA TATTTTAATC
ACAGATAAGA AGATCAGCAA CATTCAGGAG ATCCTTCCTG TTCTTGAGCA GATTGTTCAA
AGTGGATCCA GATTATTAAT CATCGCTGAA GATATCGAAG GCGAAGCTTT AACAACATTA
GTAATCAATA AGTTAAGAGG GACATTCACT GTTGTTGGTG TTAAGGCGCC AGGTTATGGT
GATAGAAGAA AGGCTATGTT ACAAGATATC GCTATTTTAA CTGGTGGTAC TGTTATCTCT
GATGAACTTG GCCTTGACTT AAAAGAAGCT ACATTAGATC AGCTTGGTCG TGCAAAATCC
GTTAAAATTC AGAAAGAAAA CACTATCATT GTTGATGGTG AAGGAAATAA AGCAGAAATC
GAAGCTAGAA TTTCTCAGAT TAAGGCTCAG ATTGCTGAAA CAACATCAGA ATTTGATAAA
GAAAAATTAC AGGAGAGACT TGCTAAACTT GCAGGTGGTG TAGCTGTAAT TCGTGTTGGT
GCTGCAACAG AGACTGAGAT GAAAGAGAAG AAGCTTCGTA TGGAAGATGC TTTAGCAGCT
ACAAGAGCAG CTGTGGAAGA AGGTATTATC GCAGGTGGCG GTTCTGCTTA CATCCATGCA
TCTAAGGAAG TTGCTAAACT TGCTGCTAAA TTAGAAGGTG ATGAGAGAAC TGGTGCACAG
ATTATATTAA AAGCATTAGA AGCTCCATTA TCATGCATCG CTCAAAACGC TGGTTTAGAA
GGCGCTGTTA TTGTTAACAA GGTTAGAGAA AAGAAAACAG GTGTTGGTTT CAATGCCCTA
ACTGAGAAGT ATGTAGATAT GGTAGAAGAC GGAATTCTTG ATCCTTCTAA GGTTACAAGA
AGTGCTCTTC AGAATGCAAC CAGTGTTGCT TCTACATTCT TAACAACAGA AGCTGCAGTT
GCATCCATTA AAGAACCAGC TCCAGCTATG CCAGCAGGCG GCCCTGGCGG AATGGGTATG
ATGTAA
 
Protein sequence
MAKDIKYSAD ARVAMEAGVN KLANTVRVTL GPKGRNVVLD KSFGAPLITN DGVTIAKEIE 
LEDSFENMGA QLVKEVATKT NDVAGDGTTT ATVLAQAMIN EGMKNLAAGA NPIILRRGMK
KATDCAVEAI SSMSSAINGK DQIAKVAAIS AGDDSVGEMV ADAMDKVSKD GVITIEESKT
MQTELDLVEG MQFDRGYVSA YMATDMDKME ANLDNPYILI TDKKISNIQE ILPVLEQIVQ
SGSRLLIIAE DIEGEALTTL VINKLRGTFT VVGVKAPGYG DRRKAMLQDI AILTGGTVIS
DELGLDLKEA TLDQLGRAKS VKIQKENTII VDGEGNKAEI EARISQIKAQ IAETTSEFDK
EKLQERLAKL AGGVAVIRVG AATETEMKEK KLRMEDALAA TRAAVEEGII AGGGSAYIHA
SKEVAKLAAK LEGDERTGAQ IILKALEAPL SCIAQNAGLE GAVIVNKVRE KKTGVGFNAL
TEKYVDMVED GILDPSKVTR SALQNATSVA STFLTTEAAV ASIKEPAPAM PAGGPGGMGM
M
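
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the method used to produce the record: it builds the standard codon assignments (translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as starts), strips the spaces used to group the sequence into blocks of ten, and translates codon by codon. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first and last rows of the gene sequence are used as input.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 shares these assignments; only start codons differ.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for block formatting and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGGCTAAGG ATATTAAATA TAGTGCAGAT GCCAGAGTAG CGATGGAAGC TGGTGTAAAC"
last_row = "ATGTAA"

print(translate(first_row))  # MAKDIKYSADARVAMEAGVN
print(translate(last_row))   # M*
```

The first row translates to the first twenty residues of the protein sequence, and the last row gives the final methionine followed by the stop codon. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the listed GC content.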