Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4150 |
Symbol | groEL |
ID | 5901612 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4501482 |
End bp | 4503128 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641564671 |
Product | chaperonin GroEL |
Protein accession | YP_001685772 |
Protein GI | 167648109 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02348] chaperonin GroL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00153421 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.471865 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGCTA AAGACGTTTA TTTCTCCTCG GACGCGCGCG ACAAGATGCT GCGCGGCGTC AACATCCTCG CCAACGCGGT GAAGGTGACC CTCGGCCCCA AGGGCCGCAA CGTCGTGATC GAAAAGTCCT TCGGCGCCCC GCGCTCGACC AAGGACGGCG TCTCGGTCGC CAAGGAAATC GAACTGGCTG ACCGTTTCGA GAACCTCGGC GCGCAGATGA TCCGCGAAGT CGCCAGCAAG ACCAACGACA AGGCCGGCGA CGGCACCACC ACCGCCACCG TCCTGGCCCA AGCCATCGTG CAAGAAGGCC TCAAGTCGGT CGCCGCCGGC ATGAACCCGA TGGACCTGAA GCGCGGCATC GACAAGGCCG TCCACGTCGT CGTCGACTCC ATCAAGGCGT CGTCGAAGAA GGTCACCACC AACAACGAGA TCGCCCAGGT CGGCACCATC TCGGCCAACG GTGACAAGGA CGTCGGCGAG ATGATCGCCA AGGCCATGGA CAAGGTCGGC AACGAAGGCG TCATCACCGT CGAGGAAGCC AAGACCGCCG AGACCGAACT CGACGTCGTC GAAGGCATGC AGTTCGACCG CGGCTACCTG TCGCCGTACT TCATCACCAA CGCCGACAAG ATGGAAGTTC AGCTCGAAGA GCCGCTCATC CTGCTGTTCG AAAAGAAGCT CTCCTCGCTG CAGCCGCTGC TGCCGGTGCT GGAAGCCGTC GTCCAGTCGG GCCGTCCGCT GGTGATCATC GCCGAGGACG TCGAGGGCGA AGCCCTGGCC ACCCTGGTGG TCAACAAGCT GCGCGGCGGT CTCCGCGTCG CCGCCGTCAA GGCTCCGGGC TTCGGCGATC GCCGCAAGGC CATGCTGGAA GACATCGCCA TCCTGACCGG CGCCCAAGTG ATCAGCGAAG ACCTCGGCAT CAAGCTCGAG AACGTTTCGC TCGACATGCT GGGCAAGGCC AAGAAGGTCT CGATCACCAA GGACGACACC ACCATCGTGG ACGGCGTCGG TGAGAAGACC GCGATCGAAG CCCGCATCGG CCAGATCAAG AAGCAGATCG AAGACACCAC CTCGGACTAC GACAAGGAAA AGCTGCAAGA GCGTCTGGCC AAGCTGGCCG GCGGCGTCGC GGTCATCCGC GTCGGCGGCT CGACCGAAGT CGAAGTGAAG GAAAAGAAGG ACCGCGTCGA CGACGCCCTC AACGCGACCC GCGCGGCCGT GGAAGAAGGC ATCGTCCCGG GCGGCGGCGT GGCTCTGCTG AAGGCCTCCA AGGCCCTGGC GACCCTGGTC GGCGACAATG ACGACCAGAC CGCCGGCATC GCGATCGTCC GTCGCGCCCT GCAGGCTCCG ATCCGTCAGA TCGCCGAGAA CGCCGGCGTC GAAGGCTCGA TCGTGGTCGG CAAGATCCTG GAAAACGACA GCCCGACCTT CGGCTTCAAC GCCCAGACCG AGCAGTATGT CGACCTGATC GCCGACGGCG TCATCGACCC CGCCAAGGTG GTCCGCACCG CCTTGCAGGA CGCCGCCTCG GTGGCCGGCC TGCTGATCAC CACGGAAGCG GCCATCGTCG AAGCCCCCAA GAAGGGCGGC GGCGCTGGCG GCCCTCCCGG CGGCGGCATG GGCGGCATGG GCGACATGGA CTTCTAA
|
Protein sequence | MAAKDVYFSS DARDKMLRGV NILANAVKVT LGPKGRNVVI EKSFGAPRST KDGVSVAKEI ELADRFENLG AQMIREVASK TNDKAGDGTT TATVLAQAIV QEGLKSVAAG MNPMDLKRGI DKAVHVVVDS IKASSKKVTT NNEIAQVGTI SANGDKDVGE MIAKAMDKVG NEGVITVEEA KTAETELDVV EGMQFDRGYL SPYFITNADK MEVQLEEPLI LLFEKKLSSL QPLLPVLEAV VQSGRPLVII AEDVEGEALA TLVVNKLRGG LRVAAVKAPG FGDRRKAMLE DIAILTGAQV ISEDLGIKLE NVSLDMLGKA KKVSITKDDT TIVDGVGEKT AIEARIGQIK KQIEDTTSDY DKEKLQERLA KLAGGVAVIR VGGSTEVEVK EKKDRVDDAL NATRAAVEEG IVPGGGVALL KASKALATLV GDNDDQTAGI AIVRRALQAP IRQIAENAGV EGSIVVGKIL ENDSPTFGFN AQTEQYVDLI ADGVIDPAKV VRTALQDAAS VAGLLITTEA AIVEAPKKGG GAGGPPGGGM GGMGDMDF
|
| |