Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_2175 |
Symbol | groEL |
ID | 3906775 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 2548390 |
End bp | 2550015 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637879508 |
Product | chaperonin GroEL |
Protein accession | YP_481274 |
Protein GI | 86740874 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02348] chaperonin GroL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.755929 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.503349 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAAAGA TCATCGCTTA CGAGGAGGAA GCACGGCGCG GGCTGGAACG TGGCATGAAC CAGCTGGCCG GCGCCGTGAA GGTGACGCTC GGTCCGAAGG GGCGCAACGT CGTCCTGGAG AAGAAGTGGG GCGTTCCCGC CATCACCAAT GACGGCGTCT CCATAGCCAG GGAGATCGAG CTCGAAGATC CGTATGAGAA GATCGGTGCC GAGATGGTCA AGGAGGTCGC CAAGAAGACC GACGAGGTGG CCGGCGACGG CACCACGACG GCGACCGTTC TGGCGGAGGC ACTGGTCCAT GAGGGCCTGC GGAACGTGGC CGCGGGCGCG AACCCGATCG CCCTCAAACG TGGCATCGAG CTGGCTGTCG AGCGGGTCTG TGGGGAACTG GCCAATCTGT CCAGGGAGCT GGAAACCAAG GATCAGATCG CCTCAACGGC CTCGATCTCG GCCGGCGGGG ACACCGCGAT CGGCCAGATC ATCGCCGAGG CGATGGACAA GGTCGGCCGG GACGGCGTCA TCACCGTCGA GGAGAGCAAC ACCTTCGGCC TCGAGCTGGA GCTCACCGAA GGTATGCGTT TCGACAAGGG CTACATCTCG CCGTACTTCA TCACTGATCA GGAGCGGATG GAGTGCGTCC TGGAGGACCC CTACATCCTG GTCGCCAACA TCAAGATTTC GCTGGTCAAG GACCTGCTCC CGCTGTTGGA GAAGGTCATG CAGGCCGGCA GGCCGCTGCT GGTCATCGCC GAGAACGTTG AGGGGGAGGC CCTGGCGACC CTGGTCGTCA ACAAGATCCG CGGTACGTTC CGGTCCGTGG CCGTGAAGGC GCCGGGTTTC GGCGAGCGGC GCAAGGCCAT GCTCGGCGAT ATCGCCGTTC TGACGGGCAG TCAGGTGATC AGTGAGGAGG TTGGTCTCAG GCTGGAGAAC GCCGACCTCG ACCTGCTTGG CCGGGCCCGC AAGGTTGTCG TTACCAAGGA TGACACGACC ATTATCGAGG GCGCCGGCGA CCCGGGCCGG ATCGCCGGTC GGGTCAGCCA GATCCGTAGC GAGATCGAGA AGTCGGACTC CGACTACGAT CGCGAGAAGC TGCAGGAGCG GCTGGCCAGG CTCGCCGGTG GCGTGGCCGT CATCAAAGCC GGCGCGGCCA CCGAGGTCGA GCTCAAGGAG CGTAAGCACC GCATCGAGGA CGCGGTCCGC AACGCGAAGG CCGCCGTTGA GGAGGGCATC GTCCCCGGCG GTGGGGTGGC TCTGCTGCTG GCCTCGGGGG CTGTCTTCGA CGGGCTGGAG GTGGCTGAGG ACGAGCGGAC CGGGGCCGAG ATGGTGCGCC GCGCGTTGAC CGAGCCGCTC CGGCAGATCG CGGTCAATGC CGGCCTGGAA GGCGGCGTCG TGGTCGAGAA GGTCCGCAAC CTGCAACCGG GGTGGGGGCT GGACGCCGCC ACCGGCGAGC ACGTCAACAT GCTCGAGGCC GGGATCATCG ACCCGACCAA GGTCACCCGC TCCGCCCTGC AGAATGCCGC ATCCATCGCC GGGCTGTTCC TCACCACCGA GGCCGTCGTT GCCGAAAAGC CAGAGGAAAA GGAAACCGCG GCAGCGCCAG CTGGTGGGGG TGGCCTGGAG TACTGA
|
Protein sequence | MPKIIAYEEE ARRGLERGMN QLAGAVKVTL GPKGRNVVLE KKWGVPAITN DGVSIAREIE LEDPYEKIGA EMVKEVAKKT DEVAGDGTTT ATVLAEALVH EGLRNVAAGA NPIALKRGIE LAVERVCGEL ANLSRELETK DQIASTASIS AGGDTAIGQI IAEAMDKVGR DGVITVEESN TFGLELELTE GMRFDKGYIS PYFITDQERM ECVLEDPYIL VANIKISLVK DLLPLLEKVM QAGRPLLVIA ENVEGEALAT LVVNKIRGTF RSVAVKAPGF GERRKAMLGD IAVLTGSQVI SEEVGLRLEN ADLDLLGRAR KVVVTKDDTT IIEGAGDPGR IAGRVSQIRS EIEKSDSDYD REKLQERLAR LAGGVAVIKA GAATEVELKE RKHRIEDAVR NAKAAVEEGI VPGGGVALLL ASGAVFDGLE VAEDERTGAE MVRRALTEPL RQIAVNAGLE GGVVVEKVRN LQPGWGLDAA TGEHVNMLEA GIIDPTKVTR SALQNAASIA GLFLTTEAVV AEKPEEKETA AAPAGGGGLE Y
|
| |