Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_2886 |
Symbol | groEL |
ID | 4444443 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 3250565 |
End bp | 3252175 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639690709 |
Product | chaperonin GroEL |
Protein accession | YP_832365 |
Protein GI | 116671432 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02348] chaperonin GroL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00686344 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAAGC AGCTTGCGTT TAACGACGCT GCCCGCCGGT CGCTTGAAGC CGGCATCGAT AAGCTCGCCA ACACTGTCAA GGTGACGCTT GGCCCGCGCG GCCGCAACGT CGTGCTGGAC AAGAAGTGGG GCGCTCCCAC CATTACGAAC GACGGCGTGA CCATCGCCCG CGAAGTCGAA CTGGATGACC CGTTCGAGAA CCTCGGCGCG CAGCTGGCCA AGGAAGTCGC CACCAAGACC AACGATGTTG CCGGCGACGG CACCACCACC GCCACCGTGC TGGCACAGGC ACTGGTCAAG GAAGGCCTGC GCAACGTGGC GGCTGGCGCC GCCCCGGGCC AGATCAAGCG CGGCATCGAG GTTTCCGTCG AAGCCGTCGC AGCACGCCTG CTGGAGAACG CCCGCCCTGT CGAAGGCTCC CAGGTTGCGA ACGTTGCAGC CATCTCCGCC CAGAGCGACG AGATCGGCGA GCTCCTGGCC GAGGCTTTCG GCAAGGTCGG CAAGGATGGT GTGATCACCA TCGAGGAGTC CTCCACCACG CAGACCGAGC TCGTCCTCAC CGAGGGCATG CAGTTCGACA AGGGCTACCT TTCCCCGTAC TTCGTCACCG ACGCGGAACG CCAGGAGGCA GTCCTCGAAG ACGCCCTTAT CCTGATCAAC CAGGGCAAGA TCTCCTCGGT GCAGGAATTC CTGCCCCTCC TGGAGAAAGC GCTGCAGAGC TCCAAGCCGC TGTTCATCAT TGCCGAGGAC GTCGAGGGCG AGGCCCTGTC CACGCTCATC GTCAACCGCA TCCGCGGCAC CCTGAACGTC GTTGCCGTCA AGGCTCCGGG CTTCGGTGAC CGCCGCAAGG CCATGCTGCA GGACATCGCC ACCCTCACCG GTGCGCAGGT TGTCTCCCCG GAACTGGGCC TCAGCCTTGA TTCCGTTGGC CTCGAGGTGC TGGGCACGGC CCGCCGCATC ACGGTGACCA AGGACAACAC CACCATTGTT GACGGCGCCG GCACGGCCGA GGACGTAGCG GCACGCGTTG CCCAGCTGCG CGCCGAGCTG ACCCGCACCG ACTCCGACTG GGACAAGGAA AAGCTCCAGG AGCGCCTGGC CAAGCTGGCC GGCGGCATCG GTGTGATCAA GGTTGGCGCA GCCACCGAGG TGGAGCTGAA GGAAAAGAAG CACCGCATCG AGGACGCAGT CTCCTCCACC CGCGCTGCCC TCGAAGAAGG CATCGTTGCC GGTGGCGGTT CGGCCCTCAT CCACGCCCTG AAGGCGCTGG ACGAGGACCC TGCAGTCACC GCACTCGAAG GCGATGCAGC CTCGGCTGTG GGCATCGTTC GCCGGGCACT CGTCCAGCCG CTGCGCTGGA TCGCCCAGAA CGCCGGTTTC GACGGCTACG TCGTTGCCGC CAAGGTTGCC GAGTCGGCTG TCAACCAGGG CTTCAACGCC AAGAGCGGCG ATTACGAAGA CCTGATCGCT GCAGGCGTCA TCGACCCCGT GAAGGTGACG CGCGCAGCCC TCCGCAACGC CGCTTCCATC GCAGCGCTGG TTCTCACCAC AGAGACCCTC GTTGTCGAGA AGCCGGCCGA CGAGGACGAG CACGCTGGCC ACAAGCACTA G
|
Protein sequence | MAKQLAFNDA ARRSLEAGID KLANTVKVTL GPRGRNVVLD KKWGAPTITN DGVTIAREVE LDDPFENLGA QLAKEVATKT NDVAGDGTTT ATVLAQALVK EGLRNVAAGA APGQIKRGIE VSVEAVAARL LENARPVEGS QVANVAAISA QSDEIGELLA EAFGKVGKDG VITIEESSTT QTELVLTEGM QFDKGYLSPY FVTDAERQEA VLEDALILIN QGKISSVQEF LPLLEKALQS SKPLFIIAED VEGEALSTLI VNRIRGTLNV VAVKAPGFGD RRKAMLQDIA TLTGAQVVSP ELGLSLDSVG LEVLGTARRI TVTKDNTTIV DGAGTAEDVA ARVAQLRAEL TRTDSDWDKE KLQERLAKLA GGIGVIKVGA ATEVELKEKK HRIEDAVSST RAALEEGIVA GGGSALIHAL KALDEDPAVT ALEGDAASAV GIVRRALVQP LRWIAQNAGF DGYVVAAKVA ESAVNQGFNA KSGDYEDLIA AGVIDPVKVT RAALRNAASI AALVLTTETL VVEKPADEDE HAGHKH
|
| |