Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_4327 |
Symbol | groEL |
ID | 4245979 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 6670078 |
End bp | 6671712 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 638109215 |
Product | chaperonin GroEL |
Protein accession | YP_723793 |
Protein GI | 113477732 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02348] chaperonin GroL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.645492 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTAAAC GTATTATCTA CAACGAAAAT GCTCGTCGCG CCTTAGAAAA AGGTATGGAT ATTCTTGCGG AGTCTGTAGC AGTAACACTC GGTCCTAAAG GTCGAAACGT TGTTCTTGAG AAAAAATTTG GCGCACCCCA AATTGTTAAT GATGGTGTAA CAATTGCTAA AGAAATTGAA TTGGAAGACC ATGTAGAAAA TACTGGTGTT TCCCTAATTC GTCAAGCAGC TTCCAAAACT AACGATGCAG CAGGTGATGG TACAACCACT GCAACTGTAC TTGCTCATGC AATGGTAAAA GAAGGTCTAC GGAACGTGGC TGCTGGCGCA AACCCTATTG CTCTCAAGCG TGGTATTGAC AAAGCTGCTG GTTTTCTAGT AGAAAAAATT GCTGAACATG CTCGTCAAAT TGAAGATTCC AAGGCGATAG CTCAAGTTGG TGCTATTTCT GCTGGTAATG ATGAAGAAGT AGGTAAAATG ATTGCCGAAG CCATGGATAA AGTGGGTAAA GAAGGTGTGA TTTCTCTTGA AGAAGGAAAG TCAATGCAGA CTGAGTTGGA AATTACTGAA GGTATGCGCT TTGACAAGGG TTACATCTCT CCCTACTTCG CAACAGACAT GGAACGGATG GAAGCTTCTC TTGAAGAACC TCAGATTCTG ATAACTGATA AGAAAATTGC TTTAGTACAA GACTTAGTAC CAGTATTAGA ACAAGTTGCT CGTTCTGGCA AACCATTATT AATATTGGCT GAAGATATTG AGAAAGAAGC TTTGGCAACC CTAGTTGTTA ACCGTTTACG GGGTGTAGTG AATGTTGCTG CAGTTAAAGC TCCTGGTTTT GGGGATCGTC GTAAAGCTAT GTTAGAAGAT ATCGCGGTAT TAACTGGCGG TCAAGTTATC ACTGAAGATG CTGGCTTGAA ACTAGAAAAT GCCAAGTTAG ATATGCTTGG TAAAGCACGT CGCATTACGA TCACTAAAGA TAATACTACT ATCGTTGCTG AAGGTAACGA AAAAGAAGTT AAAGCACGTT GCGAACAAAT TCGTCGTCAA ATGGATGAAA CTGACTCTTC CTACGACAAA GAGAAATTAC AAGAGCGTTT AGCTAAATTA GCTGGTGGTG TAGCGGTTGT TAAAGTTGGT GCTGCTACTG AAACAGAAAT GAAAGATCGC AAGTTACGCT TGGAAGATGC TATCAACGCA ACTAAAGCTG CTGTAGAAGA AGGTATCGTT CCTGGTGGTG GTACAACTCT AGCGCACTTA GCTCCTGAGT TGGAAACTTG GGCAAATGAA AATCTACAAT CTGAAGAGTT AACTGGTTCT TTAATTGTTA GTCGTGCTTT ATTAGCTCCA CTGAAGCGCA TTGCTGAAAA TGCTGGGCAA AATGGTGCTG TAATTGGCGA ACGGGTGAAA GAGAAGGATT TCAATACTGG TTTTAATGCA GCAAACAATG AGTTTGTTGA TATGTTTGAA GCTGGTATTG TTGACCCAGC GAAGGTAACT CGTTCGGCTC TACAAAACGC TGCTTCTATT GCTGGTATGG TGTTAACAAC TGAGTGTATT GTTGTTGACA AACCTGAGCC TAAGGAAAAC GCACCTGCTG GCGCTGGTAT GGGCGGTGGC GATTTCGATT ACTAA
|
Protein sequence | MAKRIIYNEN ARRALEKGMD ILAESVAVTL GPKGRNVVLE KKFGAPQIVN DGVTIAKEIE LEDHVENTGV SLIRQAASKT NDAAGDGTTT ATVLAHAMVK EGLRNVAAGA NPIALKRGID KAAGFLVEKI AEHARQIEDS KAIAQVGAIS AGNDEEVGKM IAEAMDKVGK EGVISLEEGK SMQTELEITE GMRFDKGYIS PYFATDMERM EASLEEPQIL ITDKKIALVQ DLVPVLEQVA RSGKPLLILA EDIEKEALAT LVVNRLRGVV NVAAVKAPGF GDRRKAMLED IAVLTGGQVI TEDAGLKLEN AKLDMLGKAR RITITKDNTT IVAEGNEKEV KARCEQIRRQ MDETDSSYDK EKLQERLAKL AGGVAVVKVG AATETEMKDR KLRLEDAINA TKAAVEEGIV PGGGTTLAHL APELETWANE NLQSEELTGS LIVSRALLAP LKRIAENAGQ NGAVIGERVK EKDFNTGFNA ANNEFVDMFE AGIVDPAKVT RSALQNAASI AGMVLTTECI VVDKPEPKEN APAGAGMGGG DFDY
|
| |