Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_13480 |
Symbol | groEL |
ID | 7760290 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 1312587 |
End bp | 1314227 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643804250 |
Product | chaperonin GroEL |
Protein accession | YP_002798549 |
Protein GI | 226943476 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02348] chaperonin GroL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.434878 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGCAA AAGAAGTCAA GTTCGGTGAT TCCGCCCGCA AGAAAATGCT GGTAGGCGTG AACGTCCTGG CCGATGCCGT CAAGGCCACT CTCGGCCCGA AAGGCCGCAA CGTGGTCCTG GACAAGAGCT TCGGCGCACC GACCATCACC AAGGATGGCG TATCCGTGGC CAAGGAAATC GAGCTCAAAG ACAAGTTCGA GAACATGGGC GCCCAACTGG TGAAGGACGT GGCCTCCAAG GCCAACGACG AAGCCGGTGA CGGTACCACC ACCGCTACCG TACTGGCCCA GGCCATCGTC AACGAAGGCC TGAAGGCCGT TGCCGCAGGC ATGAACCCGA TGGATCTGAA GCGCGGCATC GATAAGGCGA CCATCGCCAT CGTTGCGGAG CTGAAGTCGC TGGCCAAGCC GTGCTCCGAC TCCAAGGCGA TCGCCCAGGT AGGCACCATT TCCGCCAACT CCGACGAGTC CATCGGTAAC ATCATCGCCG AGGCCATGAA CAAGGTCGGC AAGGAAGGCG TGATCACCGT CGAGGAAGGC TCGGGCCTGG AGAACGAGTT GTCCGTCGTC GAAGGCATGC AGTTCGATCG TGGCTACCTG TCTCCGTACT TCATCAACAA GCCCGACACC ATGGTTGCCG AACTGGACAA CCCGCTGCTG CTGCTGGTCG ACAAGAAGAT CTCCAACATC CGCGAGCTGC TGCCGGTGCT GGAAGCCGTG GCCAAGTCCG GCCGTCCGCT GCTGATCGTC GCCGAAGACG TCGAAGGCGA AGCCCTGGCC ACCCTGGTGG TCAACAACAT GCGTGGCATC GTCAAGGTCG CTGCCGTGAA GGCTCCGGGC TTCGGCGATC GTCGCAAGGC CATGCTGCAG GACATCGCCA TCCTGACCGG CGCCACCGTG ATCTCCGAGG AAGTCGGCCT GAGCTTGGAA AGCGCCACTC TGGAGCATCT GGGTAACGCC AAGCGCGTCG TGCTGAACAA GGAAAACACC ACCATCATCG ACGGTGCCGG TGCCCAGGCC GACATCGAGG CTCGTGTGGC GCAGATCCGC AAGCAGATCG AGGAAACCTC CTCCGATTAC GATCGCGAGA AGCTGCAGGA GCGTCTGGCC AAGCTGGCCG GTGGCGTGGC CGTGATCAAG GTCGGTGCCG CCACCGAGGT CGAGATGAAG GAGAAAAAGG CCCGCGTCGA AGACGCCCTG CATGCGACCC GTGCGGCAGT GGAGGAGGGC GTGGTTCCCG GCGGCGGCGT GGCTCTGGTG CGTGCCCTGC AGGCCATCGA GGGTCTGAAG GGCGACAACG AAGACCAGAA CGTCGGCATC GCCCTGCTAC GTCGTGCCGT CGAAGCACCG CTGCGTCAGA TCGTCGCCAA TGCCGGTGAT GAGCCGAGCG TGGTGGTCGA CAAGGTCAAG CAGGGCTCTG GCAACTTCGG CTTCAACGCT GCCAGCGGCG TATACGGCGA CATGATCGAG ATGGGCATCC TCGATCCGGC CAAGGTTACC CGCTCCGCGC TGCAGGCGGC TTCTTCGATC GGTGGACTGA TGATCACCAC CGAGGCCATG GTCGCCGACA TTGTCGAAGA CAAGGCAGCT CCAGCCATGC CGGACATGGG CGGTATGGGT GGCATGGGCG GCATGATGTA G
|
Protein sequence | MAAKEVKFGD SARKKMLVGV NVLADAVKAT LGPKGRNVVL DKSFGAPTIT KDGVSVAKEI ELKDKFENMG AQLVKDVASK ANDEAGDGTT TATVLAQAIV NEGLKAVAAG MNPMDLKRGI DKATIAIVAE LKSLAKPCSD SKAIAQVGTI SANSDESIGN IIAEAMNKVG KEGVITVEEG SGLENELSVV EGMQFDRGYL SPYFINKPDT MVAELDNPLL LLVDKKISNI RELLPVLEAV AKSGRPLLIV AEDVEGEALA TLVVNNMRGI VKVAAVKAPG FGDRRKAMLQ DIAILTGATV ISEEVGLSLE SATLEHLGNA KRVVLNKENT TIIDGAGAQA DIEARVAQIR KQIEETSSDY DREKLQERLA KLAGGVAVIK VGAATEVEMK EKKARVEDAL HATRAAVEEG VVPGGGVALV RALQAIEGLK GDNEDQNVGI ALLRRAVEAP LRQIVANAGD EPSVVVDKVK QGSGNFGFNA ASGVYGDMIE MGILDPAKVT RSALQAASSI GGLMITTEAM VADIVEDKAA PAMPDMGGMG GMGGMM
|
| |