Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_0035 |
Symbol | groEL |
ID | 3916038 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 35710 |
End bp | 37353 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640442760 |
Product | chaperonin GroEL |
Protein accession | YP_495318 |
Protein GI | 87198061 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02348] chaperonin GroL |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0226788 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGCCA AGGACGTAAA GTTCGGCCGC GACGCTCGTG AACGCATTCT GCGCGGCGTC GACATCCTCG CTGATGCCGT GAAGGTCACG CTGGGCCCCA AGGGCCGCAA CGTCGTGATC GACAAGAGCT TCGGCGCCCC CCGCATCACC AAGGACGGTG TTTCGGTCGC CAAGGAAATC GAGCTCAAGG ACAAGTTCGA GAACATGGGC GCCCAGATGC TGCGCGAAGT GGCCTCGAAG GCCAACGACG CGGCCGGTGA CGGCACCACC ACCGCGACCG TTCTTGCACA GGCGATCGTT CGCGAAGGCA TGACCGCGGT TGCCGCCGGC ATGAACCCGA TGGACCTCAA GCGCGGCATC GACATCGCCG TCGGCAAGGT CGTCGAGAAC CTCAAGGCCC GTTCGACCCC GGTCGCCGGT TCGTCGGAAA TCGCCCAGGT CGGCATCATC TCGGCCAACG GCGACACCGA AGTCGGCCAG AAGATCGCCG AGGCGATGGA GAAGGTCGGC AAGGAAGGCG TGATCACCGT TGAAGAGGCC AAGGGCCTCG AATTCGAACT CGATGTCGTC GAAGGCATGC AGTTCGACCG CGGCTACCTC TCGCCCTACT TCATCACCAA CCCGGAAAAG ATGACGGTCG AACTCGAGAA CCCGTACATC CTGATCCACG AGAAGAAGCT GTCGTCGCTC CAGGCGATGC TGCCGATCCT CGAAGCCGTG GTGCAGTCGG GCCGTCCGCT CCTCATCATC GCCGAGGACA TCGAAGGCGA AGCGCTGGCC ACCCTCGTGG TCAACAAGCT GCGCGGTGGC CTCAAGATTG CCGCCGTCAA GGCTCCGGGC TTCGGTGACC GCCGCAAGGC CATGCTGGGC GACATCGCCA CGCTGACCGC CGGCGAAATG ATCTCCGAAG ACCTCGGCAT CAAGCTGGAG AGCGTCACGC TCGCCATGCT CGGCCAGGCC AAGAAGGTCA CCATCGACAA GGACAACACC ACGATCGTCG ACGGCGCCGG TTCGGCCGAA GAGATCAAGG CCCGCGTCGA GCAGATCCGT GCGCAGATCG AAGTCACCAC TTCGGACTAC GACCGCGAGA AGCTGCAGGA ACGCCTTGCC AAGCTTGCTG GCGGCGTTGC CGTGATCAAG GTCGGCGGCG CGACCGAAGT CGAGGTCAAG GAGCGCAAGG ACCGCGTCGA CGACGCTCTC CACGCCACCC GCGCCGCAGT CGAGGAAGGT ATCGTCCCGG GCGGCGGTAC GGCTCTGCTC TATGCCACCA AGGCTCTCGA AGGCCTCAAG GGCGCCAACG ACGACCAGAC CAAGGGCATC GACATCGTGC GCCGCGCGAT CCAGGCCCCG ATCCGTCAGA TCGCCGCGAA CGCGGGTCAT GACGGTGCGG TCGTCTCGGG CAACCTGCTG CGCGAGAACG ACGAGAACCA GGGCTTCAAC GCCGCGACCG ACACCTACGA GAACCTGAAG GCCGCCGGCG TCATCGACCC GACCAAGGTC GTGCGCACCG CGCTTCAGGA CGCTGCCTCG GTCTCGGGCC TGCTGATCAC CACGGAAGCG GCGATCAGCG AGAAGCCGGA CGACAAGCCC GCGATGCCGC CGATGGGCGG TGGAATGGGC GGCATGGGCG GCATGGACTT CTAA
|
Protein sequence | MAAKDVKFGR DARERILRGV DILADAVKVT LGPKGRNVVI DKSFGAPRIT KDGVSVAKEI ELKDKFENMG AQMLREVASK ANDAAGDGTT TATVLAQAIV REGMTAVAAG MNPMDLKRGI DIAVGKVVEN LKARSTPVAG SSEIAQVGII SANGDTEVGQ KIAEAMEKVG KEGVITVEEA KGLEFELDVV EGMQFDRGYL SPYFITNPEK MTVELENPYI LIHEKKLSSL QAMLPILEAV VQSGRPLLII AEDIEGEALA TLVVNKLRGG LKIAAVKAPG FGDRRKAMLG DIATLTAGEM ISEDLGIKLE SVTLAMLGQA KKVTIDKDNT TIVDGAGSAE EIKARVEQIR AQIEVTTSDY DREKLQERLA KLAGGVAVIK VGGATEVEVK ERKDRVDDAL HATRAAVEEG IVPGGGTALL YATKALEGLK GANDDQTKGI DIVRRAIQAP IRQIAANAGH DGAVVSGNLL RENDENQGFN AATDTYENLK AAGVIDPTKV VRTALQDAAS VSGLLITTEA AISEKPDDKP AMPPMGGGMG GMGGMDF
|
| |