Gene Gdia_0861 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0861 
Symbol 
ID6974258 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp978489 
End bp980114 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content66% 
IMG OID643390390 
Productchaperonin GroEL 
Protein accessionYP_002275266 
Protein GI209543037 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00285794 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAGCA AAGACGTCAA GTTCGCCGGC GACGCACGGG CGCGCCTGCT TTCCGGAATC 
GACACGCTTG CCGACGCGGT CAAGGTGACG CTGGGGCCGA AGGGCCGCAA CGTCGTCATC
GACAAGAGCT TCGGCGCGCC AAGGATCACC AAGGACGGTG TCACGGTCGC CAAGGAGATC
GAACTGTCCG ACAAGTTCGA GAACCTGGGC GCGCAGCTTC TGCGTGAGGT CGCCAGCAAG
ACCAACGACC TGGCGGGCGA CGGGACGACG ACCGCGACCG TCCTGGCGCA GTCCATCGTG
CGCGAGGGGC TGAAGGCGGT CGCCGCCGGT TTCAACCCGC AGGACGTCAA GCGCGGAATC
GATCACGCGA CGACCGCGGT GATCGAGGAA CTGCGGACGC GTACCCGCCC GATCGCGACC
CGGGAGGAAA CCGCCCAGGT GGCCACGATT TCGGCCAACG GCGAGGTGGA AATCGGCCGC
ATCATTTCCG AGGCGGTGCA GAAGGTCGGC AAGGACGGTG TGATCACCGT CGAGGAAGCC
AAGGGATTCG AGACCGAACT GGACGTAGTC GAGGGGTTGC AGTTCGACCG GGGCTATATC
TCGCCCTATT TCGTGACGAA CAGCGAGAAG CTGATCGCGG ACCTGGAAAA TCCCTATATC
CTGATCCATG AAAAGAAGCT GTCGTCGCTG CAGCCCCTGC TGCCGCTGCT GGAGAACGTG
GTCAAATCCG GGCGCCCGCT GCTGAGCATC GCCGAGGATG TCGAGGGCGA GGCCCTGGCG
ACCCTGGTGG TGAACAAGTT GCGCGGCGGG CTGAAGATCG CGGCTGTCAA GGCGCCGGGC
TTCGGCGACC GGCGCAAGGC CATTCTGGAG GATATCGCGA TCCTGACGGG TGGCGAGGTC
ATCAGCGAGG ATCTGGGCAT CAAGCTGGAA AGCGTGACGC TGTCGCAGCT TGGCCAGGCG
CGGCGCATCG TGATCGACAA GGACAACACG ACCATCGTCG ACGGCGAGGG CGACGCCGAC
GCCATCAAGG GCCGCGTCGG GCAGATCCGC GCGCAGATCG AGGAAACCAC CTCGGACTAC
GATCGGGAAA AATTGCAGGA GCGCCTGGCG AAGCTGGCGG GCGGAGTGGC CATCATCCGG
GTCGGCGGTT CGACCGAAAT CGAGGTGAAG GAGCGCAAGG ATCGCGTCGA TGACGCGCTG
AACGCCACGC GCGCGGCCGT CGAGGAAGGC ATCGTCCCGG GCGGCGGGAC CGCGCTGGCG
CGCGCGGCGG AGGTCGTGGC GCGGCTGCAG TTCCATAATG ACGACCAGCG CATCGGCGGC
GACATCGTCC GCAAGGCATT GCAGGCGCCG CTGCGCCAGA TCGCGGAGAA TGCCGGCGAG
GACGGTGCGG TCGTGGCCGG AAAGGTGCTG GAGAACGGCG CATACAATTT CGGATTCGAT
GCGCAGATCG GCGAATTCAA GGATCTGGTC GCCGCCGGCA TCATCGACCC GACCAAGGTC
GTGCGCACGG CCCTGCAGGA CGCGGCCTCG GTCGGCAGCC TGCTGATCAC GACCGAGGTC
CTGGTCACCG AAAAGGCCGA ACCCAAGCCG GCCGCCCCAC CGGCGGGTGC CGACCTCGGA
TACTGA
 
Protein sequence
MASKDVKFAG DARARLLSGI DTLADAVKVT LGPKGRNVVI DKSFGAPRIT KDGVTVAKEI 
ELSDKFENLG AQLLREVASK TNDLAGDGTT TATVLAQSIV REGLKAVAAG FNPQDVKRGI
DHATTAVIEE LRTRTRPIAT REETAQVATI SANGEVEIGR IISEAVQKVG KDGVITVEEA
KGFETELDVV EGLQFDRGYI SPYFVTNSEK LIADLENPYI LIHEKKLSSL QPLLPLLENV
VKSGRPLLSI AEDVEGEALA TLVVNKLRGG LKIAAVKAPG FGDRRKAILE DIAILTGGEV
ISEDLGIKLE SVTLSQLGQA RRIVIDKDNT TIVDGEGDAD AIKGRVGQIR AQIEETTSDY
DREKLQERLA KLAGGVAIIR VGGSTEIEVK ERKDRVDDAL NATRAAVEEG IVPGGGTALA
RAAEVVARLQ FHNDDQRIGG DIVRKALQAP LRQIAENAGE DGAVVAGKVL ENGAYNFGFD
AQIGEFKDLV AAGIIDPTKV VRTALQDAAS VGSLLITTEV LVTEKAEPKP AAPPAGADLG
Y