Gene BBta_3034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_3034 
SymbolgroEL 
ID5150367 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp3169771 
End bp3171393 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content66% 
IMG OID640557906 
Productchaperonin GroEL 
Protein accessionYP_001239060 
Protein GI148254475 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.398943 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGCCA AGGACGTGAA GTTTTCGACC GACGCGCGCG ACCGCATGCT GCGCGGCGTC 
GACATCCTCG CCAATGCGGT CAAGGTCACG CTCGGCCCCA AGGGCCGCAA CGTCGTGATC
GAGAAATCGT TCGGCGCGCC GCGCATCACC AAGGACGGCG TCACGGTCGC CAAGGAGATC
GAGCTCGAGG ACAAGTTCGA GAACATGGGC GCGCAGATGG TGCGCGAGGT GGCCTCGAAG
ACCGCCGATC TCGCCGGCGA CGGCACCACC ACCGCCACCG TGCTCGCCCA GGCGATCGTG
AAGGAAGGCG CGAAGTCGGT CGCCGCCGGC ATGAACCCGA TGGACCTCAA GCGCGGCATC
GACCTCGCGG TCGACGCCAT CGTGGCCGAC CTGAAGGCGC ACGCCAAGAA GATCACCAGC
AATGACGAGA TCGCCCAGGT CGGCACCATC TCGGCCAATG GCGACAACGA GATCGGCCGC
TTCCTGGCCG AGGCCATGCA GAAAGTCGGC AATGAGGGCG TGATCACGGT CGAGGAGGCC
AAGAGCCTCG ACACCGAGCT CGAAGTGGTC GAGGGCATGC AGTTCGACCG TGGCTATGTC
TCGCCATACT TCGTCACCAA TTCCGAGAAG ATGCGGGTCG AGCTCGAGGA TCCCTATATT
CTGATCCACG AGAAGAAGCT GTCGGGCCTG CAGACCATGC TGCCGCTGCT CGAAGCGGTG
GTGCAGTCCG GCAAGCCGCT CTTGATCGTC GCCGAGGACG TTGAAGGCGA GGCGCTGGCG
ACCTTGGTCG TCAACAAGCT GCGCGGCGGC CTCAAGATCG CCGCCGTCAA GGCGCCGGGC
TTCGGCGATC GCCGCAAGGC GATGCTGGAG GACATCGCGA TCCTCACCGG CGGCACCACG
ATATCAGAGG ATCTCGGCAT CAAGCTGGAG AACGTGACCC TGTCGATGCT CGGCCGCGCC
AAGAAGGTCG TCATCGACAA GGAAAACACC ACCATCGTCG ATGGTGCCGG CGCCAAGAAG
GACATCGAGG CGCGCACGCA GCAGATCAAG CTGCAGATCG AGGAGACCAC CTCCGACTAT
GACCGCGAGA AGCTGCAGGA GCGGCTGGCC AAGCTCGCCG GCGGCGTCGC GGTGATCAGG
GTCGGCGGCG CCACCGAGGT CGAGGTCAAG GAGCGCAAGG ACCGCGTCGA CGATGCGCTG
CATGCCACGC GCGCGGCTGT CGAGGAGGGC ATCCTGCCCG GCGGCGGCGT GGCGCTGTTG
CGCGCCACCA AGGTGCTCGA CGGCGTCAAG ACCGCCAATG CCGACCAGAA GGCCGGGGTC
GACATCATCC GCCGCGCCAT CCAGGTGCCG GTGCGGCAGA TCGTGCAGAA CGCCGGCGAG
GACGGCTCGC TGGTGGTCGG CAAGCTCCTG GAGAAGGACA CCTACAGCTG GGGCTTCAAC
GCCGCGACCG GCGAGTACCA GGATCTGGTG CAGGCCGGCG TGATCGACCC GGCCAAGGTG
GTCCGCACCG CGCTGCAGGA TGCGGCCTCG GTCGCCTCGC TGCTGATCAC CACCGAGGCG
CTGGTTGCCG ACAAGCCGAA GAAGGCGGAG GCCACGCAGG CAGCGCCGGC GATGGACTTC
TGA
 
Protein sequence
MAAKDVKFST DARDRMLRGV DILANAVKVT LGPKGRNVVI EKSFGAPRIT KDGVTVAKEI 
ELEDKFENMG AQMVREVASK TADLAGDGTT TATVLAQAIV KEGAKSVAAG MNPMDLKRGI
DLAVDAIVAD LKAHAKKITS NDEIAQVGTI SANGDNEIGR FLAEAMQKVG NEGVITVEEA
KSLDTELEVV EGMQFDRGYV SPYFVTNSEK MRVELEDPYI LIHEKKLSGL QTMLPLLEAV
VQSGKPLLIV AEDVEGEALA TLVVNKLRGG LKIAAVKAPG FGDRRKAMLE DIAILTGGTT
ISEDLGIKLE NVTLSMLGRA KKVVIDKENT TIVDGAGAKK DIEARTQQIK LQIEETTSDY
DREKLQERLA KLAGGVAVIR VGGATEVEVK ERKDRVDDAL HATRAAVEEG ILPGGGVALL
RATKVLDGVK TANADQKAGV DIIRRAIQVP VRQIVQNAGE DGSLVVGKLL EKDTYSWGFN
AATGEYQDLV QAGVIDPAKV VRTALQDAAS VASLLITTEA LVADKPKKAE ATQAAPAMDF