Gene Bind_1738 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_1738 
Symbol 
ID6200696 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp1965730 
End bp1967376 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content62% 
IMG OID641705729 
Productchaperonin GroEL 
Protein accessionYP_001832857 
Protein GI182678711 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.280539 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.00152275 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGCAGCCA AAGACGTCCG TTTCTCGTCT GATGCCCGCG ATCGTATGCT GCGCGGCGTC 
GAGATCCTGG CCAATGCGGT CAAGGTCACC CTCGGCCCCA AAGGCCGTAA CGTCGTCATC
GAAAAATCCT TCGGTGCGCC GCGCATCACC AAGGACGGCG TGAGCGTCGC CAAGGAGATC
GAACTCGCCG ACAAGTTCGA AAATCTCGGT GCGCAGCTCG TGCGCGAAGT CGCTTCCAAG
CAGAACGATG CGGCTGGCGA CGGCACCACC ACCGCCACCA TTCTCGCCGC TTCCATCGTC
AAGGAAGGCA CCAAGGCGGT CGCCGCCGGC CTCAACCCGA TGGATCTGAA GCGCGGCATC
GACCACGCCG TCGAAGCGAT CGTCGCCGAC CTCAAGGCTA ATTCCAAGAA GGTCACCTCG
AACGACGAAA TCGCCCAGGT CGGCACGATT TCGGCCAATG GCGACAAGTC CGTCGGCGAC
ATGATCTCGA CCGCCATGCA GAAGGTCGGC AACGAGGGTG TCATCACCGT CGAGGAAGCC
AAGAGCCTCG AGACCGAGCT CGATGTCGTC GAAGGCATGC AGTTCGATCG CGGCTATCTC
TCCCCCTACT TCATCACCAA TGCCGAGAAG ATGATCGCCG AGCTCGAGGA TCCCTATATC
CTCGTCCACG AGAAAAAGCT CTCCTCGCTG CAAGCCATGC TGCCGGTGCT CGAAGCCGTC
GTGCAGACCG GCAAGCCGCT CGTCATCATC GCTGAGGACG TCGAAGGTGA GGCTCTTGCC
ACGCTCGTCG TCAACAAGCT GCGCGGTGGC CTCAAGGTCG CGGCCGTCAA GGCTCCGGGC
TTCGGTGATC GCCGCAAGGC CATGCTCGAA GACATCGCCA TCCTGACCGG CGGCACCCTG
ATCTCCGAAG AGCTCGGCAT CAAGCTCGAG AACGTCACGC TTGCCATGCT CGGCCGCGCC
AAGCGCATCC GCATCGACAA GGAAGCCACG ACGATCATCG ATGGCGCCGG CAACAAGGAC
GACATCGAGG GCCGTATCTC CCAGATCAAG GCTCAGATCG CCGAGACCAC TTCCGATTAC
GATCGTGAAA AGATGCAGGA GCGTCTCGCC AAGCTCGCTG GCGGCGTCGC CGTGCTGCGT
GTCGGCGGCT CGACCGAAGT CGAAGTGAAG GAAAAGAAGG ACCGCGTCGA CGACGCGCTC
AACGCGACCC GCGCTGCGGT CGAAGAAGGC GTTTCCCCCG GTGGTGGTGT CGCTCTTCTC
CGCGCGATCA AGGCGCTCGA AAACCTGCCT ACGGATAATT CCGACCAGAA GGCTGGTATC
GAAATCGTCC GCAAGGCGAT CCAGACCCCG GCTAAGCAGA TCGTCGACAA TTCCGGCGGC
GACGGCGCGG TCGTGGTTGG CAAGCTGCTC GAGTCGAACG AATATGCTTT CGGCTACAAT
GCCCAGACCG GCGAATATGG CGACATGGTC AAGCTCGGCA TTATCGACCC GACCAAGGTT
GTCCGCACGG CGCTCCAAGA TGCGGCTTCC ATTGCCGGCC TCCTGATCAC CACCGAGGCG
ACGATCACCG AAGCGCCGAA GAAGGAAGCT CCGCTTCCTC CGATGGGCGG TGGCGGCATG
GGCGGCATGG GCGGTATGGA TTTCTAA
 
Protein sequence
MAAKDVRFSS DARDRMLRGV EILANAVKVT LGPKGRNVVI EKSFGAPRIT KDGVSVAKEI 
ELADKFENLG AQLVREVASK QNDAAGDGTT TATILAASIV KEGTKAVAAG LNPMDLKRGI
DHAVEAIVAD LKANSKKVTS NDEIAQVGTI SANGDKSVGD MISTAMQKVG NEGVITVEEA
KSLETELDVV EGMQFDRGYL SPYFITNAEK MIAELEDPYI LVHEKKLSSL QAMLPVLEAV
VQTGKPLVII AEDVEGEALA TLVVNKLRGG LKVAAVKAPG FGDRRKAMLE DIAILTGGTL
ISEELGIKLE NVTLAMLGRA KRIRIDKEAT TIIDGAGNKD DIEGRISQIK AQIAETTSDY
DREKMQERLA KLAGGVAVLR VGGSTEVEVK EKKDRVDDAL NATRAAVEEG VSPGGGVALL
RAIKALENLP TDNSDQKAGI EIVRKAIQTP AKQIVDNSGG DGAVVVGKLL ESNEYAFGYN
AQTGEYGDMV KLGIIDPTKV VRTALQDAAS IAGLLITTEA TITEAPKKEA PLPPMGGGGM
GGMGGMDF