Gene Francci3_0633 details

Gene Information

Locus tag: Francci3_0633
Symbol: groEL
ID: 3903311
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 715686
End bp: 717326
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 69%
IMG OID: 637877966
Product: chaperonin GroEL
Protein accession: YP_479746
Protein GI: 86739346
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02348] chaperonin GroL
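
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (an assumption that matches the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 715686
end_bp = 717326

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # compare with "Gene Length" in the record
print(protein_length, "aa")  # compare with "Protein Length" in the record
```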


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCCGAAGA TTCTGACGTT CAACGAGGAC GCCCGCCGCG CGCTCGAGCA CGGGGTGAAC 
GCCCTGGCCA ACGCGGTCAA GGTGACGATC GGCCCGCGTG GTCGCAATGT CGTCATCGAC
AAGCACTACG GCGCCGCGAC GATCACCAAT GACGGGGTGA CGATCGCGCG CGAGATCGAG
CTGGAGGACC CCTACGAGAA CCTGGGCGCC CAGCTCGCGA AGGAAGTCGC CACCAAGACC
AACGACGTGG CTGGCGACGG CACCACGACG GCGACCGTGC TAGCCCAGGA GATGGTGCGC
TTCGGTCTCA AGCAGGTGAC CGCGGGGGCC GCCCCACTGA CGCTGAAGCT GGGCATCGAG
GCCGCCGTCG AGGCCGTCTC CGCGGCACTG CTGAAGCAGG CCATCGAGGT CAACTCGAAG
GAGACCATCG CCCAGGTCGC CGCCATCTCC GCTCAGGACC CGCAGGTCGG GGAACTGATC
GCCGAGGCGA TCGACAAGAT CGGCAAGGAC GGCGTCATCA CGGTCGAGGA GAGCCAGACC
CTCGGGCTGG ACCTTGAACT GACCGAGGGC ATGCAGTTCG ACAAGGGCTA CATCTCGCCG
TACTTCGTCA CGGACGCCGA GGCCCAGGAG GCCGTGCTCG AGGACGCCTA CGTCCTGCTC
TACCCGGGCA AGATCTCGGC GCTGAACGAG ATCCTGCCCG TGCTGGAGCA GGTCGTCCAG
GAGCGCAAGC CGCTACTGAT TATCGCCGAG GAGGTCGAGG GCGAGGCCCT GTCCACCCTG
GTGGTGAACT CGATCCGCAA GACCTTCCAG GTCGTCGCGG TCAAGGCTCC CGGGTTTGGG
GACCGCCGCA AGGCACTGCT GCAGGATATC GCCGTGCTCA CCGGCGGCCA GGTGGTGGCC
TCGGAGGTCG GTCTTTCCCT CGACGCGGTG ACGTTGGCCG ACCTGGGCCG GGCCCGGCGG
GTCGTGGTGG ACAAGGACAA CACCACCATC GTTGACGGGG TTGGCGAGGC CTCCTCGATC
GCCGATCGGG TGCGTCAGCT CAAGCAGGAG ATCGAGGTCA GCGACTCCGA CTGGGACCGC
GAGAAGCTGC AGGAGCGGTT GGCCAAGCTC GCCGGTGGGG TCGCGGTCAT CCGCGTCGGC
GCCGCCACCG AGGTGGAGCT CAAGGAGAGG AAGCACCGCC TCGAGGACGC CGTGTCGGCT
ACCCGCGCGG CCATCGAGGA GGGCATCATC GCCGGCGGCG GTTCCGCGCT CACCCACGTG
GCGTCCGTGC TCGATGACGG GCTCGGTCGC ACCGGGGACG AGCTCGCCGG GGTGCGGATC
GTGCGCCGCG CGCTCGACGC CCCGCTGTCG TGGATCGCGC GCAACGCTGG TCTGGAGGGC
GCGGTCATCG TCTCCAAGGT CAAGGAGCTC GAGCCGGGTC GTGGGTACAA CGCGGCCACC
GGCGAGTACA CCGATCTGAT CGCGGCCGGC GTCATCGACC CGGTCAAGGT CACCCGGTCG
GCGGTGGCGA ACGCCGCCTC GATCGCGGCT CTGCTCATCA CCACCGAGGG CCTGGTCGTC
GAGAAGCCGG CGGAGCCCGC TCCCCAGGAC GGCCACGGCC ACGGCCACGG GCACAGCCAC
CCGCAGGGCC CGGGTTTCTG A
 
Protein sequence
MPKILTFNED ARRALEHGVN ALANAVKVTI GPRGRNVVID KHYGAATITN DGVTIAREIE 
LEDPYENLGA QLAKEVATKT NDVAGDGTTT ATVLAQEMVR FGLKQVTAGA APLTLKLGIE
AAVEAVSAAL LKQAIEVNSK ETIAQVAAIS AQDPQVGELI AEAIDKIGKD GVITVEESQT
LGLDLELTEG MQFDKGYISP YFVTDAEAQE AVLEDAYVLL YPGKISALNE ILPVLEQVVQ
ERKPLLIIAE EVEGEALSTL VVNSIRKTFQ VVAVKAPGFG DRRKALLQDI AVLTGGQVVA
SEVGLSLDAV TLADLGRARR VVVDKDNTTI VDGVGEASSI ADRVRQLKQE IEVSDSDWDR
EKLQERLAKL AGGVAVIRVG AATEVELKER KHRLEDAVSA TRAAIEEGII AGGGSALTHV
ASVLDDGLGR TGDELAGVRI VRRALDAPLS WIARNAGLEG AVIVSKVKEL EPGRGYNAAT
GEYTDLIAAG VIDPVKVTRS AVANAASIAA LLITTEGLVV EKPAEPAPQD GHGHGHGHSH
PQGPGF
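
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method. The function names `clean`, `translate`, and `gc_percent` are hypothetical helpers. The codon-to-amino-acid mapping used here is the standard one, which translation table 11 shares (table 11 differs in which codons may act as start codons, not in the amino acid assignments). Only the first and last lines of the gene sequence are included for brevity; the full sequence can be pasted in the same way.

```python
BASES = "TCAG"
# Standard amino acid assignments, ordered by first, second, third base in TCAG order.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGCCGAAGA TTCTGACGTT CAACGAGGAC GCCCGCCGCG CGCTCGAGCA CGGGGTGAAC"
last_line = "CCGCAGGGCC CGGGTTTCTG A"

print(translate(first_line))  # compare with the start of the protein sequence
print(translate(last_line))   # compare with the end of the protein sequence
```

Applying `gc_percent` to the full gene sequence gives a value that can be compared with the "GC content" field in the record.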