Gene Francci3_2175 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2175 
SymbolgroEL 
ID3906775 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2548390 
End bp2550015 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content66% 
IMG OID637879508 
Productchaperonin GroEL 
Protein accessionYP_481274 
Protein GI86740874 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.755929 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.503349 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAAAGA TCATCGCTTA CGAGGAGGAA GCACGGCGCG GGCTGGAACG TGGCATGAAC 
CAGCTGGCCG GCGCCGTGAA GGTGACGCTC GGTCCGAAGG GGCGCAACGT CGTCCTGGAG
AAGAAGTGGG GCGTTCCCGC CATCACCAAT GACGGCGTCT CCATAGCCAG GGAGATCGAG
CTCGAAGATC CGTATGAGAA GATCGGTGCC GAGATGGTCA AGGAGGTCGC CAAGAAGACC
GACGAGGTGG CCGGCGACGG CACCACGACG GCGACCGTTC TGGCGGAGGC ACTGGTCCAT
GAGGGCCTGC GGAACGTGGC CGCGGGCGCG AACCCGATCG CCCTCAAACG TGGCATCGAG
CTGGCTGTCG AGCGGGTCTG TGGGGAACTG GCCAATCTGT CCAGGGAGCT GGAAACCAAG
GATCAGATCG CCTCAACGGC CTCGATCTCG GCCGGCGGGG ACACCGCGAT CGGCCAGATC
ATCGCCGAGG CGATGGACAA GGTCGGCCGG GACGGCGTCA TCACCGTCGA GGAGAGCAAC
ACCTTCGGCC TCGAGCTGGA GCTCACCGAA GGTATGCGTT TCGACAAGGG CTACATCTCG
CCGTACTTCA TCACTGATCA GGAGCGGATG GAGTGCGTCC TGGAGGACCC CTACATCCTG
GTCGCCAACA TCAAGATTTC GCTGGTCAAG GACCTGCTCC CGCTGTTGGA GAAGGTCATG
CAGGCCGGCA GGCCGCTGCT GGTCATCGCC GAGAACGTTG AGGGGGAGGC CCTGGCGACC
CTGGTCGTCA ACAAGATCCG CGGTACGTTC CGGTCCGTGG CCGTGAAGGC GCCGGGTTTC
GGCGAGCGGC GCAAGGCCAT GCTCGGCGAT ATCGCCGTTC TGACGGGCAG TCAGGTGATC
AGTGAGGAGG TTGGTCTCAG GCTGGAGAAC GCCGACCTCG ACCTGCTTGG CCGGGCCCGC
AAGGTTGTCG TTACCAAGGA TGACACGACC ATTATCGAGG GCGCCGGCGA CCCGGGCCGG
ATCGCCGGTC GGGTCAGCCA GATCCGTAGC GAGATCGAGA AGTCGGACTC CGACTACGAT
CGCGAGAAGC TGCAGGAGCG GCTGGCCAGG CTCGCCGGTG GCGTGGCCGT CATCAAAGCC
GGCGCGGCCA CCGAGGTCGA GCTCAAGGAG CGTAAGCACC GCATCGAGGA CGCGGTCCGC
AACGCGAAGG CCGCCGTTGA GGAGGGCATC GTCCCCGGCG GTGGGGTGGC TCTGCTGCTG
GCCTCGGGGG CTGTCTTCGA CGGGCTGGAG GTGGCTGAGG ACGAGCGGAC CGGGGCCGAG
ATGGTGCGCC GCGCGTTGAC CGAGCCGCTC CGGCAGATCG CGGTCAATGC CGGCCTGGAA
GGCGGCGTCG TGGTCGAGAA GGTCCGCAAC CTGCAACCGG GGTGGGGGCT GGACGCCGCC
ACCGGCGAGC ACGTCAACAT GCTCGAGGCC GGGATCATCG ACCCGACCAA GGTCACCCGC
TCCGCCCTGC AGAATGCCGC ATCCATCGCC GGGCTGTTCC TCACCACCGA GGCCGTCGTT
GCCGAAAAGC CAGAGGAAAA GGAAACCGCG GCAGCGCCAG CTGGTGGGGG TGGCCTGGAG
TACTGA
 
Protein sequence
MPKIIAYEEE ARRGLERGMN QLAGAVKVTL GPKGRNVVLE KKWGVPAITN DGVSIAREIE 
LEDPYEKIGA EMVKEVAKKT DEVAGDGTTT ATVLAEALVH EGLRNVAAGA NPIALKRGIE
LAVERVCGEL ANLSRELETK DQIASTASIS AGGDTAIGQI IAEAMDKVGR DGVITVEESN
TFGLELELTE GMRFDKGYIS PYFITDQERM ECVLEDPYIL VANIKISLVK DLLPLLEKVM
QAGRPLLVIA ENVEGEALAT LVVNKIRGTF RSVAVKAPGF GERRKAMLGD IAVLTGSQVI
SEEVGLRLEN ADLDLLGRAR KVVVTKDDTT IIEGAGDPGR IAGRVSQIRS EIEKSDSDYD
REKLQERLAR LAGGVAVIKA GAATEVELKE RKHRIEDAVR NAKAAVEEGI VPGGGVALLL
ASGAVFDGLE VAEDERTGAE MVRRALTEPL RQIAVNAGLE GGVVVEKVRN LQPGWGLDAA
TGEHVNMLEA GIIDPTKVTR SALQNAASIA GLFLTTEAVV AEKPEEKETA AAPAGGGGLE
Y