Gene Cag_1306 details

Gene Information

Locus tag: Cag_1306
Symbol: groEL
ID: 3747395
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 1772444
End bp: 1774087
Gene Length: 1644 bp
Protein Length: 547 aa
Translation table: 11
GC content: 46%
IMG OID: 637773843
Product: chaperonin GroEL
Protein accession: YP_379609
Protein GI: 78189271
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02348] chaperonin GroL
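
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon. The record does not state either convention, but both are consistent with the reported numbers. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 1772444
end_bp = 1774087
reported_gene_length = 1644    # bp
reported_protein_length = 547  # aa


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_from_cds(cds_length):
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


gene_length = span_length(start_bp, end_bp)
protein_length = protein_length_from_cds(gene_length)
print(gene_length, protein_length)  # 1644 547
```

Under those assumptions, the 1644 bp span corresponds to 548 codons, of which 547 encode residues and one is the stop codon.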


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000255221
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTGCAA AAGATATTTT TTTTGATACC GATGCGCGTG CAAAGCTTAA AGTTGGTGTG 
GATAAGCTTG CTAACGCAGT AAAAGTTACG CTTGGACCTG CCGGTCGCAA TGTGTTGATT
GACAAAAAAT TTGGTGCACC AACATCTACC AAAGATGGTG TAACCGTTGC TAAAGAGATT
GAGCTTGCCG ATGCTGTTGA AAACATGGGT GCTCAAATGG TGCGCGAAGT GGCTTCTAAA
ACCAGCGATG TTGCTGGCGA TGGCACCACC ACAGCAACCG TGCTTGCTCA AGCTATTTAC
CGTGAAGGTT TAAAGAACGT TACCGCTGGC GCTCGTCCAA TTGACCTGAA AAGAGGCATT
GACCGTGCGG TTAAAGAGGT GGTTGCTGAA TTAAAAGCTA TCAGCCGCAG CATTTCCAGC
AAAAAAGAGA TTGCTCAGGT TGGAACCATT TCGGCTAACA ACGATCCAGA AATCGGTGAG
TTAATTGCTG AAGCTATGGA AAAAGTTGGC AAAGATGGTG TTATTACGGT TGAAGAGGCT
AAAGGTATGG AAACCGAGCT GAAAGTAGTT GAAGGTATGC AGTTCGATCG TGGTTACCTT
TCTCCATACT TTGTAACCAA TTCCGACACC ATGGAAGCCG AGCTCGACAA TCCGCTGATT
CTTATCTACG ATAAAAAAAT CAGCAACATG AAAGAGCTGC TCCCAATTCT TGAAAAATCA
GCACAATCAG GTCGTCCTCT GCTTATTATT GCTGAAGATA TTGAAGGTGA AGCACTTGCT
ACCCTTGTAG TAAACAAACT ACGTGGCACG CTGAAAGTAT GTGCCGTTAA AGCTCCAGGC
TTTGGCGATC GCCGCAAAGC AATGCTTGAA GATATTGCTA TTCTTACCGG TGGCACCGTT
ATTTCGGAAG AGAAAGGCTA CAAGCTTGAA AATGCTACCC TTTCTTACCT CGGTCAAGCA
GGCAGCGTAA GCCTCGACAA AGACAACACT ACCCTTGTGG AAGGCAAAGG CGCAAGCGAT
GCAATTAAAG CTCGCATTAA CGAAATCAAA GGGCAGATTG AAAAATCAAC CTCCGATTAC
GATACCGAAA AATTGCAAGA GCGTCTTGCA AAACTTTCTG GCGGTGTAGC CGTTATCAAC
ATTGGTGCAT CAACCGAAGT TGAAATGAAA GAGAAAAAAG CTCGCGTTGA GGATGCTCTT
CATGCAACCC GTGCAGCAGT TCAAGAAGGT ATTGTTGTTG GTGGTGGCGT TGCTCTTATC
CGTGCAATTA AAGGTCTCAA CAACGCACAA GCCGACAACG AAGATCAGAA AATTGGTATC
GAAATTGTTC GCCGTGCGCT CGAAGAACCA CTTCGTCAGA TTGTTGCCAA CACTGGTACT
CAAGATGGCG CTGTAGTTCT CGAAAAAGTA AAAGAAGGCG AAGGCGACTT TGGCTTTAAT
GCAAGAACCG AAACCTACGA AAACCTTGTA GAAGCTGGTG TTGTTGACCC AACCAAAGTA
ACCCGTAGCG CTCTTGAGAA TGCTGCGTCA GTTGCAGGTA TTCTGCTTAC CACTGAGGCT
GCTATTACCG ACATCAAAGA CGATAAGATG GATATGCCTG CTATGCCTCC AGGTGGCATG
GGCGGAATGG GCGGTATGTA CTAA
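
A GC content figure such as the one reported for this gene is the percentage of G and C bases in the sequence. The following is a minimal sketch of that calculation, not the database's own method. The helper name `gc_percent` is hypothetical, and the function ignores the spaces and line breaks used to group the sequence into blocks of ten.

```python
def gc_percent(sequence):
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First ten-base block of the gene sequence above: 2 G + 2 C out of 10 bases.
first_block = "ATGACTGCAA"
print(gc_percent(first_block))  # 40.0
```

Passing the full gene sequence to the same function gives a value to compare with the reported GC content.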
 
Protein sequence
MTAKDIFFDT DARAKLKVGV DKLANAVKVT LGPAGRNVLI DKKFGAPTST KDGVTVAKEI 
ELADAVENMG AQMVREVASK TSDVAGDGTT TATVLAQAIY REGLKNVTAG ARPIDLKRGI
DRAVKEVVAE LKAISRSISS KKEIAQVGTI SANNDPEIGE LIAEAMEKVG KDGVITVEEA
KGMETELKVV EGMQFDRGYL SPYFVTNSDT MEAELDNPLI LIYDKKISNM KELLPILEKS
AQSGRPLLII AEDIEGEALA TLVVNKLRGT LKVCAVKAPG FGDRRKAMLE DIAILTGGTV
ISEEKGYKLE NATLSYLGQA GSVSLDKDNT TLVEGKGASD AIKARINEIK GQIEKSTSDY
DTEKLQERLA KLSGGVAVIN IGASTEVEMK EKKARVEDAL HATRAAVQEG IVVGGGVALI
RAIKGLNNAQ ADNEDQKIGI EIVRRALEEP LRQIVANTGT QDGAVVLEKV KEGEGDFGFN
ARTETYENLV EAGVVDPTKV TRSALENAAS VAGILLTTEA AITDIKDDKM DMPAMPPGGM
GGMGGMY
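
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the tool used to produce this record. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in the conventional TCAG ordering
# (third base varies fastest); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence):
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGACTGCAA AAGATATTTT TTTTGATACC"))  # MTAKDIFFDT
```

The ten residues printed match the first block of the protein sequence, and the last line of the gene sequence (`GGCGGAATGG GCGGTATGTA CTAA`) translates to `GGMGGMY` followed by the stop codon.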