Gene P9301_04761 details

Gene Information

Locus tag: P9301_04761
Symbol: groEL
ID: 4912653
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9301
Kingdom: Bacteria
Replicon accession: NC_009091
Strand:
Start bp: 416187
End bp: 417932
Gene Length: 1746 bp
Protein Length: 581 aa
Translation table: 11
GC content: 38%
IMG OID: 640160054
Product: chaperonin GroEL
Protein accession: YP_001090700
Protein GI: 126695814
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02348] chaperonin GroL
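
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Both are assumptions about how this record reports its values, not something the record states. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 416187
end_bp = 417932
reported_gene_length = 1746      # bp
reported_protein_length = 581    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length == reported_gene_length)
print(remainder == 0)
print(protein_length == reported_protein_length)
```

If any of the three checks printed False, the coordinate convention assumed here would not match the record.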


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCAAAC AGTTAAGTTT TTCTAATGAG TCAAGAGAAG CGCTAGAAAA AGGTGTGAAT 
TTCGTAGCTA ATGCAGTAAA GGTTACTATT GGGCCAAAAG CAAAAAACGT TGTAATAGAG
AAGAAATTTG GTTCGCCAGA TATAGTAAGA GATGGATCTA CAGTTGCTAA AGAGATCGAG
ATTGAAAACC CCATCTCTAA TTTAGGTGCG AAATTAATAG AACAAGTTGC ATCCAAGACA
AAAGAGAGTG CTGGTGATGG AACAACAACA GCAACCATTT TGACTCAGAA GATGGTTCAG
GAGGGTTTGA AAAATATTGC CTCTGGCGCA AACCCTATGG AGTTAAAAAA AGGTATGGAG
GCAGGCCTAT CTTTTGTCTT AGAAAAATTA AGTTCCAAAA GTATTTCATT AAGTGGTTCT
GACATCCAAA AAGTTGCAAC AGTTAGTGCT GGAGGTGATG AAGAAATTGG ATCTATAATT
TCGAAAGCAA TGGATATTGT TACTTCAGAT GGTGTAATAA CTGTCGAAGA ATCGCAATCA
TTAGAAACAG AATTAGATAT AACTGAAGGT ATGTCTTTTG ATAGAGGTTA TAGTTCTCCA
TATTTTGTAA CGGACCAAGA AAGACAAGTT TGTGAACTTG AAAATCCAAA AATATTAATA
ACTGATCAAA AAATCTCAAC TTTAGTTGAT CTAGTTCCAA TACTTGAAGA AATTCAGAAG
TCAGGCTCAC CTTTTCTAAT TCTTGCTGAA GATATCGAAG GAGAGGCTTT AACTACTCTA
GTTTTAAATA AGAATAGTGG GGTTTTAAAT GTTGCTTCCG TAAGGGCTCC ATTATTTGGT
GAGAGAAGAA AAGCTGCCCT CGAAGATATT GCAATTCTTA CAGGGGCTAA GTTAATTAGC
GAAGATAAAT CGATGACACT TGATAAAGTA TCGATTAACG ATTTAGGTAA AGCAAAAAAA
ATAACTATCA CAAAGGACAA AACTACAATT GTTGCCTTCG AAGACACTAA AGATTTAGTT
AAAGGGAGAG TAGAGAAATT AAAGAGAGAA GTTAATATAA CTGAATCTGA GTATGATCAA
GATAAAATCA ATGAAAGGAT AGCCAAACTA GCTGGAGGAG TAGCTCTTAT CAAAGTAGGA
GCTGCCACAG AAACAGAGAT GAAGTATAAA AAATTGAGAA TCGAAGATTC CCTTAATGCT
ACGAAAGCTG CTATTGAAGA GGGTGTTGTT TCTGGAGGAG GACAAACTCT AATTGAAATA
TCAGATGACC TTTTAAATTT AAGTAAAACA TCTACAGATG ATTTAAGAAC AGGGATAAAT
ATAGTCAAAG AAGCCCTCTT GGAACCCACC AAACAAATAG CAAAAAATGC TGGTTTTAAT
GGAGATGTAG TTGTCGCTGA AATTAAAAGA CTTAACAAAG GCTTTAATGC TAATTCAGGA
AAATATGAGG ACTTAAAAGA TTCAGGGATA TTAGATCCAA CCAAAGTAAT AAGATTAGCT
CTTCAAGATT CAGTATCTAT TGCAGCTATG CTCCTCACAA CAGAAGTTGC GATGGCAGAC
ATTCCAGAGC CTGAAGCCGC AGGCCCTGGA GGACCAGGTG CAGATCCAAT GGGAGGAATG
GGTGGCATGG GAATGCCAGG TATGGGTGGC ATGGGAATGC CAGGTATGGG TGGCATGGGA
ATGCCAGGTA TGGGTGGCAT GGGAATGCCA GGTATGGGTG GCATGGGAAT GCCAGGTATG
ATGTAG
 
Protein sequence
MAKQLSFSNE SREALEKGVN FVANAVKVTI GPKAKNVVIE KKFGSPDIVR DGSTVAKEIE 
IENPISNLGA KLIEQVASKT KESAGDGTTT ATILTQKMVQ EGLKNIASGA NPMELKKGME
AGLSFVLEKL SSKSISLSGS DIQKVATVSA GGDEEIGSII SKAMDIVTSD GVITVEESQS
LETELDITEG MSFDRGYSSP YFVTDQERQV CELENPKILI TDQKISTLVD LVPILEEIQK
SGSPFLILAE DIEGEALTTL VLNKNSGVLN VASVRAPLFG ERRKAALEDI AILTGAKLIS
EDKSMTLDKV SINDLGKAKK ITITKDKTTI VAFEDTKDLV KGRVEKLKRE VNITESEYDQ
DKINERIAKL AGGVALIKVG AATETEMKYK KLRIEDSLNA TKAAIEEGVV SGGGQTLIEI
SDDLLNLSKT STDDLRTGIN IVKEALLEPT KQIAKNAGFN GDVVVAEIKR LNKGFNANSG
KYEDLKDSGI LDPTKVIRLA LQDSVSIAAM LLTTEVAMAD IPEPEAAGPG GPGADPMGGM
GGMGMPGMGG MGMPGMGGMG MPGMGGMGMP GMGGMGMPGM M
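
The sequences above are listed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of how that might be done, with a GC-percentage helper that could be compared against the GC content field of the record. The function names `clean_sequence` and `gc_percent` are hypothetical and are not part of any database tool. Only the first line of the gene sequence is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join a blocked sequence listing by removing spaces and line breaks."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence from the record (sample input only).
first_line = "ATGGCCAAAC AGTTAAGTTT TTCTAATGAG TCAAGAGAAG CGCTAGAAAA AGGTGTGAAT"

print(len(clean_sequence(first_line)))
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence text to `gc_percent` would give a value to compare with the reported GC content, and `len(clean_sequence(...))` on either sequence would give a value to compare with the reported gene and protein lengths.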