Gene A9601_05071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_05071 
SymbolgroEL 
ID4717205 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp441717 
End bp443462 
Gene Length1746 bp 
Protein Length581 aa 
Translation table11 
GC content38% 
IMG OID640078219 
Productchaperonin GroEL 
Protein accessionYP_001008902 
Protein GI123968044 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTAAAC AGTTAAATTT TTCTAATGAA TCAAGAGAAG CGCTAGAAAA AGGTGTGAAT 
TTTGTAGCTA ATGCAGTAAA GGTTACTATT GGGCCAAAAG CAAAAAACGT TGTAATAGAA
AAGAAATTTG GTTCGCCGGA TATAGTAAGA GATGGATCTA CAGTTGCTAA AGAGATCGAG
ATTGAAAACC CTATTGCTAA TTTAGGTGCG AAATTAATAG AACAAGTCGC ATCCAAGACA
AAAGAGAGTG CTGGTGATGG GACAACAACA GCAACCATTT TGACTCAGAA GATGGTTCAG
GAGGGATTAA AAAATATTGC TTCTGGTGCC AACCCTATGG AGTTAAAAAA AGGTATGGAA
GCAGGCCTAG CTTTTGTCTT AGAAAAATTA AGTTCCAAAA GTATTTCATT AAGTGGTTCT
GACATCCAAA AAGTTGCAAC AGTTAGTGCT GGAGGTGATG AAGAAATTGG ATCTATAATT
TCGAAAGCAA TGGATATTGT TACTTCAGAT GGTGTAATAA CTGTTGAAGA ATCTCAATCA
TTAGAAACAG AATTAGATAT AACTGAAGGA ATGTCTTTTG ATAGAGGTTA TAGTTCTCCA
TATTTCGTAA CAGACCAAGA AAGACAAGTT TGTGAACTTG AAAACCCTAA AATATTAATA
ACTGATCAAA AAATCTCAAC TTTAGTTGAT CTAGTTCCAA TACTTGAAGA AATTCAAAAG
GCGGGCTCAC CTTTTCTAAT TCTTGCTGAA GATATTGAAG GAGAGGCTTT AACTACTCTG
GTTTTGAATA AGAATAGTGG GGTTTTAAAT GTAGCTTCCG TGAGAGCTCC TTTATTTGGT
GAGAGAAGAA AAGCTGCCCT TGAAGATATT GCAATTCTTA CAGGGGCTAA GTTAATTAGC
GAAGATAAAT CGATGACACT TGATAAAGTA TCTATTAATG ATTTAGGCAA AGCAAAAAAA
ATAACTATCA CAAAGGATAA AACTACAATT GTTGCCTTCG AAGACACTAA AGATTTAGTT
GAAGCGCGAG TAGAGAAATT AAAGAGAGAA GTTAACATAA CTGAATCTGA GTATGATCAG
GACAAAATCA ATGAAAGGAT AGCCAAACTA GCCGGAGGAG TAGCTCTTAT CAAAGTAGGA
GCTGCTACAG AAACAGAGAT GAAATATAAA AAGTTGAGAA TTGAAGATTC CCTTAATGCT
ACGAAAGCTG CTATTGAAGA GGGTGTTGTT TCTGGAGGAG GACAAACTTT AATTGAAATA
TCAGATGACC TTTTAAATTT AAGTGAAACA TCTTCAGATG ATTTAAGAAC AGGGATAAAT
ATAGTTAAAG AAGCCCTTTT GGAACCTACC AAACAAATAG CAAAAAATGC TGGTTTTAAT
GGTGATGTAG TTGTCGCTGA AATTCAAAGG CTTAACAAAG GCTTTAATGC TAATTCAGGA
CAATATGAGG ATTTAAAAGA TTCAGGAATA TTAGATCCAA CCAAAGTAAT AAGATTAGCT
CTTCAAGATT CAGTATCTAT TGCAGCTATG CTCCTCACAA CAGAAGTTGC GATGGCAGAC
ATTCCAGAGC CTGAAGCCGC GGCCCCTGGA GGACCAGGTG GAGATCCAAT GGGAGGAATG
GGTGGCATGG GAATGCCAGG AATGGGTGGC ATGGGAATGC CGGGTATGGG TGGCATGGGA
ATGCCGGGTA TGGGTGGCAT GGGAATGCCA GGAATGGGTG GCATGGGAAT GCCGGGTATG
ATGTAA
 
Protein sequence
MAKQLNFSNE SREALEKGVN FVANAVKVTI GPKAKNVVIE KKFGSPDIVR DGSTVAKEIE 
IENPIANLGA KLIEQVASKT KESAGDGTTT ATILTQKMVQ EGLKNIASGA NPMELKKGME
AGLAFVLEKL SSKSISLSGS DIQKVATVSA GGDEEIGSII SKAMDIVTSD GVITVEESQS
LETELDITEG MSFDRGYSSP YFVTDQERQV CELENPKILI TDQKISTLVD LVPILEEIQK
AGSPFLILAE DIEGEALTTL VLNKNSGVLN VASVRAPLFG ERRKAALEDI AILTGAKLIS
EDKSMTLDKV SINDLGKAKK ITITKDKTTI VAFEDTKDLV EARVEKLKRE VNITESEYDQ
DKINERIAKL AGGVALIKVG AATETEMKYK KLRIEDSLNA TKAAIEEGVV SGGGQTLIEI
SDDLLNLSET SSDDLRTGIN IVKEALLEPT KQIAKNAGFN GDVVVAEIQR LNKGFNANSG
QYEDLKDSGI LDPTKVIRLA LQDSVSIAAM LLTTEVAMAD IPEPEAAAPG GPGGDPMGGM
GGMGMPGMGG MGMPGMGGMG MPGMGGMGMP GMGGMGMPGM M