Gene NATL1_05061 details


Gene Information

Locus tag: NATL1_05061
Symbol: groEL
ID: 4780924
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 461078
End bp: 462769
Gene Length: 1692 bp
Protein Length: 563 aa
Translation table: 11
GC content: 40%
IMG OID: 640083781
Product: chaperonin GroEL
Protein accession: YP_001014333
Protein GI: 124025217
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02348] chaperonin GroL
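
The coordinates and lengths listed above can be cross-checked with a short calculation. The following is a minimal sketch in Python, assuming that the start and end positions are 1-based and inclusive and that the gene length includes a single untranslated stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 461078
end_bp = 462769
gene_length_bp = 1692
protein_length_aa = 563

# Assumption: coordinates are 1-based and inclusive.
computed_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
computed_protein_length = computed_length // 3 - 1

print(computed_length == gene_length_bp)             # True
print(computed_protein_length == protein_length_aa)  # True
```

Under these assumptions, 1692 bp corresponds to 564 codons, of which 563 encode amino acids.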


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.125633
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAAAC TTCTAAGTTT TTCAGACGAA TCTCGTGGTG CTCTCGAAAA AGGAGTAAAC 
AATTTAGCCA ACGCTCTAAA AGTCACAATT GGACCTAAAG GTAGAAATGT TGTTATTGAA
AAAAAATTTG GAGCTCCAGA TATAGTTAAT GATGGAGTAA CTATTGCTAA GGAAATAGAT
CTTGAAGATC CATTTGAAAA TATAGGAGCA AAGCTCATTG AACAGGTTGC ATCAAAAACG
AAAGAAAAAG CTGGAGATGG AACAACTACT GCAACAGTTT TAGCTCAATT TATGGTTCAA
GAGGGTTTGA GAAATACAGC CGCTGGAGCA AGCCCAATCG AATTAAGAAG AGGAATGGAA
AAGGCTGTAG CTCAAATAGT TGATGATCTA AAGAAAAAAA GCAAATCAGT CAGTGGTGAT
GCTATAAAAC AAGTTGCGAC AGTAAGTGCC GGTGGAGACG AGGAAATAGG TTCCATGATT
GCAGATGCAA TAGATAAAGT AAGTTTTGAT GGAGTTATAA CTGTTGAGGA ATCCAAATCT
CTAGCCACCG AATTAGATAT CACTGAGGGA ATGGCATTTG ACAGAGGATA TAGCTCTCCA
TATTTTGTGA CAGATGAAGA TCGATTAATT TGCGAATTTG AAAATCCTTC AATCCTAATT
ACTGACAAAA AGATTTCATC AATTGCCGAT CTCATTCCTG TTCTAGAAAC AGTTCAAAAG
AACGGAACAC CATTAATAAT TCTTGCAGAA GAAGTAGAGG GTGAAGCATT AGCCACATTA
GTAGTAAATA AAAATCGTGG TGTTTTACAA GTAGCAGCTG TTAGAGCTCC ATCATTTGGC
GAGAGACGAA AAGCAGCTCT TGGAGATATT GCGGTATTAA CTGGTGGCAC ATTAATAAGC
GAAGACAAAG CAATGAGTCT TGAGAAAGTT CAAATTTCTG ACCTAGGTCA AGCAAGAAGA
GTAACAATTA CAAAAGACAG TACAACAATT GTCGCAAATG ATAATCAAAA CACCGAACTA
TCTAATCGCA TTGCATCAAT CAAGAGAGAA CTTGACGAAA CAGACTCTGA GTACGATCAA
GAGAAGTTAA ATGAGAGAAT AGCTAAACTT GCTGGGGGTG TAGCTGTAAT TAAAGTCGGA
GCTCCAACTG AAACTGAGTT AAAAAACAGA AAGCTCAGAA TTGAGGATGC TCTGAATGCA
ACTCGTGCAG CCATTGAAGA AGGTATTGTT GCAGGTGGTG GAACAACTCT TTTAGAACTG
AGTGAAGGGC TTGGAGATTT AGCTAAAAAG CTAGAGGGTG ATCAGAAGAC TGGAGTTGAA
ATTATAAAAA GAGCATTGAC TGCTCCAACA AAACAGATAG CGATAAATGC TGGATTTAAC
GGAGATGTTG TTGTTTCAGA TATCAAGCGT TTAGGCAAAG GCTTCAATGC ACAAACTGGA
GAGTACGTGG ATTTGCTTGA AGCAGGAATC TTAGATGCTT CAAAAGTAAT ACGACTTGCT
CTTCAAGATG CTGTATCAAT TGCCTCACTG CTCATAACTA CTGAAGTTGT TATTGCTGAC
AAACCTGAGC CCCCATCAGC GCCAGGAGCT GAAGGTGGAG ATCCAATGGG CGGAATGGGC
GGAATGGGCG GTATGGGCGG TATGGGCGGT ATGGGCGGTA TGGGCGGTAT GGGAATGCCT
GGAATGATGT AA
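
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before any computation. As a rough sketch, the GC content reported in the Gene Information section could be recomputed as below. The function name `gc_content` is hypothetical, and the rounding used by the source database is not known.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases in a DNA sequence.

    Whitespace (the 10-base block separators and line breaks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return gc_count / len(bases)


# Usage: paste the full gene sequence in place of this first row.
first_row = "ATGGCAAAAC TTCTAAGTTT TTCAGACGAA TCTCGTGGTG CTCTCGAAAA AGGAGTAAAC"
print(f"{gc_content(first_row):.0%}")
```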
 
Protein sequence
MAKLLSFSDE SRGALEKGVN NLANALKVTI GPKGRNVVIE KKFGAPDIVN DGVTIAKEID 
LEDPFENIGA KLIEQVASKT KEKAGDGTTT ATVLAQFMVQ EGLRNTAAGA SPIELRRGME
KAVAQIVDDL KKKSKSVSGD AIKQVATVSA GGDEEIGSMI ADAIDKVSFD GVITVEESKS
LATELDITEG MAFDRGYSSP YFVTDEDRLI CEFENPSILI TDKKISSIAD LIPVLETVQK
NGTPLIILAE EVEGEALATL VVNKNRGVLQ VAAVRAPSFG ERRKAALGDI AVLTGGTLIS
EDKAMSLEKV QISDLGQARR VTITKDSTTI VANDNQNTEL SNRIASIKRE LDETDSEYDQ
EKLNERIAKL AGGVAVIKVG APTETELKNR KLRIEDALNA TRAAIEEGIV AGGGTTLLEL
SEGLGDLAKK LEGDQKTGVE IIKRALTAPT KQIAINAGFN GDVVVSDIKR LGKGFNAQTG
EYVDLLEAGI LDASKVIRLA LQDAVSIASL LITTEVVIAD KPEPPSAPGA EGGDPMGGMG
GMGGMGGMGG MGGMGGMGMP GMM
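
The protein sequence can be checked against the gene sequence by translating the latter. The sketch below uses the standard codon assignments, which translation table 11 shares for all internal codons (table 11 differs from the standard code in the start codons it permits). The names `CODON_TABLE` and `translate` are illustrative, and translation is assumed to stop at the first stop codon.

```python
from itertools import product

# Standard codon assignments, enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block separators and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGGCAAAAC TTCTAAGTTT TTCAGACGAA"))  # MAKLLSFSDE
print(translate("GGAATGATGT AA"))                     # GMM
```

Both fragments agree with the start (MAKLLSFSDE) and the end (GMM) of the protein sequence listed above.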