Gene NATL1_18361 details


Gene Information

Locus tag: NATL1_18361
Symbol: groEL
ID: 4779967
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1497893
End bp: 1499524
Gene Length: 1632 bp
Protein Length: 543 aa
Translation table: 11
GC content: 42%
IMG OID: 640085125
Product: chaperonin GroEL
Protein accession: YP_001015656
Protein GI: 124026541
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02348] chaperonin GroL
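
The gene and protein lengths listed above follow from the start and end coordinates. The short sketch below shows that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. The variable names are illustrative only and are not part of the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 1497893
end_bp = 1499524

# With inclusive coordinates, the length is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # prints "1632 543"
```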


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.393349
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.975907
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTAAGC GCATCATTTA CAACGAGCAA GCCCGCCGAG CACTCGAGCG TGGTATTGAC 
ATTTTGGCTG AATCAGTCGC AGTCACTTTA GGGCCGAAAG GTCGTAATGT TGTTCTCGAG
AAAAAGTTCG GTGCACCTCA AATAATCAAT GATGGTGTAA CCATTGCTAA GGAGATCGAA
CTCGAAGATC ACATTGAAAA CACTGGAGTT GCTCTAATTC GTCAAGCCGC ATCAAAAACA
AATGATGCAG CTGGAGATGG CACGACCACT GCAACTGTTC TTGCTCATGC AATGGTCAAA
GCCGGTTTGA AAAATGTTGC AGCAGGAGCA AATGCAATCA CTTTGAAAAA AGGAATTGAT
AAGGCAACTG ACTTTTTGGT TGAAAAAATA AAAGATCATT CCAAGCCAAT TAGTGACAGT
AATGCAATTG CTCAATGTGG CACGATTGCT GCTGGTAATG ATGAAGAAGT TGGCAAAATG
ATTGCTGATG CTATGGATAA AGTTGGAAAA GAAGGTGTTA TCTCTTTAGA AGAAGGCAAG
TCAATGACCA CTGAGCTAGA GGTTACTGAA GGAATGCGTT TTGATAAAGG ATATATTTCT
CCTTATTTCG CAACTGATAC TGAGAGAATG GAAGCAGTAT TGGACGAGCC TTATATACTT
CTTACTGACA AGAAAATTGG TCTAGTTCAA GATCTTGTTC CTGTCTTAGA ACAAGTTGCT
AAAACTGGTA AACCTCTTCT AATAATTGCT GAAGATATAG AAAAAGAAGC TTTGGCGACA
TTAGTTGTCA ATCGCTTAAG AGGTGTTCTT AACGTTGCCG CAGTTAAGGC TCCTGGTTTT
GGTGATCGTA GAAAGGCGAT GCTTGAAGAT ATGGCTGTTT TAACTAATGG CCAACTCATT
ACTGAGGATG CAGGCTTAAA ACTTGAGAAT GCAACTCTTG ATATGCTTGG AACTTCAAGG
AGAGTGACCA TTAATAAGGA CACATCAACA ATTGTTGCTG AAGGCAATGA AGTCGCTGTA
AATGCAAGAT GTGAACAAAT CAAGAAGCAG ATGGATGAGA CCGACTCAAC CTATGACAAA
GAAAAACTTC AGGAAAGATT AGCTAAGCTA TCTGGTGGCG TAGCTGTAGT GAAAGTGGGA
GCTGCGACTG AGACTGAGAT GAAAGATAAG AAGCTTCGAT TAGAGGATGC AATTAACGCA
ACTAAAGCTG CAGTTGAAGA AGGCATTGTT CCTGGTGGTG GAACTACTCT CGCTCATTTA
GCTCCTGCAT TGGAAGATTG GTCCTCCACC AATCTTTCTG GTGAGGAATT AATCGGTGCA
AATATCGTTG AAGCTGCTCT TACTTCTCCA CTAATGCGCA TAGCTGAAAA TGCTGGTGCT
AATGGAGCAG TTGTTGCAGA GAATGTTAAA TCTAAGCCAG TTAATGATGG CTATAACGCT
GCAACTGGTG AATATGTAGA TATGCTTTCT GCTGGAATCG TTGATCCTGC GAAGGTAACA
AGATCAGGGT TGCAAAATGC TGCATCAATA GCAGGAATGG TTCTTACAAC TGAATGCATA
GTTGCTGATC TACCAGAGAA GAAAGACTCT TCTCCAGCAG GTGGTGGAAT GGGTGGTGAT
TTTGATTATT AA
 
Protein sequence
MAKRIIYNEQ ARRALERGID ILAESVAVTL GPKGRNVVLE KKFGAPQIIN DGVTIAKEIE 
LEDHIENTGV ALIRQAASKT NDAAGDGTTT ATVLAHAMVK AGLKNVAAGA NAITLKKGID
KATDFLVEKI KDHSKPISDS NAIAQCGTIA AGNDEEVGKM IADAMDKVGK EGVISLEEGK
SMTTELEVTE GMRFDKGYIS PYFATDTERM EAVLDEPYIL LTDKKIGLVQ DLVPVLEQVA
KTGKPLLIIA EDIEKEALAT LVVNRLRGVL NVAAVKAPGF GDRRKAMLED MAVLTNGQLI
TEDAGLKLEN ATLDMLGTSR RVTINKDTST IVAEGNEVAV NARCEQIKKQ MDETDSTYDK
EKLQERLAKL SGGVAVVKVG AATETEMKDK KLRLEDAINA TKAAVEEGIV PGGGTTLAHL
APALEDWSST NLSGEELIGA NIVEAALTSP LMRIAENAGA NGAVVAENVK SKPVNDGYNA
ATGEYVDMLS AGIVDPAKVT RSGLQNAASI AGMVLTTECI VADLPEKKDS SPAGGGMGGD
FDY
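
The protein sequence can be checked against the gene sequence by translating the DNA codon by codon. The following is a minimal sketch, not the database's own pipeline. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted), and the helper names `translate` and `gc_percent` are hypothetical. Only the first and last lines of the gene sequence are used here; the same functions would accept the full sequence pasted as one string.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(dna.split()).upper()


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGCTAAGC GCATCATTTA CAACGAGCAA GCCCGCCGAG CACTCGAGCG TGGTATTGAC"
last_line = "TTTGATTATT AA"

print(translate(first_line))  # MAKRIIYNEQARRALERGID
print(translate(last_line))   # FDY
```

Both outputs match the start and end of the protein sequence listed above. Each full line of the gene sequence holds 60 bases (20 codons), so every line begins in frame and can be translated on its own.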