Gene OSTLU_41620 details


Gene Information

Locus tag: OSTLU_41620
Symbol:
ID: 5005054
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009367
Strand:
Start bp: 523573
End bp: 526383
Gene Length: 2811 bp
Protein Length: 936 aa
Translation table:
GC content: 58%
IMG OID: 640420475
Product: predicted protein
Protein accession: XP_001421175
Protein GI: 145353765
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0466] ATP-dependent Lon protease, bacterial type
TIGRFAM ID: [TIGR00763] ATP-dependent protease La
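
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(523573, 526383)
print(length_bp)                  # 2811
print(protein_length(length_bp))  # 936
```

Under these assumptions both results agree with the Gene Length (2811 bp) and Protein Length (936 aa) reported in the record.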


Plasmid Coverage information

Num covering plasmid clones: 136
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 78
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTACGCGA CGCGAGCGAT CGCGCGGAGA CTCGAACGAC ACGCGGCGCG ATGTAAAGGC 
GCACACGTCG CGCGCGCGGT GCGAGGGGCG AGAGCGCGAA CGACGTCCGC GCCGAGGGCG
CTGTTGGACG CGCTCGGGGC GGGGAGGGGC GACGCGGACG CGTTCGGGAC GAGGACGAGA
CGGACGAGGA ATGCGTTCGT GTCGAGCGTC GACGGCGATG GGTCGACGGG ATCGACGGGA
TCGTCGTCGT CGTCGTCGTC GAGCGCGTTC GGTGATTCGG CGTCGTCGGG GGGGATCATG
GTGAGCGCGT CGCACCCGAG TTCGCACCCG CAAGTGCTCG CGGTGCCTTT GCCGAGGCGG
CCGCTCATGC CGGGGATCAT CATGCCGGTC AAAGTGACGG ACGAAAAGCT CATAGCCGAG
CTTGAGGACA TGCGAAATCG TGGTCAAGCG TACGTGGGGG CGTTCTTGCA GCGAACGGAC
GCCGCGTCGT CGGCGTCCAA GGGGGAGGGC GAAGACGTCT TCGACGCGCT CTCGGCGATG
AAGCGCACGA CGACGTCGGT GGGTTTAGAC GGCGAGGAAA TGGTAGACGA AGACGAGGTT
GATCCCGCGG ATCACATGCA CGACATAGGT ACGTTTGCGC AGGTGCATAA CATCGTGCGC
CTGCCGACGG ATTCGACCAC GGGTGAAGAA TCGGCGACGC TGTTGCTCCT CGGCCACCGG
CGTTTGCGAA AGCTCGGGAC GATGAAGCGG GATCCGATGG TGGTCAAGGT GGAACACCTC
AAGGATGAGA AATTCGACGC CAACGATGAC ATCATTAAAG CCACGACGAA TGAGGTGGTG
GCGACGATCA AAGATTTGCT CAAGACGAAT CCTTTGCACA AGGAGACCCT GCAGTATTTC
GCTCAAAATT TCAACGACTT TCAAGATCCG CCAAAGCTCG CGGATTTGGG GGCGTCGATG
TGCAGTGCCG ACGACGCGCA GTTGCAACAC GTGTTGGAGC TATTATCGGT GAAAGAACGC
CTCGACGCGA CGCTCGAGTT GTTGAAGAAG GAGGTGGAAA TCGGCAAGCT CCAAGCCGAC
ATCGGGAAAA AAGTTGAGGA GAAAATTTCA GGCGACCAGA GGCGTTACTT TTTGATGGAG
CAGTTGAAAT CGATCAAGAA AGAGCTCGGT ATGGAGCGTG ACGACAAAAC CGCGCTCATC
GAAAAGTTTA CGAAACGTTT CGAGCCCAAG CGCGCGAGCG TGCCGGAAGA CACCGCCAAG
GTTATCGATG AAGAGCTTCA AAAGCTCGGC GGCCTCGAAC CGTCGTCGAG CGAATTCAAC
GTCACTCGCA ACTATCTCGA GTGGCTCACG TCACTGCCGT GGGGCGTGTG CGGCGACGAA
AAATTGGACA TATCTCACGC ACAAGAAGTG TTAGATAGCG ATCATTACGG CCTGGAGGAC
GTCAAAGATC GCATCTTGGA ATTCATCGCC GTCGGGCAAC TTTTGGGGAC GACGCAAGGA
AAAATCATCA CCATGGTCGG TCCGCCTGGG GTGGGGAAGA CATCCATCGG GCAATCGATC
GCCAAGGCGC TCGGGCGTAA ATTCTATCGC TTTTCCGTCG GCGGTATGAG CGACGTGGCG
GAGATCAAGG GCCATCGACG GACGTACGTT GGCGCGATGC CGGGCAAGCT GATTCAGTGC
TTGAAATCCA CGGGTGTGTG CAATCCAGTG GTTTTGATTG ACGAAATCGA CAAGCTCGGA
CGCGGTTATC AGGGCGATCC CGCGAGCGCG CTGCTCGAAC TACTCGATCC CGAGCAAAAC
GGCACGTTTC TTGATCACTA CCTCGACGTC CCCGTCGACT TGAGCAAGGT TTTATTCGTG
TGCACCGCCA ACGTGCTCGA CACGATTCCC GGGCCTTTGC TCGATCGCAT GGAAGTCGTG
CGGTTGTCTG GATACATCAC CGACGAAAAA GTGCAAATCG CTCGAACGTA TTTGGAGAAA
GCGGCGCGAG AAAAGAGTGG GCTGTCCGAC GTCGACGCGA GCATCACCGA CGCGGCGATG
GGGAAACTCA TCGGCGACTA CTGCCGCGAA GCCGGCGTGC GGAACTTGCA AAAGCATCTC
GAAAAGGTCT ATCGCAAGAT TGCCCTCAAG GTGGCTCGGG CGAAGAGTGC GGACGAAAAG
CTCGACTCTA TCGTCGTCGA TGTCGATGAC TTGGTCGATT ACGTCGGTCA ACCACCGTTC
GCGACCGACC GAATCTACGA CGTCACCCCG CCCGGAGTCG TCACCGGCTT GGCTTGGACG
GCGATGGGCG GATCCACGCT TTACATCGAG TGCACGGCTA TCGATTCCGG CGACGGCAAG
GGCGCGTTAA AGACGACCGG TCAACTCGGC GACGTCATGA AGGAATCGAG CACGATTGCG
CACACGTTCA CGCGAGGGTT TTTGGAATTG AAGGATCCCG GCAACAAGTA TCTCGCCGAC
ACGTCGCTTC ACGTTCACGT CCCCGCCGGG GCGACGCCGA AAGATGGACC GTCGGCGGGA
ATCACGATCA CGACGAGCCT GTTATCGCTC GCCATGAACA AACCGGTAAA GCCTAATTTA
GCCATGACGG GCGAGCTCAC GCTCACCGGT AGGGTGTTAC CGATCGGCGG CGTCAAGGAG
AAGACGATCG CCGCGCGTCG AAGCGGGGTG AAAACCATCA TTTTCCCCGA AGGAAACAAG
AAGGATTACG ACGAGCTTTC CGAAGACATT CGTGAAGGTT TGGACGCACA CTTTGTCTCG
ACGTACGACG AAGTCTATCG CCAAGCGCTC GATTGGGAAG CGTCTTCGTG A
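
The sequence is displayed in space-separated blocks of 10 characters, so the layout whitespace has to be removed before any computation. The following is a minimal sketch, assuming the text is copied exactly as shown; `clean_sequence` and `gc_percent` are hypothetical helper names, and how the record's own GC content value is rounded is not stated here.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Usage with a short toy string; paste the full gene sequence to check the record.
toy = clean_sequence("AT GC\nat gc")
print(toy)              # ATGCATGC
print(gc_percent(toy))  # 50.0
```

Applying `len()` to the cleaned gene sequence gives a quick check against the Gene Length field.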
 
Protein sequence
MYATRAIARR LERHAARCKG AHVARAVRGA RARTTSAPRA LLDALGAGRG DADAFGTRTR 
RTRNAFVSSV DGDGSTGSTG SSSSSSSSAF GDSASSGGIM VSASHPSSHP QVLAVPLPRR
PLMPGIIMPV KVTDEKLIAE LEDMRNRGQA YVGAFLQRTD AASSASKGEG EDVFDALSAM
KRTTTSVGLD GEEMVDEDEV DPADHMHDIG TFAQVHNIVR LPTDSTTGEE SATLLLLGHR
RLRKLGTMKR DPMVVKVEHL KDEKFDANDD IIKATTNEVV ATIKDLLKTN PLHKETLQYF
AQNFNDFQDP PKLADLGASM CSADDAQLQH VLELLSVKER LDATLELLKK EVEIGKLQAD
IGKKVEEKIS GDQRRYFLME QLKSIKKELG MERDDKTALI EKFTKRFEPK RASVPEDTAK
VIDEELQKLG GLEPSSSEFN VTRNYLEWLT SLPWGVCGDE KLDISHAQEV LDSDHYGLED
VKDRILEFIA VGQLLGTTQG KIITMVGPPG VGKTSIGQSI AKALGRKFYR FSVGGMSDVA
EIKGHRRTYV GAMPGKLIQC LKSTGVCNPV VLIDEIDKLG RGYQGDPASA LLELLDPEQN
GTFLDHYLDV PVDLSKVLFV CTANVLDTIP GPLLDRMEVV RLSGYITDEK VQIARTYLEK
AAREKSGLSD VDASITDAAM GKLIGDYCRE AGVRNLQKHL EKVYRKIALK VARAKSADEK
LDSIVVDVDD LVDYVGQPPF ATDRIYDVTP PGVVTGLAWT AMGGSTLYIE CTAIDSGDGK
GALKTTGQLG DVMKESSTIA HTFTRGFLEL KDPGNKYLAD TSLHVHVPAG ATPKDGPSAG
ITITTSLLSL AMNKPVKPNL AMTGELTLTG RVLPIGGVKE KTIAARRSGV KTIIFPEGNK
KDYDELSEDI REGLDAHFVS TYDEVYRQAL DWEASS