Gene OSTLU_42847 details


Gene Information

Locus tag: OSTLU_42847
Symbol:
ID: 5003266
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009362
Strand:
Start bp: 233128
End bp: 236169
Gene Length: 3042 bp
Protein Length: 984 aa
Translation table:
GC content: 62%
IMG OID: 640418687
Product: predicted protein
Protein accession: XP_001419319
Protein GI: 145349807
COG category: [R] General function prediction only
COG ID: [COG2319] FOG: WD40 repeat
TIGRFAM ID:
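
The coordinate and length fields above are linked by simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but the listed gene length of 3042 bp agrees with it). The function name `span_length` is hypothetical and used only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values taken from the Gene Information fields of this record.
gene_length = span_length(233128, 236169)
print(gene_length)  # 3042

# If all 3042 bp were coding, the product would have one residue per codon,
# minus the stop codon.
max_unspliced_aa = gene_length // 3 - 1
print(max_unspliced_aa)  # 1013
```

The listed protein length of 984 aa is 29 residues (87 bp) short of that upper bound, which is consistent with the gene being marked as spliced: bases inside the span that are not translated do not contribute to the protein.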


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.451507
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.305439
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGATCG CGACGCCGGC GCGGGACGGC GGCGGCGACG GCGACGGGAC GAACGACGGC 
GACGGCGACG GACGCGAGAC GCGCGCGCCG AAGGCGCGCG CGGCGCGACG GGAGACGCGC
GCGGATGAGG TGGTGTACGC GTGCGCGACG CGCGAGGGAG GGGTGTCGCT GACGCGGTCG
ACGACGCGGC GAGGCGACGC GGCGACGACG TCGACGACGA CGGCGCTCGC GCGGGACGAG
GGCGATGGGG AGGCGATGAC GTCGATCGCG ATGGATCGAA GCGGGACGAA GGTGTTTTGC
GCGAGTCGGA GCGGACGCGT CGTGCGGTTG GACGCGATGG AGGATGGGAG GGGGGGGACG
ACGATGCGGA GGACGAAGGC GTGGTCGCCG CACAAGACGT CGCCGGTGTT GGACATGTGC
GTGGACGTCA CGGGGACGCT GTTGTGCACG GGGAGCGCCG ATCGAACGGC GCGAGTGTGG
GACATCGAAC GAGGGTACTG CACGCACGCG TTTCGCGGGA AGCACGGAGG GGCGGTGACG
GCGACGGCGT TTCATCCGAG CGTTAGAGAG GCGAGAGCGT TCACCGCGGC GGAGGATGGG
TCGCTCGCGA TGTGGTCGCT CACGGGCGAG GCGGGCGTCG GGAAGAAGGG TAAGAAGGCC
TCATCCGATG GCTGCGTCGC GTTCGTGGCG AACGCGCACG TGAGTGCGGT GACGTCGATT
CGAATCGACG TCGAATCGAA CACGTTGCTC ACCGCGGGGC GGGACAAAAT AGTTCGAACG
TTTGATTTAG ACACGCTCAA TCCGCGAACG ACGACGGCGG TTCACGAAAC GATCGAGGAT
TGCGTGATTT TACGTCCGGA TTCAGCCATC GTTCGCGACT GCAAGGTGAA GCCGCCGCCG
GGCGGGCGCG GAGTCATTTT CGCCGTCGTC GGCGACGGCG GACGCGTTCG AGTTTGGCGC
GAGAACGCGG CGAAGCATTC GATCGAGTCC GCACCTCTCG TAGCGGTAAA TACGCTCACC
AAAGGAGGCG ATGATAATGA CGAAGATTTT GAAGCCGCCG CGGGAACGTT CACGAAGTGT
GCGCTCACAC ACGATGGGAA CCGTTTGATT GGCGTGAGCG GCGACGCGCG TTTGTTGACG
TACCAGGCCA ACGCAGAGAC GACGTCGTTG GAGATTGAAC GCGAAATTGT TGCGAATACG
GATGAAGTGA TCGGTTTGGC GTTCGTACCC GGTGCGAAAG AGCAAGCACT TCAAAAGAAG
CGAAACATCG ATGGCGATAG CGACGAGAAC GAGAATGAAG ACGAGCGAAC GCTCGCGAGA
CCGCCGAGAG AAGTAGCCGT GGTCACCAAC TCGCCCACGG TACGTATGTT TGATCCGACG
ACGATGTCAT GTGTCGGATC TTTGAACGGT CACAGCGCCG TTGTGCTCTC GGTTGATGCC
ACGATGACGA CGGACGGGAC AGCGCTCATT TTGACGGGCG CAAAGGATCA CACGGTACGA
CTGTGGGACG CCGCCACGCG AGAGTGCATC GCCGTCGGCG AAGGCCATGT CGGCGCAGTT
GCCGCGGTAG CGTTTCCCCC GAACTCGAAA AATGGCGCAC CGTTTGCCAT TTCGGGTGGC
GTCGACCGCG TGCTTCGCGT ATGGGACATA GATGGAGTTC GGCGAAATGG CGACGGCGAA
TTGAACGCTA CGGCGGCCAC AGTGGCGCAC GACAAGTCCC TCAACGGCGT TGCCGTTGCG
CCGCACCTCC GCATGGTTGC CACGTGTTCG AGCGATAAGA CGGCGAAGAT TTGGAAAATG
CCCGATTTAG TTCCGTTGGC CACGCTACGC GGCCATCGTC GTGGAGTTTG GGCGTGCGCG
TTTTCTCCTT CGGATCGCGT ACTCGCCACC GCGGGCGGCG ACAAGATGGT GAAGATTTGG
AGCGCCGATG ACCGTGCTGG GAGCGACACC AACGGTGCTT GCTTGCGCAC GCTCGAAGGT
CATACCGCAG CGGTGTTGAG CATTAAATTT ATGTCTCGAG GTACCCAGCT TGTCACCACG
GGTGGCGACG GGCTGTTGAA TTTATGGAAC GTCACCTCTG GGTCTTGCGC CGCATCCATC
GATGCGCACG AAGACAAAGC TTGGGCGCTG GCCGTGGCAA GCGATGGCGA TTGGATCGCC
ACTGGGGGCA CCGACGCGTC CATGGCGCTG TGGAAGGACT CCACGTCGAG CACCACCGCC
GATGCGGCGA AGAAGCACGC CCTCGCCGTC GAACGCGAGC AAGCATTCTT CAACGCCGAG
CGCTCGGGCG AAGTCACGAA GGCGATCGAT TTAGCGCTCA GACTCGAGCG CCCTGGTGCA
CTTCTTCGTG TTTTGACGAA ACTTCTGGAG AGTGACTACG AAAATGGCGA CGCCAGACTC
CGGAAATGCG TCGAGCCGTT GCACGAAGAC AAGCTCGCGC GAGTGCTCAA GTGCGTGCGC
GAGTGGAACA CAAACGGACG CACGTGCCAC GTCGCGCAAC ACGTTCTCGC CGCCATCTTC
CGCACACACA CCATGGAGGA ACTGAGCAAG GTTCCGGAGA TTTCTCAAAT CACTAGAGCG
TGTCGGGCGT ACACCGAGCG TCATCGCTCG CGTCTCGAGC GTCTGTATCG CGGAACTTTT
TTAGTCGACA CGCTCCTCTC GCGCACGGGC GCCTTGGTAG ACGACGAAGA GTCGATGGAA
GAAGTCAGGC GCACCCACGA AACACTGGAT AACTTCGGTT TCATGCGCGC GGATGATGAC
GCGCCGCCGC GACGTTTGCC TGCTCCGACG GCGAGCGAAG AAGACGAGCC CGCCGACGTC
GCAGACGAAG AAATGGCGGA GCCGAGCGAG GACGACGAGC CCGCCGGCGA AGGGGAAGCC
GCCGAAAAAG TGCGAGAAGA CGATGTCGTC ATGGGTCCGC CGAAAAAGCT CAAGCGATTA
AACGCGATAA AAAGACTCGC GTCCGACGTA GAGGATCAAC GCAAACTCTT GCGCGACCCG
TCTCCCCGTC ATACGCGTAG CGGCAAAAAA TTGAGCGGCT GA
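
The gene sequence above is printed in blocks of ten bases, six blocks per row, so the whitespace has to be removed before the sequence can be measured or compared with the Gene Length and GC content fields. The following is a minimal sketch of that clean-up and of a GC fraction calculation. The helper names `clean_sequence` and `gc_content` are hypothetical, and this is not necessarily how the database computed its own GC content value.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring layout whitespace."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence in this record, copied with its block spacing.
first_row = "ATGGCGATCG CGACGCCGGC GCGGGACGGC GGCGGCGACG GCGACGGGAC GAACGACGGC"
print(len(clean_sequence(first_row)))  # 60
print(f"{gc_content(first_row):.0%}")  # GC fraction of this row only
```

Passing the full sequence text to `clean_sequence` and taking its length should give the 3042 bp reported in the Gene Information fields. The GC fraction of the whole sequence can then be compared with the listed 62%.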
 
Protein sequence
MAIATPARDG GGDGDGTNDG DGDGRETRAP KARAARRETR ADEVVYACAT REGGVSLTRS 
TTRRGDAATT STTTALARDE GDGEAMTSIA MDRSGTKVFC ASRSGRVVRL DAMEDGRGGT
TMRRTKAWSP HKTSPVLDMC VDVTGTLLCT GSADRTARVW DIERGYCTHA FRGKHGGAVT
ATAFHPSVRE ARAFTAAEDG SLAMWSLTGE AGVGKKGKKA SSDGCVAFVA NAHVSAVTSI
RIDVESNTLL TAGRDKIVRT FDLDTLNPRT TTAVHETIED CVILRPDSAI VRDCKVKPPP
GGRGVIFAVV GDGGRVRVWR ENAAKHSIES APLVAVNTLT KGGDDNDEDF EAAAGTFTKC
ALTHDGNRLI GVSGDARLLT YQANAETTSL EIEREIVANT DEVIDERTLA RPPREVAVVT
NSPTVRMFDP TTMSCVGSLN GHSAVVLSVD ATMTTDGTAL ILTGAKDHTV RLWDAATREC
IAVGEGHVGA VAAVAFPPNS KNGAPFAISG GVDRVLRVWD IDGVRRNGDG ELNATAATVA
HDKSLNGVAV APHLRMVATC SSDKTAKIWK MPDLVPLATL RGHRRGVWAC AFSPSDRVLA
TAGGDKMVKI WSADDRAGSD TNGACLRTLE GHTAAVLSIK FMSRGTQLVT TGGDGLLNLW
NVTSGSCAAS IDAHEDKAWA LAVASDGDWI ATGGTDASMA LWKDSTSSTT ADAAKKHALA
VEREQAFFNA ERSGEVTKAI DLALRLERPG ALLRVLTKLL ESDYENGDAR LRKCVEPLHE
DKLARVLKCV REWNTNGRTC HVAQHVLAAI FRTHTMEELS KVPEISQITR ACRAYTERHR
SRLERLYRGT FLVDTLLSRT GALVDDEESM EEVRRTHETL DNFGFMRADD DAPPRRLPAP
TASEEDEPAD VADEEMAEPS EDDEPAGEGE AAEKVREDDV VMGPPKKLKR LNAIKRLASD
VEDQRKLLRD PSPRHTRSGK KLSG