Gene OSTLU_26305 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_26305 
Symbol 
ID5003943 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009364 
Strand
Start bp572921 
End bp576423 
Gene Length3503 bp 
Protein Length1120 aa 
Translation table 
GC content55% 
IMG OID640419364 
Productpredicted protein 
Protein accessionXP_001420218 
Protein GI145351726 
COG category[A] RNA processing and modification 
COG ID[COG5161] Pre-mRNA cleavage and polyadenylation specificity factor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.694924 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.69508 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAACG AGGACGCGCG CGAGAGCGGT TCAGGCGCGC GGTGTCAGCA CAATTACGTC 
GTCACGGTGC GCGCGCGCGC GTACGACGCG AAGCGACGCG ACGCTCGATC GTTGACGACC
GAGCGAACGA CCGACTGCGC GACGCTCTGC GCGATGACGA GCGATCGACT GACGACGACG
ACGACGACGA CGCGTGGTTT CCGCAGGCGC ACAAACCCAC CGTGGTCACG CACTCGGCCG
TGGGAAAGTT TACGTCGAGC GAGGCGACTG ATTTAATCGT CGCGAAGAGC ACGCGACTGG
AGGTGTATCG CTTGCACGCG GAGGGGCTGA AACCGGTGCT GGATGTGCCG ATAAATGGTC
GAATCGCGAC GATGTCGCTG TGCCAAACGG GATCGGGCGA TGGGAAGGCG CGGTTGTACC
TGACGACGGA GAGGTATGGG TTTACGGTGC TGTCGTACGA CGAGGCGAAC GAAGAGTTGA
AGACGGAGGC GTTCGGGGAC GTGCAGGATA ATATCGGTCG ACCGGCGGAT GATGGACAGA
TTGGGATCGT GGATGACACG TGTCGAGCGA TCGGGTTGCG GCTGTACGAC GGATTATTCA
AGGTGATTCC GTGCGACGAA AAGGGCGGGG TGAAGGAGGC GTTTAACATT CGCCTCGAGG
AGTTGCGCGT GGAGGACATC AAGTTCTTGC ACGGCACGCC CAAGCCGACG ATCGCGGTGC
TTTACCGAGA CACTAAAGAT GCGGTACACA TTAAGACGTA CGAGATCGGC ATTCGCGAGA
AGGAATTCGT GTCGTCGCCT TGGGCGCAGA ACGATTTAGA GGGTGGCTCG AATAAAATAA
TTCCGGTCCC CGCGCCCATC GGCGGTGTCG TCGTCTTGGG GCAGGAAATT ATCGTGTACT
TGAACAAGTT CGAAGACGAC GCAGATGTGT TTCTCAAAGC GATCAACATC CCCAACATCC
CCGATCGGAC GAACATCACG TGCTACGGCG CTATTGATCC GGACGGCTCG CGCTACCTGC
TGGGCGACGC GGACGGCATG CTCTACTTGC TCGTCATCTT ACACGACGGC AAGCGCGTTC
GAGAGCTTAA AATTGAGAGA CTTGGCGACA CGTCAATCGC GAGTACGCTT AGCTATCTCG
ATAACGGCGT GGTATTCGTC GGTAGCACGT ACGGCGACTC GCAACTCATC AAGCTGCACG
CGGAGAAGAC GAGCATCGAT AAAGACGGCA ACCCGACGTA CGTGCAAATT TTAGAAGAGT
TCACCAATCT CGGTCCCATT GTAGACTTTG CATTCGTTGA CTTAGAGCGG CACGGTCAAG
GGCAAGTGGT TACGTGTAGC GGGGCGTTAA AAGATGGGAG TTTACGCGTA GTGCGCAACG
GCATAGGCAT TGACGAGCAA GCGGTGATTC AGCTTCCCGG TGTCAAAGGC TTGTTTTCAC
TTCGCGATAG CGACGATAGC CAAATGGATA AGTACTTGGT CGTCACATTC ATAAACGAAA
CTCGCATCTT GGGCTTTGTC GGGGACGAAG GGGACACGTT GGACGAGACA GAAATTGCGG
GTTTCGACGC TGAAGCGCAA ACTTTGTGCT GTGGCAACAT GCAAGGCAAT GTTTTCCTTC
AAGTGACGCA CAGGGGCGTC CGCCTCGTCT CAAGAGGTGG CGACTTACTC GATGAATGGA
AGCCAAAGGA CGGTGCGGAG ATTCTTTCGG CGAAGTGCAA CCCGACTCAA ATTCTTGTTG
CAGCGGCGGG CGGGCAGTTA CACTGCTTGA ACGTGGCGAA GGGCAAAATC GTCCTCTTGG
CGAGTAAGAC ATTCGAGAAC GAAATAGCTT GCTTAGATTG CACGCCCATG GGCGATGGGA
TGAGCTCGCC GGTGTGCGCG GTTGGTTTGT GGTCCATGGA TATCGTGCTC GCGTCAATGA
GTGACTTGAG CGTCATCACG AAAGAAAGTA CAGATGAAGA CATCATCCCT CGATCGACGT
TGCTGTGTTC GTTCGAAGAC ATTCCATACT TGTTCGTAGG CTTAGGTGAC GGTCAGTTGA
TCACGTACGT TTTAGATCAG AACACTGGGG CTTTGAGCGG GCGCAAAAAA TTGAGCCTCG
GTACCAAGCC GATTACGTTG CAAACGTTTA AGAGCCATGC GACGAATGTG AGTAGTGTGT
TCGCGGCGTC GGATAGGCCG ACGGTGATCT TCAGCAACAA CAAAAAGCTA ATCTACTCCA
ACGTCAACGT GCAAGAGGTG TTGCATGTCT GTCCGTTCAG CAGCGAAGCG TTCCCGGACG
CTCTCGCGTT GGCGGGCGAC GAAGATTTAA CCATCGGCGG GATTGACGAC ATTCAAAAGC
TGCACATTCG CACGATCCCA CTCGGTGGTC ACCCGCGTAG GATCGCGCAT CAGGTCGACA
CGAACACATT CGCGGTCGCG GTCGAGCATT TGATGTCGAA GGGTGATCAA GAACTCTTCA
TCAGACTCAT CGACGACGGT TCGTTTGACA CGCTCCACCA GTTTCGATTA GAAGAGCACG
AATTGGCGAG TTCCTTGATG TCGTGCTCGT TCGCGGGCGA TTCGAGAGAG TACTACGTCG
TCGGTACGGG GTTTGCTTAC GAGCAAGAAG ATGAACCGTC GCGCGGGCGC ATTCTCGTCT
TACGCGTTGA GGCGGACGCC CTTGAACTCG TATCCGAGAA AGAAGTTCGA GGCGCCGTGT
ACAACTTGAA CGCATTCAAG GGTAAACTTC TCGCGGGGAT CAATTCTAAG CTAGAGCTAT
TCAAATGGAC ACCGCGCGAA GACGACGCGC ACGAGCTGGT GAGCGAGTGC TCGCACCACG
GCCAAATCAT CACGTTCTCC GTCAAGACGA GAGGCGATTG GATTCTCGTC GGCGATTTGC
TTAAGTCCAT GTCGCTGTTA CAGTACAAGC CCGAAGAAGG CGCGATCGAC GAGATCGCGA
GAGACTTTAA CGCAAATTGG ATGACTGCGG TGGCGATGTT AGACGATGAC GAAACCTATC
TCGGCGCCGA GAACAGCTTG AACCTGTTTA CCGTCGCTCG CAACATGAAC GCTATGACGG
ACGAAGAGCG TAGTCGTTTG GAAATCACGG GCGAGTATCA CCTAGGTGAG TTCGTCAACG
TGTTCTCTCC TGGTTCACTC GTCATGAGCC TCAAAGATGG CGACAGTTTA GAAGTCCCCA
CTTTGCTCTT CGGCACTGGT AACGGCGTCA TCGGCGTCTT GGCCAGTCTG CCGAAGGACG
CCTACGATTT CGCCGAGCGC CTTCAAACCT CCATGAACAA GCACATCCAA GGCGTCGGTG
GCTTGAAACA CGCGGAGTGG CGTTCATTCC GCCACACGCT TCGCCGCAAG AGCGATCCTT
CGAGGAATTT CGTCGACGGC GACCTCGTCG AATCGTTTCT CGACTTAAAA GTCGAGCAAG
CCGACGTCGT CGCCGCTGAC ATGAAGTGCG ATCGCGCCGA AATCATTCGT CGCGTCGAAG
AGCTTCAGCG CTTGACGCAT TAA
 
Protein sequence
MTNEDARESG SGARCQHNYV VTAHKPTVVT HSAVGKFTSS EATDLIVAKS TRLEVYRLHA 
EGLKPVLDVP INGRIATMSL CQTGSGDGKA RLYLTTERYG FTVLSYDEAN EELKTEAFGD
VQDNIGRPAD DGQIGIVDDT CRAIGLRLYD GLFKVIPCDE KGGVKEAFNI RLEELRVEDI
KFLHGTPKPT IAVLYRDTKD AVHIKTYEIG IREKEFVSSP WAQNDLEGGS NKIIPVPAPI
GGVVVLGQEI IVYLNKFEDD ADVFLKAINI PNIPDRTNIT CYGAIDPDGS RYLLGDADGM
LYLLVILHDG KRVRELKIER LGDTSIASTL SYLDNGVVFV GSTYGDSQLI KLHAEKTSID
KDGNPTYVQI LEEFTNLGPI VDFAFVDLER HGQGQVVTCS GALKDGSLRV VRNGIGIDEQ
AVIQLPGVKG LFSLRDSDDS QMDKYLVVTF INETRILGFV GDEGDTLDET EIAGFDAEAQ
TLCCGNMQGN VFLQVTHRGV RLVSRGGDLL DEWKPKDGAE ILSAKCNPTQ ILVAAAGGQL
HCLNVAKGKI VLLASKTFEN EIACLDCTPM GDGMSSPVCA VGLWSMDIVL ASMSDLSVIT
KESTDEDIIP RSTLLCSFED IPYLFVGLGD GQLITYVLDQ NTGALSGRKK LSLGTKPITL
QTFKSHATNV SSVFAASDRP TVIFSNNKKL IYSNVNVQEV LHVCPFSSEA FPDALALAGD
EDLTIGGIDD IQKLHIRTIP LGGHPRRIAH QVDTNTFAVA VEHLMSKGDQ ELFIRLIDDG
SFDTLHQFRL EEHELASSLM SCSFAGDSRE YYVVGTGFAY EQEDEPSRGR ILVLRVEADA
LELVSEKEVR GAVYNLNAFK GKLLAGINSK LELFKWTPRE DDAHELVSEC SHHGQIITFS
VKTRGDWILV GDLLKSMSLL QYKPEEGAID EIARDFNANW MTAVAMLDDD ETYLGAENSL
NLFTVARNMN AMTDEERSRL EITGEYHLGE FVNVFSPGSL VMSLKDGDSL EVPTLLFGTG
NGVIGVLASL PKDAYDFAER LQTSMNKHIQ GVGGLKHAEW RSFRHTLRRK SDPSRNFVDG
DLVESFLDLK VEQADVVAAD MKCDRAEIIR RVEELQRLTH