Gene OSTLU_31755 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_31755 
Symbol 
ID5002100 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp482284 
End bp484251 
Gene Length1968 bp 
Protein Length655 aa 
Translation table 
GC content57% 
IMG OID640417521 
Productpredicted protein 
Protein accessionXP_001418023 
Protein GI145347115 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0216247 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACGG CGTGCGAGCG GTGCGAGCTC GCGATCGTCG GCAGTGGACG CGCGTGTTTG 
AGCGTCTTGT CGAGGCTGAG CAGAGGTCGC GCCGAACGAG CGGTGGTGAT CGACCCGTCG
GGGGCGTGGT TGTACTCGTT CGCGAGGACG CAACTCAGGC TGGGGGCGAC GCACTTGAGA
TCGACGACGA CGCAAGTGCC GTTTGAGAAC GCATGTGGAC TCGAGCGGTA CATAGAGACG
TTGGGGAAGA AACGCGATGT GGTGCGGACG GGTAGTGGGT TCGCGGGAGT GCCGAGTGTG
AGGGTGTTCG CGGAGTACTG CGCGAAGACG GTGGCGGAGC GATTCGGTGG CGTGCGCGTG
GAGCGAGGCA CCGTCGTGGA CGTGCGGTGG TGCGACGAGA CGTCGGAAGA GGTTCGTGAC
GCGTTCAAAG CAATTGACGA GAGCGAAGGC GCTGCTGGCG TGGGCCAGGA CGAGCGTGAT
GCGGTGGCTA TGATGCGGTG CGGCGCGATT TTATTGACGC TCGACACCGG AAAGACGTTT
CTCGCCGCGC GATGCGTGTG GACGCCGAAG TTTTCCCTTC CGCTAGTTCC GTCCTGGGTC
CTGGAAGCGA AAGCGTCGTA CGCAAAGTAC AACGCGTCTT ACGACCGCCA AAGCATCGAT
TGTGGAATCA TGAACGCAGC CGACGTGGAT ATGAGCGCCA CTGATTGCGC GCGAGGAAAG
TGCATCTTGG TCGTCGGCGG AGGAACGACG GCGGCGACGC TCGCGCTCGC TGCGCAGACG
CGCGGCGCCA AGGTAGTGAC GCTGATGTGC CGAAGGAAAA TCACCGTGAG TGAGTTTGAA
TGCGACGTGA AGTATTTTGG GAATAAAGGA CTGTACGAAT TTCACGCGTG CGCCGACGCG
CAAATTCGCG CCAACAAGCT CGAGTCCTTC AAGTCGAAGG CGAGCGTGAA CGAGCACACG
CACCGTCGTT TGCGAGATGC AGCCTTAAAG ACTGATAATA TTCGAGTTCT GGAGCAACGC
GTGTTGAATG GCGCCGTGTG GAGCGACAGA GAAAAGAAAT GGCGCGTTCG ATCGGCGCCT
ACGGATGAAG CCAAAGTGGA GTTCGAGTCG GCGATGTACA GAAGGTATCG CGACGAAGGA
ATCGAGCCCG ATAGTTCAGC GCTCGCCATT TTCGAAAAAG AAGTCACATC TACACACGAT
GAGATATGGC TAGCGTGTGG TGAATTCGTG GACCTGGCGA AAGATCCAGC GCTCCGCACG
CTCGTGGAGA CAACGTCCGT AGAAATCGCT CGAGGCTTCC CTGCGCTCGC GGAGGAAAAA
ATTGAATGTG CGCACGACAA AGGACAAACC GCGGCTGCAG GCGGAGGTGG AGGCTGTCGA
TGGCCAGGAA CGTCGATGTA CGTGTTGGGT GCGTACGCCT CGCTAACCAT AGGTCCCGGG
GCAGACCTCC CCGTGGGGCA TCGAATGGCG GCGAAGCAAG TCGTCGACGC AATGAAGAAA
CACGAGACGG CGATATTGCG CAATAAGAAC CCGTATCAAG TCGCCGAAAC GAGCAGTGAA
GCACAGCGAA CGCCGGATAG AGGTGAAACG TTCGATCGCT TCAAAAAGCT CCCACCTGAG
CTCGCGGACA AAGGTTTGAT TGATATTGAA AGCTTGATCG CTGGCGCCGC GATGGAGCGC
GTCGAGTTGG ACAATTACGA AATGTACGAA GAAGATATGA GGGCGGAGAT TCGTTTGAAA
ATTCCCGAGG CCATTCTTGC GCGCGATGTT TACGTGTGCT TTCAAGATCG AGCGTTAGAG
ATGTGGGCGC TTGGTAAACA AAATGCTTAT CGCTTCTTCA TCCGCAAGCT CTACAAGAAC
GTCATCGTTG ATCGCTGTTC GTACAGGGTG TACGCCAATA AAAATCGCGT CGTGCTCAAC
ATTCACAAGT ACACGAACCA TTATTGGCGG TATCTTAGAG ACAGGTAG
 
Protein sequence
MATACERCEL AIVGSGRACL SVLSRLSRGR AERAVVIDPS GAWLYSFART QLRLGATHLR 
STTTQVPFEN ACGLERYIET LGKKRDVVRT GSGFAGVPSV RVFAEYCAKT VAERFGGVRV
ERGTVVDVRW CDETSEEVRD AFKAIDESEG AAGVGQDERD AVAMMRCGAI LLTLDTGKTF
LAARCVWTPK FSLPLVPSWV LEAKASYAKY NASYDRQSID CGIMNAADVD MSATDCARGK
CILVVGGGTT AATLALAAQT RGAKVVTLMC RRKITVSEFE CDVKYFGNKG LYEFHACADA
QIRANKLESF KSKASVNEHT HRRLRDAALK TDNIRVLEQR VLNGAVWSDR EKKWRVRSAP
TDEAKVEFES AMYRRYRDEG IEPDSSALAI FEKEVTSTHD EIWLACGEFV DLAKDPALRT
LVETTSVEIA RGFPALAEEK IECAHDKGQT AAAGGGGGCR WPGTSMYVLG AYASLTIGPG
ADLPVGHRMA AKQVVDAMKK HETAILRNKN PYQVAETSSE AQRTPDRGET FDRFKKLPPE
LADKGLIDIE SLIAGAAMER VELDNYEMYE EDMRAEIRLK IPEAILARDV YVCFQDRALE
MWALGKQNAY RFFIRKLYKN VIVDRCSYRV YANKNRVVLN IHKYTNHYWR YLRDR