Gene OSTLU_33588 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_33588 
Symbol 
ID5003818 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp466251 
End bp467549 
Gene Length1299 bp 
Protein Length432 aa 
Translation table 
GC content58% 
IMG OID640419239 
Productpredicted protein 
Protein accessionXP_001419604 
Protein GI145350421 
COG category[S] Function unknown 
COG ID[COG3147] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.377381 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0708275 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGTTA AATCGGCGAT GAAGTCGGTG TCGGCGTCGC CGCCGACGTG GTCGCCGCCG 
AAGCCGAAGC CGAAGCCGAA GCCGAAGCCG AAGCCGAAGC CGAAGCCGAA GCCGAAGCCG
CCGTCGCCGT CGCCGCCGTC GCCGTCGCCG CCGCCACCGC CGAAGCCGAC GACGGCGATC
GAGCCGATCA TCGATCCGAA GTGCGACGAA TGGACGGCAA AGCTCGCGGC CGCGGCGTCG
GCTCCAGAGT TTGAGTATGA TTTTCCCGCG TACACGTGCT CGAACGAGCA CGAGGCGCCG
TGCATTCAGG ATTTTCAATT CATGAAGCTG CAGCCGGACG CGTGCGATGT GTTGAAGAGG
GACGAGGACT GGCCGGGGAG GTTCATGCAG CGAGCGCCGG GCGTCAAAAA TGGAGGCGAT
CTCGTGCGCG CGCTGGGCGC TGATAATACG ATCGTTGTCA TCGGGGCTTC GATGACCAAA
CAAATAGAAA TCGCTATGGA CTGCAGCATT CGGCGCGCGG GCGTCGACAA GTCTCAATCT
GCGAAAATCA TGAAGCGATG GGGGTGGGCG AGATGGTCGT TCGATAGTCA AACGTGTGAA
CTGGATCACA TGGTGGCGGC TCGGACGGGG GGTGACATTA ACTATTGGTT CTCGGCGAAG
GAGTACGTCG ACGGATGTTG GAAGGATACG AAGGCGTTCG ACATCAATGT GCTCGGTTGC
GAAGGCGCCA TTGACGACTG TGACGCGTCC TCACGCGTCG TCGTACTGAT GTACAACATC
GAGCATTACG GGGGACTCGG TAACGTGGAA TTTTTCGAAA AAGAGACCGA GTTTCTCGTG
AAGCGAGCCG TATCGTTGGG CGCCAAGGTG GTTCTCGCCA CGAGCCCGCC TAAGCACTTC
AGCGCCGATG GCGCGTATAG TAAAGAGGCA TACATAACGA AAATCAAGAG TGAAACAGTG
TGCACGTGTA CACCAACGGT CGAGTCGATA GACAAAAACC CGGCGTGGCG CGGCTATTTC
GAAGCCATCG AACGTCTGTC GCGAATACCG GGCGTTCTCG GCGTCATCGA TATTTTACGG
TCGTCCATGA AAGAATTCTA CCGTAGTCAC AAAGGCGGTC ACTGCGGGTA TTATGTGGAC
GAAGTTCCGG GCGAGAATTC GCAAAAGCCC AAGCGCGAGG TGAAGTATAA AGCGTGCTGC
GATTGCACGC ACTACTGCTT CGATCCAGCG CTCTGGGACC AATACTTCCT CGATCCTCTC
GTCGACATTT TAGACCGCGG GACATCGCCC GCGAGTTAG
 
Protein sequence
MDVKSAMKSV SASPPTWSPP KPKPKPKPKP KPKPKPKPKP PSPSPPSPSP PPPPKPTTAI 
EPIIDPKCDE WTAKLAAAAS APEFEYDFPA YTCSNEHEAP CIQDFQFMKL QPDACDVLKR
DEDWPGRFMQ RAPGVKNGGD LVRALGADNT IVVIGASMTK QIEIAMDCSI RRAGVDKSQS
AKIMKRWGWA RWSFDSQTCE LDHMVAARTG GDINYWFSAK EYVDGCWKDT KAFDINVLGC
EGAIDDCDAS SRVVVLMYNI EHYGGLGNVE FFEKETEFLV KRAVSLGAKV VLATSPPKHF
SADGAYSKEA YITKIKSETV CTCTPTVESI DKNPAWRGYF EAIERLSRIP GVLGVIDILR
SSMKEFYRSH KGGHCGYYVD EVPGENSQKP KREVKYKACC DCTHYCFDPA LWDQYFLDPL
VDILDRGTSP AS