Gene OSTLU_33394 details

Gene Information

Locus tag: OSTLU_33394
Symbol:
ID: 5003726
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009363
Strand:
Start bp: 145834
End bp: 147012
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table:
GC content: 53%
IMG OID: 640419147
Product: predicted protein
Protein accession: XP_001419502
Protein GI: 145350199
COG category: [S] Function unknown
COG ID: [COG2850] Uncharacterized conserved protein
TIGRFAM ID:
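
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; neither convention is stated in the record itself.

```python
# Values copied from the Gene Information fields above.
start_bp = 145834
end_bp = 147012

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (the record says "Is gene spliced: No")
# and its final codon is a stop codon that is not counted in the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1179 0 392
```

Under these assumptions the result matches the listed Gene Length (1179 bp) and Protein Length (392 aa).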


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value: 0.551934
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAGAGT ATTGGCAGAA AAAGCCTCTG CTGATGCGGC AGGCGATACC GAACTTTCGA 
CCGCCGCTGG ATGGGAACGA AATCGCTGGT TTGGCGTGCG AGGAGGACGC GAGCGCGCGC
ATCTTCGTGC GCGAGGGGGA CGACGAGCAG TCGTGGAGGA AGAAGATTGG ACCGTTCGAA
GAAAGTGATT TGACATCCTT ACCCGAAGAC AAGCCGTGGA GCCTGATCGT TAACGATTTG
GACGTGCAGG CGCAACCGTT TGGGGACATG TTGGAACTCT TCAACTGTTT TCCGCGTTGG
CGAATTTCTG ATATTCAAGC GAGCGTATCA CCGGACGGCG GGGGCGTAGG ACCGCACTCC
GATCACTTTG ATGTATTTCT TCTTCAAGCC GAAGGCGAAA AAGTTTGGGC CGTGGCGGAT
AACGAGGAGT ACTGGCCAGA TAATGATGCG GCATTTGTCC CAGAATGTGA AATTCGCGTG
CTCAAAAGCT TTGTCGAGGA CGATTCCTTC ACGTTGGTTC CGGGTGATAT GCTTTACTTG
CCCCCCAAAA TCGCTCACAA CGGCGTGGCG ACGAACTCAA AACCAGGCGT GAGCGTAACG
TTGAGTATAG GCTTTCTAGC GCCGACGACG GATGAACTCG TCTTGTCTTA CACGCAACGA
GCATCTGAAA AATTGAAGGG CTCGCGTTGG TCCGATCCTT GGCTCAAACC GGTCGAAGAC
GTCGGTGCAA TATCCGCTGA ATCTATCACG TATGCATCGG AGATAATTAA GCGCACGTAT
CCGAAGAATG ATGCCGAAGT GGCGCGTTGG TTTGGTTGTC ACACGACGGC GCGCACCGGC
GAGGACGACG ACGCGGACGA GAACGAAGTG AGCATCGAAG AACTATTAGC GGCTTGGGAA
CACCAAGGTC TAGTCGCGAG AGAAGATTTA CGCTTCGCTT TCGTGGAAAA GGTTGCGGAT
GATAGTTTGA AGAACGCGCT GTTTTTCGCA AACGGAGAAT GTTGGGATGT CGTCAGCCCG
GCCGCTGTGA AAACAGCCAC CGTCATCGCA AATAGAGGCG AGCTTTACGA AGAAGACACG
CAGACGGAGG AGTGTGATTT CGATGATGAA GCCTTAAAGC TCGCACTAAC GCTATTTGAG
CGTGGTTATC TCTATTTCCC CGAGGATGAA GACGATTAA
 
Protein sequence
MREYWQKKPL LMRQAIPNFR PPLDGNEIAG LACEEDASAR IFVREGDDEQ SWRKKIGPFE 
ESDLTSLPED KPWSLIVNDL DVQAQPFGDM LELFNCFPRW RISDIQASVS PDGGGVGPHS
DHFDVFLLQA EGEKVWAVAD NEEYWPDNDA AFVPECEIRV LKSFVEDDSF TLVPGDMLYL
PPKIAHNGVA TNSKPGVSVT LSIGFLAPTT DELVLSYTQR ASEKLKGSRW SDPWLKPVED
VGAISAESIT YASEIIKRTY PKNDAEVARW FGCHTTARTG EDDDADENEV SIEELLAAWE
HQGLVAREDL RFAFVEKVAD DSLKNALFFA NGECWDVVSP AAVKTATVIA NRGELYEEDT
QTEECDFDDE ALKLALTLFE RGYLYFPEDE DD
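
The gene and protein sequences above can be cross-checked by translating the nucleotide blocks. The following is a minimal sketch, not the database's own method. It assumes the standard genetic code (NCBI translation table 1), because the Translation table field in this record is empty. The helper names `clean`, `translate` and `gc_fraction` are hypothetical and introduced only for illustration.

```python
from itertools import product

# Assumption: standard genetic code (NCBI table 1); the record does not say.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to print sequences in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence above. The 19 lines before the
# last one hold 60 bases each, so the last line also starts in frame.
first_line = "ATGCGAGAGT ATTGGCAGAA AAAGCCTCTG CTGATGCGGC AGGCGATACC GAACTTTCGA"
last_line = "CGTGGTTATC TCTATTTCCC CGAGGATGAA GACGATTAA"

print(translate(first_line))  # MREYWQKKPLLMRQAIPNFR
print(translate(last_line))   # RGYLYFPEDEDD
```

Both results agree with the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field; the function is shown here without an expected result.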