Gene OSTLU_33336 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_33336 
Symbol 
ID5003629 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp54615 
End bp56028 
Gene Length1414 bp 
Protein Length223 aa 
Translation table 
GC content60% 
IMG OID640419050 
Productpredicted protein 
Protein accessionXP_001419692 
Protein GI145350604 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TCCGAGCTCT GTGTAGTCAA GCGGGCTCAC GGAAGCGATC CCCATACCAT CTTGCTCCCA 
CACGGCAAAC CGATCCTCAC CTCGCGAAAT AGCTTTCTTC CACACAGACA TGTGAGATAG
AAGACATCCA AGGTGGTGGT TATACGGGAA TACGCATTTT TCGCCGACGC AACGGTGCTG
TGGATTGAGC TGCGATTCGT GAACCTGTCT GATGGGATAC GGCAACTCAC TGGTTGGAAC
GTTCGCTTCT GAAGCAGCAA GGGCTCCCGC TAGTACATCG TGCGTCGCGC TTCGCGCTGT
CTCGAAGGTA TCCGGCCAAA GCGAAGGAAA CGTCGCACTC GAGATTTCCA CAGTATTGCG
CGCTTCGGCG TCGCTCAAAC CACCGTGCGT TTTGAACAGG TTGAACACGT AATCGCGTCC
TTTCCTAGAT CTCTCCATCT CGTCTCCGGG CGACGTATCT GCTATGATAT ACGTCCGCAC
TCCCGTCGCT CCACTGATCC CGAGCGTCGA TTTCCCACCG CCGCGAGCTC GTGTCGCTCC
ACTGATCCCG AGCGTCGATT TCCCACCGCC GCGAGCTCGT GCGCTCGTCC TCGCGTCGTC
GACCCCCCCG CGCGACGCGA CCGCCATCCC CGCGCCGACC GCACACGCCA TCAGCGCAAC
ACTCGCGAGA CATCCTCGCG ACGTCGACGT TCTCGCTCGG GATCGCTCGC TCACGTCGGT
CTCACCGCGC TCTTCGTCCA GGATGCTCGC AACACCCTCA ACGACGCCGT TCGCGGCGTT
GCGCGCGAGC AAATTCGAGC GCTCGCCGCG AGAGATGCTT TGCGCGTCTT TGTCACTCAT
TCTCGCGTCG CGTGGGCGCG CGCAGTGTCG GCTGCGGACG CGCGTCGGTC TCTGTCGATC
GGTCGGACAT TTGTCGGCGA CGGAGCTACA GCTACAGCTA GGCTACACCT ACTCCTATTG
GGGCGACTCG CTTGTACCGC ACGGAAGCGC TTTCCAGACT CGCGAACCGC TGTTCGCGCG
AGATGGGAAG GATGCGGACG GCGAACTCGC GCGAGATGCG ACGCCGACCG AGAAGAAGAG
CGACGGAAGG ACGGAAACGC CGCTGTCGAT CAAGGTGTTG AGCATGGTGT CGTACTTGTT
CCTGCAAGTG GCGGGAACGA TATTCGCGAG TCTGAGCCGG AACAGTGATG ATGATTATCC
GTACGATACC GTGGTGCTGG CGTTCACGAT GGAGTCGGTG AAGCTGGTGT TGTCGTTCAT
CTTTTTGACG ACGTCGCGGG CGTGCGGAGG AGTCGAAGAA GTGACGTGGA GCGCGAAACG
CTTCACCTCG TTTGCGCTTC CCGCGCTGTG CTATTTCGTC GCGAATAACT GTATGCTCCT
CATCATACAA GAACTGGGGC CGTCGACGTA TTGA
 
Protein sequence
MLATPSTTPF AALRASKFER SPREMLCASL SLILASRGRA QCRLRTRVGL CRSVGHLSAT 
ELQLQLGYTY SYWGDSLVPH GSAFQTREPL FARDGKDADG ELARDATPTE KKSDGRTETP
LSIKVLSMVS YLFLQVAGTI FASLSRNSDD DYPYDTVVLA FTMESVKLVL SFIFLTTSRA
CGGVEEVTWS AKRFTSFALP ALCYFVANNC MLLIIQELGP STY