Gene OSTLU_87326 details


Gene Information

Locus tag: OSTLU_87326
Symbol:
ID: 5002118
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009359
Strand:
Start bp: 540130
End bp: 541380
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table:
GC content: 58%
IMG OID: 640417539
Product: predicted protein
Protein accession: XP_001418043
Protein GI: 145347158
COG category:
COG ID:
TIGRFAM ID:
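
The coordinate and length fields in the table above are related by simple arithmetic. The sketch below shows one way to cross-check them. It assumes the start and end positions are 1-based and inclusive, and that the 1251 bp of this unspliced CDS include the terminal stop codon. Both assumptions are consistent with the listed values but are not stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 540130
END_BP = 541380
GENE_LENGTH_BP = 1251
PROTEIN_LENGTH_AA = 416


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid codons in an unspliced CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1251
print(coding_codons(GENE_LENGTH_BP))   # 416
```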


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0450714
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGCACC AATCTCGCTC GGGCGCGAAA CGCCGCAAAA TCGCCGTCGG CGACGATCCG 
CTGCTTCGAG CGTTTCTCAA TCAGTACGCC GGCTGCGCCG TCCCCGACCA CGGAAACCCA
CCTCAGATCG ATCGCGTGCG CTACGCCTCG CTCGACGCCG ACGCGTTCGA GACGAAATAC
GTCCTCGGTC GTCGTCCGGT GATCCTCACG GACGCCCTGA GCGACGAGAG CGATCTCGCG
GGGGCGAACA TGTGGACAAA CGCTTTTCTG CGGGACGCGT GCGGCGGCGA CGCGTCGACG
TCGCGCGTAG AATCGCGCGA TGAAGGCGCG AACGAACGGT TCGGACGAGG AAAATATAAA
TCCATGACGT TCGATGCGTT CATGGATAAG TACGAGGCGC GAGACGGGGG ATGGTATCTC
TCAGCGGGAG GGAAGGCGGA ACCGTTTGCG GCGCCGGGAC GGTTGTTGGC GAGCGAGACG
GCGAAGTCGA GCAAAGGGAA GCTGCCGTTA CGTCCGAAAG GAGTGCCTCG CTCGCTCGTG
CCGGCGGATG TGAATTTATG GATGGGTAGG AATTCGACCT CGACGTCGAG CGGTCTGCAT
CACGATTATC ACGACAATTT GTACGTGTTG GTGCGAGGCG AGAAGACTTT CAAAGTGTTT
TCTCCGCGCG ACGCTGGACG CATGTACACG ACTGGGAAGA TATGTTGGGT GCACGAGAAC
GGGCTCATTA ATTACAAAGG CGCGAGCACG CTTCAGGATG GCGATGTGGC ATTAGCCACG
GGGGAGTCCG CGTTGGAGTT TAGATTGCGC AAGGCTCGCG GCGACGAAGA TGAAGACGAA
CCGGCGATCG GCAGTAGCGA CGACGATTTC GCCGACTTTG ACGGCGTCGA TGATTACGAT
GAACTCAATC AAACTCCAGA ACCTTCCGAC GACGATTCGG CCGAAGAGGA AAAACGAAAA
GACGATGATG ACGAAGAGGA CCCACCGAGT TTTAGCCGGA TCGATTACGA CCGTCTCGAT
GAATTTCCTC TCTTCAAAAC CGCGTCGGCG ATGGAGTTCA CCGTGAAAGC TGGCGAAGCG
CTGTACTTAC CCACGGGATG GTTCCACGAC GTCTCGAGCC GAGATGCAGG CGAAGGACAC
GTTGCATTCA ATTATTGGTT TCATCCACCG CCTCTCGGTG AGGCGTGGGA AGGTCGCCGC
GCGCTCTGGG AAGAAGATTT TCAGCGCTCG GTGCTGCCAA AGTTCCAATA G
 
Protein sequence
MSHQSRSGAK RRKIAVGDDP LLRAFLNQYA GCAVPDHGNP PQIDRVRYAS LDADAFETKY 
VLGRRPVILT DALSDESDLA GANMWTNAFL RDACGGDAST SRVESRDEGA NERFGRGKYK
SMTFDAFMDK YEARDGGWYL SAGGKAEPFA APGRLLASET AKSSKGKLPL RPKGVPRSLV
PADVNLWMGR NSTSTSSGLH HDYHDNLYVL VRGEKTFKVF SPRDAGRMYT TGKICWVHEN
GLINYKGAST LQDGDVALAT GESALEFRLR KARGDEDEDE PAIGSSDDDF ADFDGVDDYD
ELNQTPEPSD DDSAEEEKRK DDDDEEDPPS FSRIDYDRLD EFPLFKTASA MEFTVKAGEA
LYLPTGWFHD VSSRDAGEGH VAFNYWFHPP PLGEAWEGRR ALWEEDFQRS VLPKFQ
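
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how that could be done, along with a GC-content calculation and a length consistency check between a CDS and its protein. The helper names are hypothetical, the stop-codon assumption is the same as for the length fields, and the demonstration uses a toy sequence rather than the gene listed here.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


def lengths_consistent(dna: str, protein: str) -> bool:
    """True if the DNA has one codon per residue plus one stop codon (assumed)."""
    return len(dna) == 3 * (len(protein) + 1)


# Toy input in the same blocked layout (not the real gene).
toy_dna = join_blocks("ATGGCGTGCT AA")
toy_protein = join_blocks("MAC")
print(toy_dna)                                    # ATGGCGTGCTAA
print(gc_percent(toy_dna))                        # 50.0
print(lengths_consistent(toy_dna, toy_protein))   # True
```

Pasting the full gene and protein listings into `join_blocks` would allow the same checks to be run against the GC content and length values reported in the Gene Information table.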