Gene OSTLU_12589 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_12589 
Symbol 
ID5002431 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009360 
Strand
Start bp103394 
End bp104719 
Gene Length1326 bp 
Protein Length441 aa 
Translation table 
GC content62% 
IMG OID640417852 
Productpredicted protein 
Protein accessionXP_001418159 
Protein GI145347408 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.96374 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGCCG ATCGAGGCGA CGGAGAGGCG CGAGAGGCGC GGCGTAAAAG ACCCGAGGTG 
TGGCGACCGG GATCGGCGGA AGACGAGGAC GTGGAGCTGG AGTACGACGA GAGCGCGTAC
GACGCGCTGC ACGCGTTTTC GCACGAGTGG CCGTGCCTGT CGCTGGACGT GATGAGGGAT
GATTTAGGCG AAGGAAGGGA GGTGTTCCCG CACGAGATGA CGATCGTGAC GGGAACGCAA
GCGATGGAGG CGACGAAGAA CGTGCTGAGC GTGATTAGGG TGAGTCGGAT TAAGAAGACG
CGGCGAGACG CGGACGCGGA CGAGGACATG GAGGCGAGCG ATAGCGACGA CGACGAGGAC
GGTGGGTCAG ACGCGCCGAC GTTGACGGTG GCGAGCGTGG TGCATCATGG ATGCGTGAAT
AGATTGCGAG CGATGCCGCA AAGACCGAGC ACGTGCGCGA GTTGGAGCGA TTCTGGGCAC
GTGATGATTT GGGATTTGAG CGCGCAGTTG AAAAAGGTGA TGACGTCGAC GAACGATTCC
AAGGGCAAGA TCGATCCACC GTCTCGAGTG ACGCCTACGC AAGTGTTCAC GGGTCATAAA
GACGAAGGCT ACGCCCTTGA TTGGTCCTCG GTGTGCGAGG GAAGGCTCGC GAGTGGTGAT
TGCGCCGGGG CGATTCACAC GTGGGACATG GTGCAAGGTA AGTGGGACGT CGGCGCGACG
CCGTACACCG GGCACTATTC GAGCGTGGAG GACATCCAGT GGTCCCCGAC CGAGCGAGAT
GTGTTCATAT CGTGCTCTGC CGATCAAACG GTGTGCGTGT GGGACACGCG ACAGCGCGCA
AAGCCGGCGT TGCGCGTCAA GACGCACGAC TCGGACGTCA ACGTTCTGTC GTGGAACAGA
CTCGCCAACA GCATGGTCGC CACAGGCGCC GACGACGGAT CGTTGCGAAT CTGGGATCTT
CGCAACTTTA ACGAAACCAA CGCGCAATTC GTGGCCAACT TCACCTTCCA CCGTGCCGCG
GTGACGTCCG TGGATTGGGC GCCGTTCGAC TCAGCCATGC TCGCCTCTTC CTCCGCCGAC
AACACCGTCT GCGTGTGGGA TTTAGCCGTC GAGCGCGACG CCGAGGAAGA AGCCGCCGCG
CTCGCGGCGA AGGATAACGC CGCGCCGCCC GAGGACCTTC CTCCGCAGCT CATGTTCGTC
CATCAAGGCT TGAAGGATCC AAAGGAAATC AAGTGGCACC GTCAAATCCC GGGCGCGTGC
GTCACCACCG CCGCCGACGG CTTCAACATT TTCAAAGCCT ACAACGTCGG CCCCGCCGTG
CCGTGA
 
Protein sequence
MDADRGDGEA REARRKRPEV WRPGSAEDED VELEYDESAY DALHAFSHEW PCLSLDVMRD 
DLGEGREVFP HEMTIVTGTQ AMEATKNVLS VIRVSRIKKT RRDADADEDM EASDSDDDED
GGSDAPTLTV ASVVHHGCVN RLRAMPQRPS TCASWSDSGH VMIWDLSAQL KKVMTSTNDS
KGKIDPPSRV TPTQVFTGHK DEGYALDWSS VCEGRLASGD CAGAIHTWDM VQGKWDVGAT
PYTGHYSSVE DIQWSPTERD VFISCSADQT VCVWDTRQRA KPALRVKTHD SDVNVLSWNR
LANSMVATGA DDGSLRIWDL RNFNETNAQF VANFTFHRAA VTSVDWAPFD SAMLASSSAD
NTVCVWDLAV ERDAEEEAAA LAAKDNAAPP EDLPPQLMFV HQGLKDPKEI KWHRQIPGAC
VTTAADGFNI FKAYNVGPAV P