Gene OSTLU_50173 details

Gene Information

Locus tag: OSTLU_50173
Symbol:
ID: 5003240
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009362
Strand:
Start bp: 352112
End bp: 353309
Gene Length: 1198 bp
Protein Length: 389 aa
Translation table:
GC content: 55%
IMG OID: 640418661
Product: predicted protein
Protein accession: XP_001419137
Protein GI: 145349431
COG category: [R] General function prediction only
COG ID: [COG2319] FOG: WD40 repeat
TIGRFAM ID:
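
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding region is one codon per residue plus a single stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 352112
end_bp = 353309
protein_length_aa = 389

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: coding region = one codon per residue plus one stop codon.
coding_bp = (protein_length_aa + 1) * 3

# Bases in the gene span that the coding region does not account for.
noncoding_bp = gene_length_bp - coding_bp

print(gene_length_bp, coding_bp, noncoding_bp)  # 1198 1170 28
```

Under these assumptions the span reproduces the listed Gene Length of 1198 bp, and 28 bp of the gene sequence lie outside the coding region.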


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value: 0.828985
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GAAAAAGGCG CCGAGCGATA TAGCGTCGAT GATTGACGCG CTGGACGATG AGGATGAGCG 
CGATGGTGGG TCGCAGGCCG GACCGTCGAA GGCGTTGGCG GCGTTCCGGG GCGAAGACGC
TAAAGCCAAC CTGCCGGTGT CCATCTTCGG TAGACACGCA AACGCGGCGA CTGGGGCGAA
TATCGCGAAG AGATTGGCGA GCGAATGGCC GGAGCCGGAG TGGCGAGCGC CGTGGAAGCT
TTATCGTGTG ATTTCGGGAC ACCAAGGGTG GGTGAGATCG TGCGCCGTCG ACCCGGGGAA
CGAGTGGTTC GTCACGGGCA GCGCAGATCG CACCATCAAG GTTTGGGACT TGGCGAGTGG
CAGCTTGAAG CTCACTTTGA CCGGTCACAT CGAACAAGTC ACCGGTATCG TGGTGAGCCA
GAGGCATCCG TACATGTTCT CGTGCGGTTT GGATAAAAAA GTCAAGTGCT GGGACTTGGA
GTACAACAAG GTGATTCGTA ACTATCACGG GCACCTTTCG GGAGTGTATT CGATCGCGAT
GCACCCGACT TTAGATCTGT TGATGACGGG CGGTCGAGAC AGTGTGTGCA GAGTTTGGGA
CATGCGCACA AAGAGACAAG TGTACTGCCT CACTGGACAC GAGAACACCG TTGGATCCAT
ATTAGCGCAA GACGAGAATC CGCAGCTCGT CACCGGTTCG TACGACAGCA CGGTTCGCTT
GTGGGACTTG GCGACTGGTA AAACGATACA TACACTGACT CATCACAAGA AGGGCGTGCG
TGCTATGGCG ATGCACAAGA AGGAATTCGC ATTCGTTTCC GCTTCAGCTG ACAACATTAA
AAAATTTTCG TGCCACGGTG ACTTCATGCA CAACATGTTG AGCAAACAGA ATTCCATCGT
GAACACGCTG TCTATGAACG ACGATGATGT TGTCTTTAGC GGTGGTGATA ACGGTAGCAT
GTGTTTTTGG GACTACAAGT CTGGGCATTG CTTCCAACAA GAAAAGGCGT TGGTGCAACC
CGGTTCGTTG GAAGCCGAAT GCGGGATCTA CGCCTCCACT TTTGACGTCA CCGGTTCGCG
CTTGATCACG TGCGAGGCCG ACAAAACGAT CAAGATGTGG AAGGAGGACA CCGAGGCTAC
GCCTGAGAGT GCGCCGATTC TTCCCTTTGC CCCACCCAAG AATATTCGGC GAAGTTGA
 
Protein sequence
MIDALDDEDE RDGGSQAGPS KALAAFRGED AKANLPVSIF GRHANAATGA NIAKRLASEW 
PEPEWRAPWK LYRVISGHQG WVRSCAVDPG NEWFVTGSAD RTIKVWDLAS GSLKLTLTGH
IEQVTGIVVS QRHPYMFSCG LDKKVKCWDL EYNKVIRNYH GHLSGVYSIA MHPTLDLLMT
GGRDSVCRVW DMRTKRQVYC LTGHENTVGS ILAQDENPQL VTGSYDSTVR LWDLATGKTI
HTLTHHKKGV RAMAMHKKEF AFVSASADNI KKFSCHGDFM HNMLSKQNSI VNTLSMNDDD
VVFSGGDNGS MCFWDYKSGH CFQQEKALVQ PGSLEAECGI YASTFDVTGS RLITCEADKT
IKMWKEDTEA TPESAPILPF APPKNIRRS
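
The relationship between the gene sequence and the protein sequence above can be spot-checked by translating the ends of the gene. The sketch below is illustrative, not part of the original record. The Translation table field is blank, so the use of the standard genetic code (NCBI translation table 1) is an assumption, as is the choice of the first ATG as the start codon. The function and variable names are hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
# Assumption: the record leaves "Translation table" blank.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate complete codons in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases and last 12 bases of the gene sequence, spaces removed.
first_line = "GAAAAAGGCGCCGAGCGATATAGCGTCGATGATTGACGCGCTGGACGATGAGGATGAGCG"
last_bases = "CGGCGAAGTTGA"

# Assumption: translation starts at the first ATG in the gene sequence.
start = first_line.find("ATG")
n_terminus = translate(first_line[start:])
c_terminus = translate(last_bases)

print(start)       # 28 (0-based offset of the first ATG)
print(n_terminus)  # MIDALDDEDE
print(c_terminus)  # RRS
```

Under these assumptions the translated ends agree with the first ten residues (MIDALDDEDE) and the last three residues (RRS) of the protein sequence, and the gene sequence ends on a TGA stop codon.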