Gene OSTLU_31385 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_31385 
Symbol 
ID5001543 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009358 
Strand
Start bp781786 
End bp783493 
Gene Length1708 bp 
Protein Length462 aa 
Translation table 
GC content59% 
IMG OID640416964 
Productpredicted protein 
Protein accessionXP_001417356 
Protein GI145345734 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.000357166 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGAGTC CGACGCGACG ACGCGCGACG GTTGAGTCGA CGACGTCGTT CGAGCACTGT 
CGAGCGGGGA CGCCGACGGG ACGAGCGCAG GTGCGCGCGC GCGACGGCGA GGCGAGGCGA
GGCGCGCGCG CGCGGAATTC GCGTCGCGAA AACCGTTCGC GTCGCGCGCG GTGATAAGCG
CGGTTCGACG CGCGCGCGGG ACGCGCGGGA CGCGCGGATG GCGATGGCGA TGGCGATCGA
GGGGATGGAT GATCGTCGAG CGAATCGACG CGCGAGACTG ACGATGTTGA CGGTTCGATA
GGTCGATCGA TTCATCCCGA GCCGTAGCGC GTTGGATTTG GACGTGGCGC ACTACAATTT
GTCCCGAGAG GGCGGGGAAT CGGAGGTGGA TGATGCGGTA AAGGAGATCA AGTCTCCGGC
GAAGGTGCGC GCGTCGACGC GCTCGAAGCG CGCGCGGACG AACGCGTGGA GTGAATGAAT
GAGTTCGTTG AGACTGACGA CGAAATCGAT CGCGCGATGA TATGATGCAC AGGAAGCGTA
TAAGAAAAGT CTTGCGGATA ACTTCCACGT GGATAACGGA AGCGATTCGG CGAAGATTCT
CGCGTTCAAG TCAAAGGCGC CGGCGCCGCC GAGCGGGTTG GAAAACTCGG CGCGCGGTGT
CTACACGAAC AACTCCGCGG GAGTGAAGGC GAAGAAGACG TTCCGTCAGA TTCCCAGCGC
TCCCGAACGC ATCTTAGATG CGCCGGAGTT GATCGATGAC TACTATTTGA ACCTTATCGA
CTGGGGGTCG TCGAACCAAG TCGCGGTGGC GTTGGGGTGC ACCGTGTACA TGTGGAACGC
GGATACTGGG GCTATCAACC AATTGTGCCA GACCAACCCG GATGACGAAG ATGATTACAT
CACCTCCGTC AACTGGGGTG CGGACGGTAA GCACATTGCG GTGGGTACGA ACAGTGCGGA
GGTTCAAATT TGGGACGCGG CGCAGTGCAA GAAGGTGCGT ACGTTGCGAG GTCACGCCGC
GCGGGTGGGT GCGGTCTCGT GGAACGGTTC GCAGCTTGCA ACGGGTAGTC GTGATAACAA
CATCATGATT CACGACGTTC GCATTCGCGA GCATTGCACC TCGACGCTCC AGGTTCACCA
GCAAGAGGTT TGTGGCTTGA AGTGGAGCCC GAGTGGCAAT CAGCTCGCGT CTGGCGGTAA
CGACAACTTG TTGCACATCT TTGATGCGAG CTCCATCGGC AATCAACAAG CGTTGCACAG
ATTAGATGCG CATCAAGCTG CCGTTAAGGC TCTCGCCTGG TGTCCGTTCC AGTCCAACTT
GCTCGCTTCG GGCGGCGGTA CCGCCGACCG TTGCATCAAG TTTTGGAACA CGAACACCGG
CGCCATGCTC AACTCTGTGG ACACGCACTC GCAAGTGTGC TCGTTGCAGT GGAACACGCA
TGAGCGGGAG CTTTTGTCGT CGCACGGTTA CAGCCAAAAC CAGTTGTGTT TGTGGAAGTA
TCCGACGATG ACCAAGATGG CCGAGTTGAC GGGTCACCAA GCGCGAGTGC TTCACATGGC
GCAGTCTCCG GACGGTACCA CGGTGGTATC GGCGGCCGCG GATGAGACTT TGCGATTCTG
GAAGTGCTTC GATAACGCTA GCGAGAAGAC CAAGAAGGTG CGCGATTCCA ATGACTCATC
TGTTTTGCGC AGGTTCAATT TCCGCTAA
 
Protein sequence
MLSPTRRRAT VESTTSFEHC RAGTPTGRAQ VDRFIPSRSA LDLDVAHYNL SREGGESEVD 
DAVKEIKSPA KEAYKKSLAD NFHVDNGSDS AKILAFKSKA PAPPSGLENS ARGVYTNNSA
GVKAKKTFRQ IPSAPERILD APELIDDYYL NLIDWGSSNQ VAVALGCTVY MWNADTGAIN
QLCQTNPDDE DDYITSVNWG ADGKHIAVGT NSAEVQIWDA AQCKKVRTLR GHAARVGAVS
WNGSQLATGS RDNNIMIHDV RIREHCTSTL QVHQQEVCGL KWSPSGNQLA SGGNDNLLHI
FDASSIGNQQ ALHRLDAHQA AVKALAWCPF QSNLLASGGG TADRCIKFWN TNTGAMLNSV
DTHSQVCSLQ WNTHERELLS SHGYSQNQLC LWKYPTMTKM AELTGHQARV LHMAQSPDGT
TVVSAAADET LRFWKCFDNA SEKTKKVRDS NDSSVLRRFN FR