Gene OSTLU_33779 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_33779 
Symbol 
ID5001024 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp706040 
End bp707879 
Gene Length1840 bp 
Protein Length578 aa 
Translation table 
GC content59% 
IMG OID640416445 
Productpredicted protein 
Protein accessionXP_001417029 
Protein GI145345035 
COG category[S] Function unknown 
COG ID[COG3349] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGGCG CGCAGGCCTC GCGCGCGCGC GCGCGCGCGC ACGTCGCGAC CAAACACCGC 
GCTCGTCCTC GTGCGCGCGC GTCGACGCGC GTCGCGGCGT CAGCGACGTC GTCAGCGCGC
GCGAATGCGA TATCCAAAGT CGTCGTCATC GGTGCCGGCT GGGGCGGCAT CGGCGCCGCG
AAATCGCTGT GCGAAGCCGG GGCGGACGTG ACGCTCGTCG ACGTGCAAGA CGACCCCACG
GGCGCGACGC CGACGCTGAC GAAGAGCGGG AAACCGTTCG AGGCGGGAAC ACGCGGTGCG
TGTAGAATGA CGACGAAAAG AACGACGACG CGCGTCGCTC GCGTGCGCGA GAGAGCGCGA
CGACTGACGA TGAAACGAAA CTTTGTGCGA TGATTCAGGG TTTTGGAAAG ATTATCCTAA
TATCTCCGAT CTGTGTCGAG AGATGAACAT CGACGAAAAG GATGCGTTCA CAGAATTCAC
GCCGAGTTCG TTTTGGTCGC CGGACGGGTT GGAGGCGACG GCGCCGGTGT TTGGGGATTC
GATGGCGCTC CCGAGCCCTC TCGGACAGGT GTTTGCGACT TTCGATAACT TTAAACGACT
GCCTCTGAGC GATAGGGTGA CGATGGTCGG CTTGCTGTAC GCGATGTTGG ATTTGAATCG
CGATGAGAAG ACGTTTGAGG CGTACGATAG GCTCACTGCG CACGAGCTGT TCATTCGCAT
GGGATTGAGT AAGCGACTGG TGGACGATTT CATTCGCCCG ACGTTGCTCG TCGGGTTGTT
CAAGCCACCA GAGGAGCTTT CGGCGGCTGT CGTGATGGAA TTGTTGTATT ACTACGCGTT
GGCGCACCAA GACTCGTTCG ACGTGCGCTG GATCAAGACG AAGAGTATAG CCGAAGTCAT
CGTCGGGCCC ACGATGGCTC GCCTGCAGAG CGAATACGGA TTGAAAGTCA TGGGCTCGAC
GTTTGTCAGC AAAGTTGAAG TGGACGAGGC GACAAAAAAG GCGACGGCGG TGCACTACCT
GAAAAAGGAC GGCGGGAAAG CAGGCGTGAT CAAAGACGTT GATGCGGTGG TCTTCGCGCT
CGGCGCAAAG GGCATGAAGA GCGTGGTGTC CAACTCTCCC GTCTTGGCTC GCATGGCGCC
AGAGTTTAGC GCCGCAGCGT CGCTCGGCGG CATTGACGTC GTGGCGACGC GAATCTGGTT
GGATCAGTAC GTGGACGTGC AGCATCCAGC GAATGTATTC AGTAGATTTG AAGCCCTTCG
TGGCGCCGGG GGTACTTTCT TCATGTTGGA CCAGTTGCAA AAAGACTCGG AAGTTGAATT
GTGGGGTGGC GAAGAGCCAA AAGGAAGCGT CATCGCCGCC GATTTCTACA ACGGCGGCGC
CATCGCGTGC TTGAGCGATG ATGATATCGT AAAACTATTA ACGGACGAGC TCCTTCCGGC
CGCCGTACCG GGATTCAAGG GCGTCAAAGC TGTCGACTTT GAAGTGCGAA GGTACCCGGG
AGCGGTTTCC TGGTTCTCTC CAGGCTCGTA TTCGAAAAGA CCGCCGCTCG AAACGTCCAT
TTCAAACATC GTATGCGCGG GAGATTGGGT GCGCATGGGC GACAGAGAGC ACGGCGCTAA
GGGCTTGTGC CAAGAGCGCG CCTACGTCAG CGGTTTAGAG GCTGGAAATT CTCTTTTACG
CAGATGCGTC GTCTCCGGCG CGGGCGTTTC CGGCGGCGCC AGCCATCCCG TCATTCCGAT
TCGCCCCGAC GAAGCCCAAG TCGTGCTCGG CCGCGCGCTG AACAAGCAAA TTATGGACAC
CCTGAGCCCG TTCGGCCTGG CGTCGCCGTG GATTCGTTAA
 
Protein sequence
MLGAQASRAR ARAHVATKHR ARPRARASTR VAASATSSAR ANAISKVVVI GAGWGGIGAA 
KSLCEAGADV TLVDVQDDPT GATPTLTKSG KPFEAGTRGF WKDYPNISDL CREMNIDEKD
AFTEFTPSSF WSPDGLEATA PVFGDSMALP SPLGQVFATF DNFKRLPLSD RVTMVGLLYA
MLDLNRDEKT FEAYDRLTAH ELFIRMGLSK RLVDDFIRPT LLVGLFKPPE ELSAAVVMEL
LYYYALAHQD SFDVRWIKTK SIAEVIVGPT MARLQSEYGL KVMGSTFVSK VEVDEATKKA
TAVHYLKKDG GKAGVIKDVD AVVFALGAKG MKSVVSNSPV LARMAPEFSA AASLGGIDVV
ATRIWLDQYV DVQHPANVFS RFEALRGAGG TFFMLDQLQK DSEVELWGGE EPKGSVIAAD
FYNGGAIACL SDDDIVKLLT DELLPAAVPG FKGVKAVDFE VRRYPGAVSW FSPGSYSKRP
PLETSISNIV CAGDWVRMGD REHGAKGLCQ ERAYVSGLEA GNSLLRRCVV SGAGVSGGAS
HPVIPIRPDE AQVVLGRALN KQIMDTLSPF GLASPWIR