Gene OSTLU_29544 details


Gene Information

Locus tag: OSTLU_29544
Symbol: VEF3501
ID: 5006799
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009374
Strand:
Start bp: 420166
End bp: 421581
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table:
GC content: 58%
IMG OID: 640422220
Product: predicted protein
Protein accession: XP_001422742
Protein GI: 145357063
COG category:
COG ID:
TIGRFAM ID:
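
The start, end, and length values listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS length that includes one stop codon; both conventions are assumptions rather than something the record states, and the function names are illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


gene_length = span_length(420166, 421581)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # 1416 471
```

Under these assumptions the computed values agree with the Gene Length (1416 bp) and Protein Length (471 aa) fields of the record.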


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00819423
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGGCGCCGA GCGCGTACAC GCCGTCGACG TCGAAAACGT TCGACGGTCG CGCGCTGCGA 
CCGGACTACA AGTCGGCGAG CGCGGGCGCG GACGACGTGG ACGAATACCC CGTGAGCTCG
CCCGTGGAGA GCGATTGTTT CGGGGACTCG GAGAGCGAGC CGGACGCGCG TTTCGCGACG
TACACGGGCG CGATCAGCGT CTACGGCGAG TTTCGCGATC GACGGTCGCC GAGGCAGGTG
TTCTTGCGAC GGAATCTGAG TTATCACGTC GAACGCGCGA GCGCGGGGTC GAGGGATGAG
CGAGGAAGCG CGCGGGCGCG CGGCGCGGCG GCGCGCGAAC GCGAACTCGC GGAAAAATAC
GAGACGACGT ACGAGTACGA TCGATGCACG ACGAGAGTCA GAGTGCGAGG ATACGGGTGT
AATATGTGTA AATGCGTGTG CGTCGGCTTG CGGGGGTTGA TGACGCACCT GAGGGCGTCG
CATGATTTGT TTAGGTACAG CGCGAGACGG GAGGGACGGC GAGCCATCGT GCGAATTTAT
CCAAAGGGGG AGAATTTTAC CGCGGATAGG AGCTTCGTGT TGCGGTCGCA GACGGATATT
CAGACGGCAA ATGATAAGGA GTTTTCGTTT TATCGAGGGA AACGGTCGAA GAGGGAGGTA
TTTAGAGAGA GAGTGACCAA GGCGGAGCTC GACGCGTTGT ACGAGCGAGA CGTGCGAGCG
GCGCCGCCGC CGCGGGTGGT TTGGCGCGCG GTGCCGAATG ATGATCACTA CGATCGTGTA
AAATTAGAGA AGCGCAAACG CGCGCAAGAA GAAGAACGGA GAAGAATCGC GGCGTTACCT
TTCAAGCCGA TGGTATGCAA GAAGAACGCG AATGGGGCGG CGAGCGCCGC CGCGACGAAA
CCAAAACCAA AACCGCCAAA GAAACCACTT GGACCGTTTT ACAATTCGAG GTCATTCGTC
GAGATGTCAG AGATTCCAGA GCAAGATTCT GACGATGAGA ACTTGATTCC CGTAGAAATC
ATGGAAAGCA AACGCTTCAT GGAAGAGTTC GTGGATTTTA GCACCGAGGA ACTGTCATTC
ATGGAGGCTT GGAACGACGT CGCGATGAAA TTTCGCTGCG TCGCGGATTA CGAAGCCCCC
TCGCTATGCG AAGCGTTCGT GCGCGTGCAT GGCGATAAAC TGAAAGTTAG CGACGAGTTC
TTCAAAATGT TCGTGCTCAC GCTCTTCGGG ATGTACGAAA ACGGCATTCT CAACCGCCGC
GCGGTGGCGG AAGCCTTGGG AAAATGCAAG TCGCTCTCCG GCGCCTCCGC CTTCGATCGA
GTCTCCACCT CTGTCACAGA CCGCGAGCAC TGTTCCGTCG CCCTGTTCGA AAAGTTTCCC
AAGTTTGTCA CGACCCTGAA AAATCTTAGC TACTAG
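
The GC content field of the record can be recomputed from the gene sequence as printed. The following is a minimal sketch, assuming the sequence is pasted as text in whitespace-separated blocks as shown above; `gc_percent` is a hypothetical helper, not a function of the source database.

```python
def gc_percent(seq_text: str) -> float:
    """Percentage of G and C bases in a sequence given as whitespace-separated blocks."""
    bases = "".join(seq_text.split()).upper()  # drop spaces and line breaks
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence: 14 of 20 bases are G or C.
print(gc_percent("ATGGCGCCGA GCGCGTACAC"))  # 70.0
```

Passing the full gene sequence instead of the two-block example gives a value to compare with the reported GC content.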
 
Protein sequence
MAPSAYTPST SKTFDGRALR PDYKSASAGA DDVDEYPVSS PVESDCFGDS ESEPDARFAT 
YTGAISVYGE FRDRRSPRQV FLRRNLSYHV ERASAGSRDE RGSARARGAA ARERELAEKY
ETTYEYDRCT TRVRVRGYGC NMCKCVCVGL RGLMTHLRAS HDLFRYSARR EGRRAIVRIY
PKGENFTADR SFVLRSQTDI QTANDKEFSF YRGKRSKREV FRERVTKAEL DALYERDVRA
APPPRVVWRA VPNDDHYDRV KLEKRKRAQE EERRRIAALP FKPMVCKKNA NGAASAAATK
PKPKPPKKPL GPFYNSRSFV EMSEIPEQDS DDENLIPVEI MESKRFMEEF VDFSTEELSF
MEAWNDVAMK FRCVADYEAP SLCEAFVRVH GDKLKVSDEF FKMFVLTLFG MYENGILNRR
AVAEALGKCK SLSGASAFDR VSTSVTDREH CSVALFEKFP KFVTTLKNLS Y
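
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The Translation table field of this record is empty, so the sketch below assumes the standard genetic code (NCBI translation table 1); the name `translate` and the table construction are illustrative only.

```python
from itertools import product

# Standard genetic code (assumed), amino acids listed in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq_text: str) -> str:
    """Translate a DNA sequence (whitespace allowed) up to the first stop codon."""
    bases = "".join(seq_text.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence (10 codons).
print(translate("ATGGCGCCGA GCGCGTACAC GCCGTCGACG"))  # MAPSAYTPST
```

The ten residues produced here match the start of the protein sequence listed above, which is consistent with the standard-code assumption for this gene.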