Gene OSTLU_34350 details

Gene Information

Locus tag: OSTLU_34350
Symbol:
ID: 5000569
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009357
Strand:
Start bp: 774825
End bp: 776084
Gene Length: 1260 bp
Protein Length: 401 aa
Translation table:
GC content: 51%
IMG OID: 640415990
Product: predicted protein
Protein accession: XP_001416757
Protein GI: 145344475
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.719224
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.952947
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGACG CCGGGGGGCC GTATCTGAGC GTCGACGGGG ACGCGCTCAA CTCTTGGTCG 
TTCGATGTGT TCACCGTAGG GGAAAACAAG GAGCTATTTC CGTACGTCGT GCGCATGTTT
CACGAAATGA ACATGTTTGA GCTCGTCGAT CAGTTCAAGT TTCGTCGCTT CTTATCTGAG
GTCGAAGGCA TGTACCGGGA TAATCCGTAC CACTGCTTCA AGCACGCCGT CGACGTCACG
CACACGACGT ATTTATACAT TTGCGCGGTG AAAGAGCAAG TCAGTATGAC GCAAGTAGAA
ATATTCTCTC TCCTCGTTGC CGCTTTGGTG CACGATTTGG ATCATCCCGG GGTTACGAAC
GGGTATCTCA TCGCCACGCA CGATAACATA GCGCTGACGT ACAACGACGA GAGCGTGTTG
GAGAATCTTC ATCTCTCGCG GTTCTTTTCT TTGTGTCAAA ACAACGAAGA CGCAAACATT
CTCTCTGCAT TTGACGAGAG TACGTACAAA GAGATTCGTC GATCTATTAT AAGTTGCGTG
CTACACACAG ACATGGCGCA TCACTTCAAG CTCGTCTCGC GATTGAACGA ACTCGTCGCT
CTGGGTAAGA AGAACAACAT CATCAACGGT TCACCGATGA AATCGACAGT CATAACGAGC
ACTCGTGATG ATGATCCGAT AAATGTGAGC GTGACGTTCA AGACAGACGA CGAGCGTCAG
CTGATGCTGA ATGTTCTCCT GCACTGCGCG GACATCTCCA ACGCCGTCAA GCCAAACGAG
CTTTGCGTCA AGTGGGCATC GCGGGTGCTG GAAGAGTTCT TCAATCAAGG CGATCGCGAA
CGTTCAAGAG GAATGTCGAT CAGTCCTATG ATGGATCGCG AGACCACGTC GGTGGGGCTA
TCGCAGATAA ACTTTATTGA GTTTGTTATC GCGCCTCTCT ACGTGCAATT TGTGGCGGTG
TTCCCAGCCC TGAACGGCCT CTTGACGCGA CTGATCGAAA ATCGTCGATA CTATCAGGAG
ACGTATGAAA ACGAACTCGC CGACATGACT AAAGGCGATG GCACCGATCG CTCTCCAGAG
AAAGAATCAC TTCGCGCGCG ATTCCGCACG CTCATCGAAA AGCACTCGCT GCGTCGCTTC
GTGGATGAAA GCGATAATCT CATGAAGGCC ATTCTATCGC TTCCACACAC GCGACGGTCG
AGCGCGACTC GTTCTTTCAA CGCGATGCTT TCATCGCCGT CGAAGAAGAA AACGATGTGA
 
Protein sequence
MSDAGGPYLS VDGDALNSWS FDVFTVGENK ELFPYVVRMF HEMNMFELVD QFKFRRFLSE 
VEGMYRDNPY HCFKHAVDVT HTTYLYICAV KEQVSMTQVE IFSLLVAALV HDLDHPGVTN
GYLIATHDNI ALTYNDESVL ENLHLSRFFS LCQNNEDANI LSAFDESTYK EIRRSIISCV
LHTDMAHHFK LVSRLNELVA LGKKNNIING SPMKSTTDDE RQLMLNVLLH CADISNAVKP
NELCVKWASR VLEEFFNQGD RERSRGMSIS PMMDRETTSV GLSQINFIEF VIAPLYVQFV
AVFPALNGLL TRLIENRRYY QETYENELAD MTKGDGTDRS PEKESLRARF RTLIEKHSLR
RFVDESDNLM KAILSLPHTR RSSATRSFNA MLSSPSKKKT M
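
The sequences above are displayed in blocks of 10 characters, so they need to be joined before any downstream use. The following is a minimal sketch of how that might be done in Python, along with a check of the listed gene length against the start and end coordinates. The helper names (join_blocks, gc_percent, span_length) are hypothetical and not part of any database tool, and the length check assumes the coordinates are 1-based and inclusive, which is consistent with the 1260 bp figure listed in this record.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a sequence shown in 10-character blocks."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


# First displayed line of the gene sequence from this record.
first_line = "ATGTCAGACG CCGGGGGGCC GTATCTGAGC GTCGACGGGG ACGCGCTCAA CTCTTGGTCG"
dna = join_blocks(first_line)
print(len(dna))                      # six blocks of 10 bases
print(span_length(774825, 776084))   # compare with the listed Gene Length
```

Passing the full gene sequence text to join_blocks and then to gc_percent would give a value to compare with the listed GC content. Because the gene is marked as spliced, the joined gene sequence is not expected to translate directly into the 401 aa protein.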