Gene OSTLU_16849 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_16849 
Symbol 
ID5003821 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp475646 
End bp477097 
Gene Length1452 bp 
Protein Length483 aa 
Translation table 
GC content63% 
IMG OID640419242 
Productpredicted protein 
Protein accessionXP_001419608 
Protein GI145350430 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.27107 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.13635 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAAGA CGTGGACGCC GTTCGAGCGC GCGCGCGTCA TGCGCGCGGG ACGCGCGAAA 
CACGTCGCGG GGAGCGAGGC GTACGGAAGG GGGGATTACG GCGAAGCGAT TGCGTGCTTC
ACGCGCGCGC TGGAGATCGA CGCGCGGAAT CAGTTTGCGC TGTGTAATCG AGCGCTGGCG
CGGTTGCGAC GGGGTGAGGA CGGCGACGCG GCGCGAGCGC TGGAAGACGC GAGGGCGTGC
GTGAGGAACG CGCCGACGTG GCATAAGGCG TGGTTTCGGT TGGGGCAAGC GCGCGCGGCG
AGCGGGGCGG AGGTGGAGGC GCTGAGCGCG TACGCCGAGG CGATTCGATG CGACGAGTGC
GGACAAGATC GGGATGAGTG CGAAGCCGTG ATGAAGAAAT TGCGCGCGCG CGTGGAACTG
GACGAGCGGA AGGCTGCGGA AGCGGCTGAG GCTGAGGCGC GGCGCGCGGT CGAGGTCGCG
GAGGAAGACG ACGAGATTGA ACGCCGCCGC GCGAGACGCG AGGCGAAATT GGAAGCGAAA
CGCGCGAGAC ACAAAGTCGC GGAAGAAAGG CTGAGCAAAC AACTGGGATA TCCCGAAGGT
ATGAACGCGG TGGATGTGAA CGCGTACGCG GATGAGTCTG ATGATAGCGA CGATAGCGCG
CTGGAGACGT ATGACGCGTG CGGGCGGGCG ATGTTCATCG AGCGGAAAAT TACAGACTTT
GTCAGCATCG CGAGGCGCCT TCGAACGAGC GTGGGCGCGA CGAAAACGAT GGACATGATG
TGCGAGTTGC GTCGGTACGG ATGCTGCCAG ATTTCGCTCG GGGCCGCGGT CGGGGCGACG
GCGAAATTTA AAGTCGCCGC CGACGCGGCG GATCGATGCG CGCTGCCGTA CGTCGTCATC
CCGGCGTGGC GGGAAGCGTC GACGCCGTGG CCGATGTCGC TCGCGCAGTC GCACCAAGAC
GTTCAGGATG TGATGTGGCT CATGGAAACA GTCGCTCGCG TGACCACGGG AGCGATATAT
GAAGACGCAG GAAAATCTGA AGCCGAGCTC GACCGCGCGC TCGATTCGCC TCTCAGCGCG
GGCGTTCTCG AAAGCGCGCC GTCCATCGAC GAAGTTTCTC TTTCGGCGCT CGTCTTCGCG
AATGACGAAT CATTTCTTCG CGCTTACGCT GAGAAGAACT CGGCGCTCGT GCGCTTGGAT
TGGCTCGAGG CGTACGAAGA GGAAGACGCC GAGGCGGATG GAGATTCGAC GCTCGTGGAG
CTCTTCGTGG GCGCCGAGTT GACGCGCCGA CTCGATGACG AATTTATCAC CGTGGACGAG
CTCGACCCAC GCGATCGCGA CGGGCACGGA CCCGATGGCA CGAGTTTCGC GTGTCCTAGG
GCAAAGCGTC CCGCCGATCG CACGCGCGTT TCATTTTTTC TTTGCCCGAA GAATTCGAAT
AGCCATTTTT AG
 
Protein sequence
MAKTWTPFER ARVMRAGRAK HVAGSEAYGR GDYGEAIACF TRALEIDARN QFALCNRALA 
RLRRGEDGDA ARALEDARAC VRNAPTWHKA WFRLGQARAA SGAEVEALSA YAEAIRCDEC
GQDRDECEAV MKKLRARVEL DERKAAEAAE AEARRAVEVA EEDDEIERRR ARREAKLEAK
RARHKVAEER LSKQLGYPEG MNAVDVNAYA DESDDSDDSA LETYDACGRA MFIERKITDF
VSIARRLRTS VGATKTMDMM CELRRYGCCQ ISLGAAVGAT AKFKVAADAA DRCALPYVVI
PAWREASTPW PMSLAQSHQD VQDVMWLMET VARVTTGAIY EDAGKSEAEL DRALDSPLSA
GVLESAPSID EVSLSALVFA NDESFLRAYA EKNSALVRLD WLEAYEEEDA EADGDSTLVE
LFVGAELTRR LDDEFITVDE LDPRDRDGHG PDGTSFACPR AKRPADRTRV SFFLCPKNSN
SHF