Gene OSTLU_42459 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_42459 
Symbol 
ID5003399 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp79670 
End bp80849 
Gene Length1180 bp 
Protein Length356 aa 
Translation table 
GC content64% 
IMG OID640418820 
Productpredicted protein 
Protein accessionXP_001419063 
Protein GI145349277 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACGG AGACGCGCGA GGACGCGACG GATGGATACC GCGCGCGCGT GGACGTCGTG 
TACGCGCGGC TGAGCGGCGC GGACGCGAGG GAGGACGGCG ATAAATTGCG CGTGCTGCTG
GTGCGTCGCG CGAGGGCGAG ATCGAGCGCG CGGAGGGGAA GGGGAGAGGC GCGCGCGAGC
GGGCGGGACT GACGAGACGA CGACGCGCGC GGCGACGACT CGAACGAAGA ATTACGGAGA
ACACGGACGG GAATTGGTGC CCGTGGACGC GGCGGTGCGA TTGCTGGAGA CGATGGCGGA
GGGTGAGGAC GCGGTGGTGG CGCTGGCGCG CGGGAGGGGT GTGGACGAGA CGCGGTTGCG
AGCGATGCTT CGAGCGACGA CGTTCGCGGT GGTGCCGATG GAGAACGAGC GAGGACGGGA
GCTGGTGGAG AGGGGGGATC CGTGCGAGCG GAAGAATGGG CGAGGGGTGG ACCCGAATAG
GAATTGGGGG GTGAATTGGG GGGTGAAGGC GCCGGATTAC GATCCCAAGG AGGAGTTTCC
GGGGACGGCG CCGTTTAGCG AGCCGGAGAG TCGGATATTT CGGGATCTCG TCGCGTCGTT
CGAGCCGCAC GCGGTGGTGA ATTGGCACAG TGGGATGTCG GCGATATTCA CGCCGTATGA
TCACGTCGCG CGCGAGCCCA CTGGGGCGGG GGCGGAGGCG ATGATGCGTT TCGCGCGCGT
CATCGACGCC GAGCACTGCG CGAAAAAGTG CACGCTCGGT TCGGGCGGGA AGGGCGTGGG
GTACCTCGCG CACGGTACGG CGACTGATTA CATATACGAA AAGATGAAGG TGCCCGTGGT
GTACACGTGG GAAATATACG GCGATCTCGA TGCGCCTTTC GAGGATTGTC ATCGCGCGTT
CAACCCGACG ACGAAGGAGA CGCGCGACGC CGTCGTCGAA GCGTGGTTCG GAGCGCCCAT
CACGCTCGTA TCCATGCTCG ACCAGCACCC CGACATAAAT TTCAAACATC AAAGCGTCGT
GCCGGCTGTC GCGTCATCTT CATCGTTCGC AGTTGGTGAT GAACAGCGCC GTTTCCCTTG
GACAATTTCG TTGGCCTTCG CGTTCTTCAC CTTGGTGGCG CTGCGTCGAC TTCGGCGATC
CAAGCGCGGC GGTGGTGCGG CGATCGGTAC ATCGCTTTGA
 
Protein sequence
MTTETREDAT DGYRARVDVV YARLSGADAR EDGDKLRVLL NYGEHGRELV PVDAAVRLLE 
TMAEGEDAVV ALARGRGVDE TRLRAMLRAT TFAVVPMENE RGRELVERGD PCERKNGRGV
DPNRNWGVNW GVKAPDYDPK EEFPGTAPFS EPESRIFRDL VASFEPHAVV NWHSGMSAIF
TPYDHVAREP TGAGAEAMMR FARVIDAEHC AKKCTLGSGG KGVGYLAHGT ATDYIYEKMK
VPVVYTWEIY GDLDAPFEDC HRAFNPTTKE TRDAVVEAWF GAPITLVSML DQHPDINFKH
QSVVPAVASS SSFAVGDEQR RFPWTISLAF AFFTLVALRR LRRSKRGGGA AIGTSL