Gene OSTLU_18885 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_18885 
Symbol 
ID5006492 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009373 
Strand
Start bp99578 
End bp100696 
Gene Length1119 bp 
Protein Length372 aa 
Translation table 
GC content70% 
IMG OID640421913 
Productpredicted protein 
Protein accessionXP_001422391 
Protein GI145356341 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value0.0152699 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.0000000221114 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCGGCGA AGTGGATCGC GTACGACGAG ATCGAGTGCG TGACGCCGAG GTGGCCGGGG 
ATCGGGGCGA CGAGCGGGAC GCCGTCGACG ACGTGTCACG TTCGAGCGTC GAACGACGGC
GGCGAGCGAT TCGACGATCT CGTCGCGGCG GGGGGGGACG GCGATGGGAC GTTCGCGACG
CCGATGGCGA ACGCGGCGAA GATTGACTTC ACGTACGACG CGAGCGGCGC CGCGCCGAGC
GTGAGCTCGG TGGAGACGTC GCGCGCGCCC GCGGCGAGCG CGCGGGGCGC GCGGGGACCG
TTCGACGGCG GGACCGTGGT CACCGTGCGA GGGAGCGGGT TTTTATCGAG CTCGAACTTG
GCGTGCAAAT TTTTCGATCC GCTCGGGAAC GAGGTCGTGG TGCGCGCCTC GTACGAGAGC
TCGAGCGAGG TTCGATGCGC GTCGCCGTCG CAAATCGCGA GCGTCGACCC GTACGCGGTG
GATTACGTCG CGATGACGTC ACCGTGCTAC GCCTCCGCCG TGCACGTGTC GAACACCGGT
CTCGTCGGCT CGTGGAGCGC CGCGAACTCG GCGCCGACGG CGCAGTTCTT CTATTGCGAC
TTGTACGTCG ACTCGAGCGC GGCGTCGGCG TCGAGCGCCG ACGGGAGCGC TCTGAAACCG
TTCGACACGA TTCAACGCGC GCTGCAGTCA GCCTTGACCG GCGTCCAGAG CGCGAGCGAC
ACGCACATCG GGCGCGAGTT CGCCCTCGCG AATCCCACGG CGAACGCGCT GTTGAACGCC
GACGTCGTCC GGCTCGCCCC CGGCGCGTAC GCCGGCGCCG GCGCCGTCAG GCTCGTCGCC
GACCCCACGT CCTCGGTCCG CGTGCGCGCC GCGACGGGCG TCGCGTCCGC CGCCGCCGAC
CGCGCGTACA TCGATTGCGA GGGCTCGAAC CCACTCTTCG CCGATCTCGA CGCGCAGTCG
TCGTCGTCGC GCGTCGCCGT CGTCGTCGAC CCCGACGTCG CCGTCGTTCG ATGTCGCGAC
GCCGACGCGA GCGTCTACGG CGTCGAGAGT TGCGAGACCA TCGTCGCGAC CGATGGGTCG
GGCGTCACCG CGCGGACGTG TAATTTCGCC GCCGCCTAG
 
Protein sequence
MPAKWIAYDE IECVTPRWPG IGATSGTPST TCHVRASNDG GERFDDLVAA GGDGDGTFAT 
PMANAAKIDF TYDASGAAPS VSSVETSRAP AASARGARGP FDGGTVVTVR GSGFLSSSNL
ACKFFDPLGN EVVVRASYES SSEVRCASPS QIASVDPYAV DYVAMTSPCY ASAVHVSNTG
LVGSWSAANS APTAQFFYCD LYVDSSAASA SSADGSALKP FDTIQRALQS ALTGVQSASD
THIGREFALA NPTANALLNA DVVRLAPGAY AGAGAVRLVA DPTSSVRVRA ATGVASAAAD
RAYIDCEGSN PLFADLDAQS SSSRVAVVVD PDVAVVRCRD ADASVYGVES CETIVATDGS
GVTARTCNFA AA