Gene OSTLU_25015 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_25015 
Symbol 
ID5003724 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp141440 
End bp142650 
Gene Length1211 bp 
Protein Length394 aa 
Translation table 
GC content63% 
IMG OID640419145 
Productpredicted protein 
Protein accessionXP_001419723 
Protein GI145350671 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.0269626 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGGACG CCCCGCTGTT GACGCGCGAC GGCGACGGGG ATCGACGACA CCGCTCGTCG 
CGCGCGCGCG CGTCCACGTC GGTGTTTCGA CCGACGTTCG CGCTGGGCGT GACGCTCGCG
TGCGCGTGCG CGCTGGCGAA GACGAATGGT GTTGGACGAA CGATCGCGCG CGACTGGATG
TCTGGAGACG ACGCGCGCGA ACGCGCGAGC GCGAATGAAC GAGGGAAAAT GAGCGCGTTC
GGGAGCGCGG CCGCGTTGGC GACGAGCGGA GACGGGACGA CGCCGTTCGC GGTGGTGAGC
GACCCGTTCG TCACGGGCGG CGGCGACGAC GACGGAGCGA GCGAGGAGTA TTCTTTAGTG
CGGAAATCGC AAATGTCGTC GAGTGGAGAC ATAGAGGTGA ATCACAGAGA AGATGTCGAG
ATGATACCAC GCGCGGCGGC GACGATGGTG CACTTGACGT TGCTCACGGC GTGCGCGCAA
CTCGGGTCGT TGACGTTTGC GCCGGGAGCG TGGGAGGACG TCGTCGGCGC TCGGGTGACG
ACAAAGTCGA TGTCGAACGA CTTTTTATTC TCCCAAGCGC AGGAAATGAC GCAAACAAGG
TGTGGAACGT TTGAAGTGGA CGTCATGCTC GGTGCAGGGG AGCAGTTTGG ATTTTATTTG
TACCCGCTCG ACAACACGAG CGACGAGGCG ACGGTGTCCG ACATCGGTTG CTTGCACAGG
GGGGGCGGGC GATGTCCGAA ATTTGCGACT CCATCGGCTC TCGAAGGCAT GGAGGTTTGT
ACCTCTGTCA TCGAGGAGGG CGATGACATC TTTTACAACC GCGTATTCGA TGGGAAGACG
TTCACGTACG TCTACGGTTC GTGCGACGAG GGATGCGCGC TGAAAGCGCC GAGCGGGTGT
CCGGCGTCCC ACATGCCCGA AGTCACCACT TTAGACACCG GAGTGTGCAC CGATCCCGCA
CACGCCGGTA TTTACAACGC CCTGTGCGCG CAGAGCTGCG GTGCGGGCAC CACGGACTGC
GACGCGTCGT GTCGAGCGGC GTCCGACGCC GGCGTCTCCC TCTCGTGCGT TCCCGGCGCG
CGCGGCGCCG ACGCGTGTCG ATGCGCCCAC GTCGCCGCCA ACGCGACGAC CGAGTGCACC
GTTCCCGGAT ACGACTGCTG CACGTGCGAG AGCATCATCG TGTGATCACA TTCATAGACC
CGTAGTTCCT T
 
Protein sequence
MEDAPLLTRD GDGDRRHRSS RARASTSVFR PTFALGVTLA CACALAKTNG VGRTIARDWM 
SGDDARERAS ANERGKMSAF GSAAALATSG DGTTPFAVVS DPFVTGGGDD DGASEEYSLV
RKSQMSSSGD IEVNHREDVE MIPRAAATMV HLTLLTACAQ LGSLTFAPGA WEDVVGARVT
TKSMSNDFLF SQAQEMTQTR CGTFEVDVML GAGEQFGFYL YPLDNTSDEA TVSDIGCLHR
GGGRCPKFAT PSALEGMEVC TSVIEEGDDI FYNRVFDGKT FTYVYGSCDE GCALKAPSGC
PASHMPEVTT LDTGVCTDPA HAGIYNALCA QSCGAGTTDC DASCRAASDA GVSLSCVPGA
RGADACRCAH VAANATTECT VPGYDCCTCE SIIV