Gene OSTLU_17301 details

Gene Information

Locus tag: OSTLU_17301
Symbol:
ID: 5004581
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009365
Strand:
Start bp: 245634
End bp: 247597
Gene Length: 1964 bp
Protein Length: 562 aa
Translation table:
GC content: 53%
IMG OID: 640420002
Product: predicted protein
Protein accession: XP_001420457
Protein GI: 145352231
COG category:
COG ID:
TIGRFAM ID:
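
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon. The helper names `span_length` and `coding_length` are hypothetical and not part of any database tool.

```python
# Values copied from the record above.
START_BP = 245634
END_BP = 247597
PROTEIN_LENGTH_AA = 562


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode the protein, optionally plus one stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


gene_length = span_length(START_BP, END_BP)
cds_length = coding_length(PROTEIN_LENGTH_AA)
# Bases inside the gene span that are not needed to encode the protein
# (assumption: these correspond to spliced-out sequence).
noncoding_in_span = gene_length - cds_length

print(gene_length, cds_length, noncoding_in_span)  # prints: 1964 1689 275
```

Under these assumptions the span matches the reported gene length of 1964 bp, and the 275 bp left over after accounting for a 562 aa protein is consistent with the gene being marked as spliced.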


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.567212
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGCGCG AGACGAACGA TAAGACGAGC GTCGAGTATC AACGCATGTC GTGGGACGCG 
CTGAAGAAAT CGATCAATGG ATTGGTGAAT AAGGTGAACG CGTCGAACGT GCAACACGTG
GTGCCGGAGC TGTTTCAGGA GAATTTGATA CGCGGGCGGG GATTGTTCGC GAGGAGCGTG
ATGAAGTCTC AGATGGCGTC GCCGCAGTTT AGTGGTGTGT TCGCCGCGCT GGTGGCGGTG
GTGAACACGA AGTTTCCTGA AATCGGCGAG TTGATCGCGA AGAGATGCGT TTTGCAGTTT
AGACGGGCGT ATAAGAGGAA CGATAAGCCG GTGTGCGTGG CGGCGACGAG GTTTTTGGCG
GCGCTGATTA ATCAACAGGT ACGATGCTCG ATGGACGACG CGATGCGACG CGGTGCGCGC
GATGTGACAG AGGATGAATT ATGATCACGC AACACTCACT GATTCCCGTG AGGATCAAAA
TCATATCGAG CTTTTTAACA CTGATGTCAC GCCGCCGCCG ACGCGGTCGC CGACGCGACG
GGGCGCTTCT CCAGCGCGGG TGATTGACCG TTTACCTTTT TTCTCGACAG ATCATTCACG
AGTTGATAGC ACTAGAGTTA TTGACTGTGT TGCTAGGCAC TCCCACGGAC GACAGCGTAG
AGGTTGCGAT TGATTTTGTC AAGGAGTGCG GCTTCACGCT CCAAGAGCTC ACGCCGCAAG
GTTTGCACGG CATCTTTGAG CGATTTAGAG GAATTTTACA CGAAGGTGAG ATCGACAAGC
GCGTTCAGTT TATGATCGAA GGGTTGTTCG CGTTTCGAAA AGGTGGATTC GAGGGGAAAA
AGGGGGTCTC TCCCGAGCTA GATTTGGTCG ACGAGGACGA CCAAATCGTG CACGAAATCG
GCTTGGACGA CGAGATGCAA GCGCAGCCTG GTTTGGATGT GTTCAAAGAA GATCCCGAGT
TTGAGGAAAA CGAGCGTCGA TATGCAGACA TTCGCAAGGA AATTCTCGGT GAATCGAGCT
CGAGCTCGAG CGATAGCGAC AGCGACAGCG ACAGTGGTTC GTCGTCGAGT TCTTCGTCGA
GCGACGACGA GGGCGCGGGC GCCATTGTGG CGAGCCGAGG CGATGGCAAG GTCGAAATCG
CGGATCTCAC GGAGACAAAT CTCGTAAACC TCCGAAGGAC GATTTATCTC ACCATCATGT
CTTCGTTAGA TTTCGAAGAA GCTGGGCATA AATTGATGAA GCTCAACATT CCGCCGGGCG
CCGAAGTAGA GTTGTGCACG ATGCTCGTTG AATGTGCGTC GCAGGAGCGC ACCTACTTGC
GGTACTACGG TTTGCTCGCG CAACGGTTTT GTTTCATCCA CAAAATCTAC CCGCAACTGT
TCGACGAAGT TTTCATGAAG CAGTACAGTA CGATTCACCG TTTGGAGACA AACAAGCTTC
GCAACGTGGC AAAATTCTTT GCGCACTTAC TCGCCACCGA CGCCATGTCG TGGACGTGTC
TGGCATACAT CCAACTCACC GAGGAAGCTA CGACGTCAAG TTCACGAATT TTTATCAAGA
TTCTGTTCCA AGAACTCGCA GAGGCGCTCG GTTTGAAGCA GCTCAACGAA AAGATGCAAA
ATCCTGAAAT GCGTGAGTAC TTCCAAGGCA TCATGCCCAA GGACGAGCCT CGCAACACGC
GGTTTAGCAT CAACTTTTTC ACCTCCATCT CTCTTGGTGC GCTTACTGAG GACATGCGTG
AGTGGTTGAA AACAGCTCCC AAGACGGTGC CGAAGCGCTC TAGGAGCTCG TCGAGTTCGT
CGAGTTCAAG CTCCAGTTCG TCGAGCTCGA GCTCCAGTTC GTCGAGTTCG ACCTCCGGGT
CGCGTAGTTC GTCGAGTTCA TCCAGTGGAT CGAGCTCGCA TAGTTCATCG TCCTCGCGCT
CGCGCTCCAG GGACAGGCGT CGTAAGCGAT CTAGAAGACG TTGA
 
Protein sequence
MMRETNDKTS VEYQRMSWDA LKKSINGLVN KVNASNVQHV VPELFQENLI RGRGLFARSV 
MKSQMASPQF SGVFAALVAV VNTKFPEIGE LIAKRCVLQF RRAYKRNDKP VCVAATRFLA
ALINQQIIHE LIALELLTVL LGTPTDDSVE VAIDFVKECG FTLQELTPQG LHGIFERFRG
ILHEGEIDKR VQFMIEGLFA FRKGGFEGKK GVSPELDLVD EDDQIVHEIG LDDEMQAQPG
LDVFKEDPEF EENERRYADI RKEILGESSS SSSDSDSDSD SGDGKVEIAD LTETNLVNLR
RTIYLTIMSS LDFEEAGHKL MKLNIPPGAE VELCTMLVEC ASQERTYLRY YGLLAQRFCF
IHKIYPQLFD EVFMKQYSTI HRLETNKLRN VAKFFAHLLA TDAMSWTCLA YIQLTEEATT
SSSRIFIKIL FQELAEALGL KQLNEKMQNP EMREYFQGIM PKDEPRNTRF SINFFTSISL
GALTEDMREW LKTAPKTVPK RSRSSSSSSS SSSSSSSSSS SSSSSTSGSR SSSSSSSGSS
SHSSSSSRSR SRDRRRKRSR RR
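
The sequences above are laid out in space-separated groups of ten characters, so they need to be joined before lengths or base composition can be compared with the reported values. The following is a minimal sketch of that cleanup, assuming GC content is defined as the fraction of G and C bases over the whole sequence. The function names `clean_sequence` and `gc_fraction` are hypothetical.

```python
def clean_sequence(block: str) -> str:
    """Remove all whitespace (spaces, newlines) from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(block: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence block."""
    seq = clean_sequence(block)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = "ATGATGCGCG AGACGAACGA TAAGACGAGC GTCGAGTATC AACGCATGTC GTGGGACGCG"

print(len(clean_sequence(first_row)))  # prints: 60
print(round(gc_fraction(first_row), 3))
```

Passing the full gene sequence block to `clean_sequence` and `gc_fraction` would allow comparison with the Gene Length and GC content fields, and the protein block can be length-checked the same way with `clean_sequence`.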