Gene OSTLU_25332 details

Gene Information

Locus tag: OSTLU_25332
Symbol:
ID: 5004924
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009366
Strand:
Start bp: 346609
End bp: 349911
Gene Length: 3303 bp
Protein Length: 836 aa
Translation table:
GC content: 60%
IMG OID: 640420345
Product: predicted protein
Protein accession: XP_001420662
Protein GI: 145352672
COG category:
COG ID:
TIGRFAM ID:
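
The listed gene length follows from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive (an assumption that matches the values above; the function name gene_length is hypothetical and not part of any database API):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Return the feature length in bp for 1-based, inclusive coordinates.

    Assumption: both endpoints are counted, so the length is
    end - start + 1 rather than end - start.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(gene_length(346609, 349911))  # 3303
```

With the record's Start bp and End bp this gives 3303, which agrees with the Gene Length field.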


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0150408
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
AGATCGACGC GCGCGACGCC GACGCCGACG CGCCGTCGCG TTCGCGCGCG CGCGACACCG 
CTCGAGAGAA TTTCACTCGA ACGCTCGCGC GCGCGCGACG GCGCGCTCGA CGTCGCGGGC
GACGCGCGCG ACCGCATTCG CGCGCGCGCG CGCGTCGGAT CGCGACGCGC CGCGCGGCGA
TCGTCGCGAC GCGCGCGACG TCGAGCGAGG GTTCGCGCGA TCGAACGCGC GCGGCGGTGG
GGCGAAGGCG CGGTGACGGT CGATCGATCG GTCGGATCTC GCGCCGGCGG AGATCGGCGC
GGTCCAGAGG TGGATTGGAT CGGATTGGAT TGCGTTGGAT TGCGTTGGAT TGCGTGTGAT
TGATTGGTCT GATTCGATCG CGTCGATTCG ATTTGATTGA TTTCGATACG TCGATCTTGG
ACGTAGAAAA GGCTCGGTTG AACGCGAACC GACGCCCCCC CCGACGAGAA CGAGCGATCG
CGAGGCGGAA CGCGCCGTCG ACGATCGATT GAACGATTTT CATTCTCGCG CGCGCGCTCG
CTCGCGAATC GATTTTCTTT CATTGAAAAA CACCAACGCG CTCGACGCGC GGGTAGACGT
TCTACGCCGG ACTCGATCGG AGAAATTCCG AGAGGACGAT AATAGACGTC ACCGCGTTTG
GCCTGTTAAA TAAATAATAG GAACTTCCTC GAGGCGACCG CCGCGAGAGA AGGGAAGAAC
AGAGGCGGTT TCGTCCGACT CGACGCTCGC TCGGCGAGTC TTTGACTGGC GATGACGGAC
GTGATCACCA TGTCGTCTTC GTCAGACGAT GATCGACCGC CGATCGCACG CGGTGGAGAG
GCATCGTCTT CTTCATCGGA TGGCGACGCT GGGTTCAATG GAGACGTGTT GGCGACTGAG
TTGGCAAAGA TGCCGCTCGC GCTTCGCGCG CAGCTCACCG CCGCGGCCAA GTCTTCGACG
AGCACCAACC AGCGCACGCT CGTGAGCGAG TGGTCGGAGA AGACGAAACT GTCAGTTGAT
TTGGTAGAAA AAATCTTCGC TTCCTTTGCG AAATCGTCGC GTGCGTCAGC GGAGGCAGCG
CGCGAAACCG TCGCGCGCCT CGCCGCCGCC GACGGAGTCG ACTCCGCCGA TGGAGACGAG
TACTCGCCGG GTGCCGCCGA AGCGGCGACT GCGGATGACG ACGCTATGGA CGACTCGCCG
AAAATCCCAG CGTCTAAGCA GCCGGTGCTC GAGGAAGGTG TTCCGTTTCC TGGAATTACT
CGAGACGTCT CCGTGGACGT CGCGTGCGGC GATCTCAGAG GCGTTCTCGA GCTGAAGAAG
GGATACAAGA AACTTCAGGA GCGCGTGCGA TGCGAGGGCG AAATGATGAC GCCAAGTAAG
TTTGAGAGCG AGGGCGGGCG CGGGTCGGCG AAGAAGTGGA AGATTAGCTT ACGCATCGTG
CGTTCGGATG GAAAACTCGG GATGACTGTC GGAGACTGGA TCGACAGGTA CGGATACCAT
CCGTCGGGCT TAGTCATAGG TGGGGAGGTG GCCGACGCGC CGGCGCCGCC GAAGGTGAAA
AAATCAGCGT TTGAGTCACA GCTAGAGACC TTACTCGACT ACGATGGGGA CATTGCCTTG
CGTAGCATCG CTAAATTTGT ACGAATGATG CGAGAGACGA CCAAGCCTAA GGAACGCGGA
CTTTTGCTCC AAGTCATTCG AGGCACGAAA AACAAAGAGT GTTTGCGCCA GTTCGGTCAG
TCGGCGGAGA TTAAAGGATT AGACACGCTG CAAGATTGGA TGGACGACGC TAAAAGGAAG
TTTCAATCCA CTCTTTTAGT GAGCATTCTT AGAACATTGA AGATGATTCC AGTTACATTG
GACGCGCTCA CGCGTACTTC GATCGCCCCG AATCTGGGCA AGTTGAAGTC GTACGTGGTG
CCAGAGGGTG AAGAGGAGTT TGCCAACACC GAGATGAACA CTAAAGTTGT CTTGTTATCC
AAGTCGGTCA AGAACGCGTG GAAAGCGCAA ATCACGGCGC CCCACACGGC GCCGGCGCCG
GCGCCGAAAC CGGCGCCCGT GGTCAACCCG GCGCCCGCGG CGGTGCCCGC GTCCAAGGCT
GTCGAACTCG GTGACGATGA CTTGTTCGGT GCGAAGTCGA AAATCTCACC AGCGCCTTCG
AAAGCGCCGG TGGTGAAGAC CACGGTGACG AAAATCACGA TGGAAAAGAA GGTGGCTCCG
CCGAGCGTGA CGACGAAGAA ACCGAGCGTG AGCGTGAACG ATTTGCTCAA GACGAGTTCG
CAATCGACGA AGATCACCGC ACCGCCGGTG AAGACGAAAG AGAAGGCAAA GGACGATGAT
AAAATCGACG AGAAGACTGG GAAGAAGCGC AAGCGGAAGA CGGTGACTTG GGCGAAGGAT
GAAAATTTGG AGCAAGTCAG AATCTTTGAG AAGGACGCCA AGCAACCGAA GGAAACGGCT
TTCCCAGATC CGACGAGAGA CGGTGGGACC GATGGCGCAA GCCGCAAGGC GCTCGAGAGG
AGAGACAGGG AAGTGGAGGC GGAAAGAAAG GCGGCGGCAA AGCAGCACCA GCGTCGACTC
GACGAGATGC GCGCCACGAC AACCTGGCGC CCGCCTCGGC GCATCGAAAT TCCTCGTTGG
GAAGAGGAAG AGTCGGACAG AGTCCCCGGT GACGAGTCGG AAGAATCCCA GAGAATTCTT
CGCATCGAAG CCGAGAAACC GAGCGTCAAG TACAGGAGTC TGAAAGACAT TCCAGACTCT
CCGGCGGAGG CGCCGAACGA GGACGCGCAA CTCGACCTCG ACAACACGCC AGCCTTCTAC
ATGAAGTTTC AAGAAGAACC GCTGTCTCAA GACAGCGGCG AACCTTCACC CGCGATGCCC
CAACAGCTCC AAATGCCACA GGCTCCGGGA TCGTTACTTC CTCCGAACAT CGACTTTGCC
GCGCTTCAAC GCACGTTGCT CGCCGCCAAC GCGCCGCATC AAAACGCCGG TTACCAGTAC
CCCCCGCAGG CGCCTCTTCA ACACGCCTTC CCACCACCAC CCCCTCAAGC CGCGTACGCA
CCTCAGCCCC CTTACCAACC CGGCGCGCAA CAGCCCGCGC AGCGACCGAC GAAACAGGCG
CTCGTCGGCG GCGCCCCCGT GCCCGCGCAG CAGGCGCTCA ACCTCAACGG CAAGACGTAC
CGCGGCGTGT GCGCCTTCTT CAACACCCCT CGAGGATGCA GCTGGGGAGA TAAGTGCGGC
TACCTCCACC AAGTCGGCGT CAATCCTCCG TCGACGGGTT GATCCCGCGT CCGCGACTCG
CCC
 
Protein sequence
MTDVITMSSS SDDDRPPIAR GGEASSSSSD GDAGFNGDVL ATELAKMPLA LRAQLTAAAK 
SSTSTNQRTL VSEWSEKTKL SVDLVEKIFA SFAKSSRASA EAARETVARL AAADGVDSAD
GDEYSPGAAE AATADDDAMD DSPKIPASKQ PVLEEGVPFP GITRDVSVDV ACGDLRGVLE
LKKGYKKLQE RVRCEGEMMT PSKFESEGGR GSAKKWKISL RIVRSDGKLG MTVGDWIDRY
GYHPSGLVIG GEVADAPAPP KVKKSAFESQ LETLLDYDGD IALRSIAKFV RMMRETTKPK
ERGLLLQVIR GTKNKECLRQ FGQSAEIKGL DTLQDWMDDA KRKFQSTLLV SILRTLKMIP
VTLDALTRTS IAPNLGKLKS YVVPEGEEEF ANTEMNTKVV LLSKSVKNAW KAQITAPHTA
PAPAPKPAPV VNPAPAAVPA SKAVELGDDD LFGAKSKISP APSKAPVVKT TVTKITMEKK
VAPPSVTTKK PSVSVNDLLK TSSQSTKITA PPVKTKEKAK DDDKIDEKTG KKRKRKTVTW
AKDENLEQVR IFEKDAKQPK ETAFPDPTRD GGTDGASRKA LERRDREVEA ERKAAAKQHQ
RRLDEMRATT TWRPPRRIEI PRWEEEESDR VPGDESEESQ RILRIEAEKP SVKYRSLKDI
PDSPAEAPNE DAQLDLDNTP AFYMKFQEEP LSQDSGEPSP AMPQQLQMPQ APGSLLPPNI
DFAALQRTLL AANAPHQNAG YQYPPQAPLQ HAFPPPPPQA AYAPQPPYQP GAQQPAQRPT
KQALVGGAPV PAQQALNLNG KTYRGVCAFF NTPRGCSWGD KCGYLHQVGV NPPSTG
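
Both sequences are displayed in space-separated blocks of ten characters, so they need to be normalized before any length or composition check. The following is a rough sketch of one way to do that; the helper names clean_sequence and gc_fraction are hypothetical, and the GC calculation assumes an unambiguous nucleotide sequence containing only A, C, G and T.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First displayed line of the gene sequence, copied from the record above.
first_line = "AGATCGACGC GCGCGACGCC GACGCCGACG CGCCGTCGCG TTCGCGCGCG CGCGACACCG "
seq = clean_sequence(first_line)
print(len(seq))          # 60 bases on one full display line
print(gc_fraction(seq))  # GC fraction of this line only, not of the whole gene
```

Applied to the complete Gene sequence and Protein sequence blocks, len(clean_sequence(...)) can be compared against the Gene Length (3303 bp) and Protein Length (836 aa) fields, and gc_fraction against the reported GC content.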