Gene OSTLU_25982 details

Gene Information

Locus tag: OSTLU_25982
Symbol:
ID: 5004231
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009364
Strand:
Start bp: 33188
End bp: 36409
Gene Length: 3222 bp
Protein Length: 1073 aa
Translation table:
GC content: 52%
IMG OID: 640419652
Product: predicted protein
Protein accession: XP_001420058
Protein GI: 145351379
COG category:
COG ID:
TIGRFAM ID:
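
The coordinates and lengths listed above are mutually consistent, which a short sketch can check. It assumes the start and end positions are 1-based and inclusive, and that the CDS length counts one terminal stop codon. The function names are illustrative only and do not belong to any database tooling.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino-acid count of an unspliced CDS, assuming one stop codon is included."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(33188, 36409)
print(length, protein_length(length))  # 3222 1073
```

Both values agree with the Gene Length and Protein Length fields of this record.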


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGATTGA GAATATTCAG TGGGATTGTC GGTGTCTTGT CGTTACCGCG ACGAGGCGAG 
TACGAGACGG TATGCATGGA AGTCTTGGAG GCGACGCGCG CGTTTTGCGC GGCGGCGTCC
ACAGAGGTGC AAGCAGCATT TCGAGGCGCC TCCGAGATCG CACCTATGCT CGGGTTCACG
ATTTCTACGC TCCTTGCGAT TTCGCACGAA GAAACAAGCG CGGGGTCGCA AGGGAGTAAG
CTTCTGCGCT CGAGTGCGTT GAACACGCTC GATTGCTTGA TTCGATTTGT CGCCGACGCC
GATGCTCTCG CGTTTTTCCT TCCGGGCACG GTGTCTGGGT TGACCAAGGC GTTAGTCGCG
GCGAGTGGTG TTCGCCCAAA CGTTGGCGCT GGGCCGGGAG GAACTGGCGC AGATGGCGTC
GAGTTTGCCC TGAGTTCGAT GGCGTCCATT CTGTCGTGCG TTCTGAACGA CTCGTTGTAC
GAATCAGAGC TCGGCGATTC CGCAGAGGGC GAAGGTGCAA ATGCGTTGCA AAGCGCACTG
GGTTCTCTCG TCGCTCGATC GATAGACGAT CAGCTGGAAT CGAACGCAAA AAATATGTCG
AGGGATATGA ACGACGAAAA CGCGTCCGTA TCCCCTGCGA ACGCTGGTAA AGATCTCAAG
GTGGTTCGAG ATAAAGAATG GCTCTCTCTC GCGAGCTCTC GCGTTGAAGC GGCGCTCTTA
GCGACAATTC CACCCTTAGC GAGTCATCAG CGCGCGACCG TGAGACTTGC GACTGGAAAA
ATGGCGAGTA GCGTCTTAAA ACATTGTGCA AAAGTTCTGG GCGCGAAGAC GAGGCGTAGA
ATGATGGAGT GCTTACTCAC GCTCGCGGGA GATAGTTGGG CACAAGTCTC GACGCCCATA
TTGGAAGATT TGAGATCTCT TGACAACCTC GGCTACGTCG TCCGAGCAGA CTTGGAGGCA
ATCATAAAAG ACGATTTGTC CAACATAGCA GACACGCTGC GTGATTCTTC CGATAAGGCT
GTCGGGCATC TTCAGCGCTT GCTCATCGCT CTGGAATTCG CCGGCGCTCG GCGCTTGAAA
GAAACGCTAT TCGCCCGTCC GAGCTCGAGA GAACATTTGT GCTCTACCAT TGCGGAGTGT
CTGCTCATCG AAACATTTTC TTCTCAACGA CGAGATGGCA GCGCGCAGAT GATAAGTTTA
ATTGATTTGA CCGAAGCCAG CGTGTCGACG GCGCAAGGGT TGCCACGGGC GCCTCCTAGG
TTGCAGTACT TCCCCGACGC ATCCATGTAC AAAAGTTTTG CAAAAATGCT AAGATTACTC
GGTCGAGCGG CATCAATCAC AAACGAATCT TCACTGGAAA CGTATTTTGT GCCAATGGCG
CAATACTTTC TCGGCACATT GAGAAATGAT GCAGAGCTCG ACGCATTGGG CACCGCGGGT
GCATGGCAAA GAAACGCGGC AGCCCACCTG ATAGCGCTCA ATGAAATGCT TTTTGGCGCC
ATCACTGAAT CAGCAGTCGA AAGAGAATAT CTTGTTCGCG TCTCCAGTTT GATCGTCGAA
GAGTACGTAT GCAGCGAGGT GTGGAATCTT GACGCTGCGG AACCGGACAA CGCTCTTTTA
CTACGCGTGT TGATGGAAGG CATGGGGATC ATCGGAGAAG GACTCGGCAA AGATCACATA
CGAACAAGTG CATTTTTGAC AAACGTACTT TGTCCATTAC TCGATAAGCT CGGTGAAGAA
TCGCTTGAGG TGCGAGACAC CGCAGCGCTT GTTTTGCTCA AGTTGGCTCA AAGCGGAGAA
TACAATTCAA CCGATGATTC ACCAATAGCG AGTCTTGTCG TTGCGAACGC AGACTACGTT
GTGGATATGC TTTCTCGACA AATGCGGCAT TTAGATAAGC ACTCACGCGC TCCTCGACTC
TTTGCGGCTA TTCTGAGGAG GACTGATGCG GCAAAGAGCA TGGTGAAGCT TCTCGCAGAG
CCGATTCGAT TGGCGTTGCG CACATTTTTG ATCGTCAATA GAGAAAAAAA TGCGGAGCAC
TCCGAAGATT TTCTGCTCGT AATGAATGAA TATTGTTCAG CAGTGTTGCT CGAGGTGCGA
GAGATAGAGA CACACTCGCG AGGTGTTGTG GCTAGCATAC AAAAGTATCA CCCGGAGGAG
CCAGACAGCG AAGACGAGAA CGAGCTGACG GAGGTCTTTG TCGAAGAAAT GCGTTCGGTG
CTTTCGCAGG ACATTGAAAC CACCCGGGCT CAGTCCTTGG AGCGCGTTCG TCGCCTTCAG
GCCATGGTCG GTTGTGCGAT AGACTTGCTG AGATCGGTCA GCGCGCTGCT GGAGTCGCCT
TACGCCGGCG TTCGAAGTTT ATCTGCAACG GCTTGCTCGT TCGCTCTTGA GAGCCTTTCA
GCAGCAGAAT CTGCATTGAC GCATGATAAA TACGTCTTAA AAGTACTCAA AGCTTACGGA
GGCAGGAATT CCACGCCTTT TGATGTGGCT TCCCTCTACA AAGATGCCCG TGTACTGCCA
CACATCCATA ACATCTGGCC TCATGCGGTA TCATCGCTCA GCGACCGATT TCAAATTTCG
ATTCAAGCTG AAGCGTACGA AGCGAGTTTG TCGTTACTTC GAACCATGGC TTCGACAAGT
GGTGGCGATT TCATCGCCAA ACGCATGAAC TCGGATCTAT GGCCTATACT GTCGCGCGTT
TTGCAACAAG GTGTTTGTCA CGTTGACAAA CGACACCGAT CGCTCGAACT TCTTACGCTG
GCAGATACCG TCGATGCAAG CTTCATCGCT GAGTCGGAGA TTTCATCAGA GCTTACGAGA
CGCATCCGTA TCAAAATATG CGATACTTTA GAATCCATCG CATCGTGCGA AAAATCAAAG
CAAGCGCTAC ACGATTTGCT CGGGGCGGCA GTGCCCGTCG TGGCCAAGCT TGCCATGGAG
GACGACGAGG CTTTGCGACT CGTCGCAGCG AACGCAACGC GTGCTTTCGC AAAAGTCAAC
GGTGATGAAG TTTGGCTATA CTCGATGGCA ACCGCGGCTC GTGGCTGTCT TTTGGTTGAC
ATTCCCACTC CAGTCTGGAG TGTCGCGCCC GAGGCTGGAT CGCAGCGAGT CAGTCTTCCC
CCTGTTTCTG CCATCGTGCC TGAGTGTTGC TTTGATGGTG ACAGGAAGGA AAGTAGGGAA
GCGACTTTCG CGGCAAACAT TCTCAAATCA TTAGTCTTAT AG
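
The gene sequence is printed in space-separated blocks of 10 bases, so the grouping has to be removed before any analysis. The following is a minimal sketch of that step together with a GC-percentage calculation; the helper names `ungroup` and `gc_percent` are hypothetical, and only the first line of the sequence is used as a demonstration.

```python
def ungroup(sequence_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(sequence_text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above, used only as a small demonstration.
first_line = "ATGGGATTGA GAATATTCAG TGGGATTGTC GGTGTCTTGT CGTTACCGCG ACGAGGCGAG"
dna = ungroup(first_line)
print(len(dna))  # 60
```

Passing the full sequence text through `ungroup` and then `gc_percent` would allow a comparison with the GC content field reported in the Gene Information section.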
 
Protein sequence
MGLRIFSGIV GVLSLPRRGE YETVCMEVLE ATRAFCAAAS TEVQAAFRGA SEIAPMLGFT 
ISTLLAISHE ETSAGSQGSK LLRSSALNTL DCLIRFVADA DALAFFLPGT VSGLTKALVA
ASGVRPNVGA GPGGTGADGV EFALSSMASI LSCVLNDSLY ESELGDSAEG EGANALQSAL
GSLVARSIDD QLESNAKNMS RDMNDENASV SPANAGKDLK VVRDKEWLSL ASSRVEAALL
ATIPPLASHQ RATVRLATGK MASSVLKHCA KVLGAKTRRR MMECLLTLAG DSWAQVSTPI
LEDLRSLDNL GYVVRADLEA IIKDDLSNIA DTLRDSSDKA VGHLQRLLIA LEFAGARRLK
ETLFARPSSR EHLCSTIAEC LLIETFSSQR RDGSAQMISL IDLTEASVST AQGLPRAPPR
LQYFPDASMY KSFAKMLRLL GRAASITNES SLETYFVPMA QYFLGTLRND AELDALGTAG
AWQRNAAAHL IALNEMLFGA ITESAVEREY LVRVSSLIVE EYVCSEVWNL DAAEPDNALL
LRVLMEGMGI IGEGLGKDHI RTSAFLTNVL CPLLDKLGEE SLEVRDTAAL VLLKLAQSGE
YNSTDDSPIA SLVVANADYV VDMLSRQMRH LDKHSRAPRL FAAILRRTDA AKSMVKLLAE
PIRLALRTFL IVNREKNAEH SEDFLLVMNE YCSAVLLEVR EIETHSRGVV ASIQKYHPEE
PDSEDENELT EVFVEEMRSV LSQDIETTRA QSLERVRRLQ AMVGCAIDLL RSVSALLESP
YAGVRSLSAT ACSFALESLS AAESALTHDK YVLKVLKAYG GRNSTPFDVA SLYKDARVLP
HIHNIWPHAV SSLSDRFQIS IQAEAYEASL SLLRTMASTS GGDFIAKRMN SDLWPILSRV
LQQGVCHVDK RHRSLELLTL ADTVDASFIA ESEISSELTR RIRIKICDTL ESIASCEKSK
QALHDLLGAA VPVVAKLAME DDEALRLVAA NATRAFAKVN GDEVWLYSMA TAARGCLLVD
IPTPVWSVAP EAGSQRVSLP PVSAIVPECC FDGDRKESRE ATFAANILKS LVL