Gene OSTLU_34085 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_34085 
Symbol 
ID5000650 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp842715 
End bp845831 
Gene Length3117 bp 
Protein Length1007 aa 
Translation table 
GC content59% 
IMG OID640416071 
Productpredicted protein 
Protein accessionXP_001416776 
Protein GI145344514 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.297969 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.623648 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGCGA GACTCGATCG AGACGATCCG CTGGGCGTGG ATCTGACGCT GCGAGGGGCG 
TCGACGCGGC ATGGGGCGGG GAAACGGATG ACGCTGTATG ATTACGCGAG GACGGTGAAG
GCGGCGCATC CGCGGAAAAT ATCGTTGATT CGAGTGGGAG ATTTTTACGA GTGCTTGGGA
TACGACGCGG TGATGCTGGT GATGCACGCG GGGTTGAATC CGATGGGGAT TTCGGGGGTG
CCCAAGGCGG GGTGCCCGGT GGTGAAGATT CAGGAGACGC TCGATCGGTT GACGTCGAGA
GGGTACTCGT GCGTCGTTTG CGAAGAGGTG CCGCAGATGA ATAGATACGG CCAACCGACG
CCGCCTAAGG ATCGGTACAT CGCCGCGATC GTGACGCCGG CGTCGCCGAA TTACGTCAAA
GGCGCGGCAT CGCAGGGCGA AGACGTCGAT TTCGGGGATG GGAGCGCGCC ACCGGTGATT
GGGCTCGCGA GCAGCGCGCT CGGGTACACC GTAGTCACCG TCGAACCGGA TCTGCGTCGA
GTATCCGTCC TCGAGGGATT GACGGCGGAA GCCGCCGCGG CGAAGCTGTC GGCGGGCGGC
ATCGCGCCGC CGCTTTACAG GCACGCAAGT CTGGGAAGTG GTCACGCACA GAGCGGTCGG
CACGCGGCCG GCACGAGCGC GCCGAGTCGG CGTTTACGCT GGGAGGTGCA ATCGATTTTG
AGCGCCGCCG AAGGCGTCGA CGCGGTGAAA TACAACGGCG ACGTCGTGGA GAAGCTCCTA
GATCTCGTGC GTTTGGATCA CGGACTCGGG CCCGAAGATG CTTTCCATCG CGTGACTGTG
CCTAACAAAG GCCGCCCAGC GCCGCTCTCA CTCGCGACCG CGCAACAGCT CGGAATCTTG
CCGTCGCGAT CCGTGCCTCC GCTCTTGACG CATCTGTTGC CCGAGAGGTC GGCACCGGCG
GCGTGTCGCT CGTACTTACA AGAGCTATTG CTTCATCCAC CGCCACCTCA AACTGCGCTC
GCGATTCAAG AGGCTTCGGT GCTTTTCACG AAGTCGACGT CGGCCATGCC GCAACTCGAA
GTCCTGCCGC CGAGCAAAGT GGCGAAACTT TTGGCCCAAC GCGAGGCGAG CCATACATTT
TTCGCCGATC TCGCGTCCAT GGCGCGCGGT GTCGAGGCAC TGCTAACGAA CAGCAGCGCG
GACATTCGTC GTGCCGGACA TTTGTTGATC GATCCCACGT CTTTGAAGCT CGGCGCCAAG
CTGAACGGCG ACGCGCTCGC CAAGGTGTGC TCGGAGGCGA GCGCGATGAT CGAAAACGTG
GTGAGCGAAG ACGTTCTGAA TGGAATCGCA GTTGTTCGTC AGAAGAGCGA TGAAGACGAA
AGCGACGACG AGGATGAGCG CGATGACGCA TCTTTCGTCA TGGACGGTTT AGATCAACCA
CTCAAGCTCC TCAACATACC GAACAGATTT ATGTTTGAAA ACGAGCGTTG GCGAGGCAGA
GTTCGTCGCG AGCACATTCC CGAAGCGATC GATCGGGTTG AATCCGCCGC ACGCGCGCTT
GAAACGGCAG TGAACGAAGA TTTCATGCCA ATCATCGAAA AAGCCGCGGC CGAAAAGAAG
ACAAAAGCGA GGAAATGTGA GTTGGAACAC GACATGCGAA ACAACGCGCT TTGGATGCGT
CACGCTCCAA AAGATGCGAT GAAAATGGAC GATTTTATTC ATCCGCGCGA TAGATTCGGA
AAAGAGGTCG CAGATAGATG GACGACGGCA CGAGTCGAGC TTGCTTTGGA CGATTACAGA
GTTGCGACGC AAAAAGCCGC GGTCGCAGTC TCTGACACTC TCATCAACCT CGCCGACGAT
TTGCAGGAGC ACATATCGTC GCTCGTGGGC GCGGCGACTC TTTCAACGGT GACGATTGCG
ATTCTCGCCC ACGCAAGCAA CGCCATCAAC AAACGTTGGA CGCCGCCAAC GCTTCTTCCC
GAAGGCGACA CGGCGAGTCC GCTTGCGGTC GAGGGCTTGG TACCGTTTTG GATGCGACTT
GATGGCGCGG AGACGGTGCC GAATAGCTTT GACATCGATG GTTTGGTTTT ACTCACCGGT
CCAAATATGG CGGGAAAGAG TACGGTTTTA CGCTCAGTGG CCGCGCTAGC GCTTCTTGCG
CAGTGTGGAC TACACGCACC CGCGATTTCC GCCCAAGTAC CTCGCTTAGA CTCGCTCATA
GTTCGAATGG CGAGCACGGA TTCCCCAGTC GAAGGTTTGA GCTCGTTTGC GGTTGAGATG
CTCGAAATTA AATCCATGCT TAGCTCTTGC ACCGCTGGTA GTCTGATCAT GGTTGACGAA
CTTGGTCGAG GCACGGAGGC TTCGCACGGC ACTGCAATAG GCGGCGCAGT TGTCGAGGCG
CTCGATGAGT GCGGCGCGCG TGGAATTTTC GCCACGCACC TGCACGGCAT TCTGGATTTG
CCGTTGCGCG TCTCGCCGTG GACGCGTCGA GCGCGCATGG AGACGGCAAA GTCGGATGAT
GGAAGCACGC GCCCGACGTG GAGAATGGTC CCAGGCGAAT GCCGTGAGTC ACTCGCGCTC
CAAACCGCCC TTGATTGCGG TATTTCACAC GCCATCGTCG CTCGAGCCAA TGCATTGCTC
GAGGAACAAA CGAGCATCCC GCTCGTAAAG TTGAGCGACT CAGAGCAGGC GACGTTGATC
GAGAAGCAAG ACACCAGCCC GGAAAGACCG CGTGTCGATG GTGAATATTT GAAACTTCTC
CTCGCTGAAT CAACTGCGCG AGCGCTTCAA CTTGAAAATG CGCAAGTGAT TCACGTGGGT
CCGAATCAAA CGCCGCCGAT TGGCTCCGCC GGACAAACGT GCGTGTACAT TCTTCGCCGC
GGAGACGGCT GGTGCTATTG CGGCGAGAGC GACCATCTTC CTACGCGACT CGCGACGCAT
CGTCAAAGTT CCCAGCGTCT CATCGAGCTA GTGTACGTCG CGGTGCCGAA AGAAGCGGGA
GGAAAGAGCG CCGCGCGCGC CCTCGAGAGC CGCGTCATCC AAGCGTTACA GCGAGCGCGC
GTGCCGTTGT GGTCGGATCA AGACGCCGCA CACAAACATT TTGGCGCCGC AGGGTGA
 
Protein sequence
MIARLDRDDP LGVDLTLRGA STRHGAGKRM TLYDYARTVK AAHPRKISLI RVGDFYECLG 
YDAVMLVMHA GLNPMGISGV PKAGCPVVKI QETLDRLTSR GYSCVVCEEV PQMNRYGQPT
PPKDRYIAAI VTPASPNYVK GAASQGEDVD FGDGSAPPVI GLASSALGYT VVTVEPDLRR
VSVLEGLTAE AAAAKLSAGG IAPPLYRHAS LGSGHAQSGR HAAGTSAPSR RLRWEVQSIL
SAAEGVDAVK YNGDVVEKLL DLVRLDHGLG PEDAFHRVTV PNKGRPAPLS LATAQQLGIL
PSRSVPPLLT HLLPERSAPA ACRSYLQELL LHPPPPQTAL AIQEASVLFT KSTSAMPQLE
VLPPSKVAKL LAQREASHTF FADLASMARG VEALLTNSSA DIRRAGHLLI DPTSLKLGAK
LNGDALAKVC SEASAMIENV VSEDVLNGIA VLLNIPNRFM FENERWRGRV RREHIPEAID
RVESAARALE TAVNEDFMPI IEKAAAEKKT KARKCELEHD MRNNALWMRH APKDAMKMDD
FIHPRDRFGK EVADRWTTAR VELALDDYRV ATQKAAVAVS DTLINLADDL QEHISSLVGA
ATLSTVTIAI LAHASNAINK RWTPPTLLPE GDTASPLAVE GLVPFWMRLD GAETVPNSFD
IDGLVLLTGP NMAGKSTVLR SVAALALLAQ CGLHAPAISA QVPRLDSLIV RMASTDSPVE
GLSSFAVEML EIKSMLSSCT AGSLIMVDEL GRGTEASHGT AIGGAVVEAL DECGARGIFA
THLHGILDLP LRVSPWTRRA RMETAKSDDG STRPTWRMVP GECRESLALQ TALDCGISHA
IVARANALLE EQTSIPLVKL SDSEQATLIE KQDTSPERPR VDGEYLKLLL AESTARALQL
ENAQVIHVGP NQTPPIGSAG QTCVYILRRG DGWCYCGESD HLPTRLATHR QSSQRLIELV
YVAVPKEAGG KSAARALESR VIQALQRARV PLWSDQDAAH KHFGAAG