Gene OSTLU_31151 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_31151 
Symbol 
ID5001507 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009358 
Strand
Start bp396511 
End bp399933 
Gene Length3423 bp 
Protein Length1135 aa 
Translation table 
GC content56% 
IMG OID640416928 
Productpredicted protein 
Protein accessionXP_001417493 
Protein GI145346016 
COG category[L] Replication, recombination and repair 
COG ID[COG1643] HrpA-like helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.219431 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGGCGC TCGCCAAACT CGCGCAGCTC TCGCTCGTGA ACAAAGTCAC GAAGGAGCTC 
GAAAATCACC TCGGGATCGC CGATAAGACG CTGAGCGAGT TCGTCATCGC GCTCACGGAC
GAAAACGACA CTCCGGTGAC GTTTCGGAAG GCGCTTGGCG CGGTCGGCGC AGAAGTGGAC
GATGCGTTCG CGGAATCTTT GCTTGGGCTG ATTCAGCGGA TGCGGAAGAA GCCGAGCGGG
ACGGCGCGCG GGACAAATGC GGCGACTGCG GGAAGCGCGG GGACGTCTGG AAGGTCGAGC
GGCGACGCAA ACTTTCCTGG ACTTGCGGTG CGGGACGATG ATTCGCACGC GAAGGCGTTG
ACGAGGGAAT TATACGGGGA CCGGGACGCG CTCGCAAACG CGGCGGACGA CCCGAGCAAG
AGCGAACGGG CGAACGAGAG GGAGGTTTCG CACTCGCTTG CGAGCGCGCA GACGAAGAGC
GATGAGAAGA CGCGAGATGG TGAGCCAAGT GTGGGTGGGG TGTACCGCGG AAGGGTGACA
AATGTGATGG ATTTCGGAGC CTTTGTGGAG TTGGTGGAGT TCAAGCGCAA GGCGGAGGGC
TTGGTGCACG TGAGCGCCAT TACCGATCGA CACCTGAAAA GCGCAAAGGA CGGCGCGAGG
CGAGGTGAGA GCGTGTTTGT CAAAGTTTTG AATCGTAACG GAAACAAGAT ATCGCTGTCG
ATGAAGGATG TGGACCAAAT CTCGGGGAAG GAGACTGCGC GCGCCACGTT TGGCGGTAAC
AGAAGTAATC CAGCGCCGAG GAATTTACCG CCACCCGGGC CGTCAGCAGC AGGGGGAATG
GCGTCACTTA AAGGCCTTTC TGGCATCGAT GCCAGCACTT TGGATGAAGC AAACACAGCC
TCGCGCAAAC GACCTGCAAA GAGGCTGAGT TCGCCGGAGC TGTGGGAAGC TCGGCAGCTC
ATCGCGAGCG GGGTGTTGAA GGTGCAAGAC TATCCGCAAT TCGATCCGGA AAACGACGGC
ATGCTATCCT ACGAAGAAGA AGCTGAAGAG GAAGTTGAAA TCGAGATCAA CGAAGACGAG
GCGCCGTTCT TGCAGGGACA AACTGCGGCG AGCACGGGTG ACGTTTCGCC GATTAAAATT
GTCAAAAACC CAGATGGTTC CATGCAACGC GCCGCCATGA CGCAAGCCAC GCTGGCGAAA
GAGCGACGCG AGCTTCGCGA TCAGCAGCAG CGAGCAAATA CTGAGGCCGA GGGTCAAGTA
GCGGCGCGTC CGTGGGAGGA TCCGATGAGA AGACAGGGCG ATGCGTCGTT GACAGAGGAG
GCTCGACAGT ACGGCGGCAG TCGCGGAGGC CGCGATATGC CCGCTTGGAA GGCCAAGTCC
ATGGGCCAAG GTCAAAGGAT GGGCCAGCCA CAGACGATGC CCATTCATCA ATTACGGCAG
ACGCTACCGA TTTATAAACT GCGCGATCAA TTGATTCAAG CGGTGAATGA AAATCAGATT
TTAGTGGTGA TTGGCGAGAC GGGTTCGGGG AAAACGACGC AGATGACCCA ATACTTGGCC
GAGGCTGGGT ATACGTCACG GGGTAGAATA GGATGCACGC AACCAAGGCG TGTGGCCGCA
ATGTCCGTAG CCAAGCGTGT CGCCGAGGAG TACGGCTGTC GACTCGGTGA AGAGGTTGGT
TACGCGATTC GTTTCGAAGA CTGCACGTCA CAGGACACAG TTATCAAGTA CATGACCGAT
GGTATGCTAC TTCGCGAGGC TTTGTTGGAT GACTTGTTAT CGCAGTATTG CGTCATCATG
CTCGATGAGG CACACGAAAG AACGATCCAC ACGGACGTAT TGTTCGGATT GCTGAAGAAG
TGCTGCGCAA AGCGAAAAGA CTTGAAAATC ATCGTGACGT CAGCTACTTT GGACGCCGAA
AAGTTTTCGA CGTACTTTTT CGATTGCCCC ATTTTCACCA TTCCTGGTCG AACGTTTCCT
GTCGAAGTAT TGTATACTAA AGCTCCCGAG AGCGACTATC TTGATGCCGC TTTGATCACG
GTGATGCAAA TCCACCTCAC AGAGCCCGAG GGCGACATCT TACTGTTCCT CACGGGTCAA
GAAGAGATTG ATGCGGCTGC GGAGATTCTA TTCGATCGTA TGCGCGCATT AGGTCCAGCG
GTTCCAGAGT TACACGTCTT GCCGGTGTAC TCTGCTCTTC CTAGCGAACA GCAGACGCGC
ATTTTTGAGC CGGCGCCGCC CGGCAGTCGC AAGTGCGTCA TCGCGACAAA CATTGCCGAA
GCCTCGCTCA CTATTGACGG CATCTTCTAC GTCGTCGATC CCGGGTTTTC AAAACAAAAA
GTGTACAATC CTAAGATTTC CATGGACTCT CTCATCGTGG CTCCGATTTC ACAGGCTTCA
GCTCGACAGC GAGCCGGTCG CGCCGGTCGT ACAGGTCCCG GCAAGTGTTA TAGATTGTAC
ACGGAAAGTG CGTTCAAGAA CGAAATGCTC CCGACATCGG TGCCAGAAAT TCAAAGGACG
AACCTGAGCA TGACGGTGCT GACGATGAAA GCTATGGGGA TCAACGACTT GATTAATTTT
GATTTCATGG ACCCGCCTCC ACCGGCGACG CTCGTCACGG CTTTGGAGCA ACTGTACAAT
TTAGGCGCGT TAGATGAGGA AGGCTTGCTC ACTCGACTCG GGCGCAAGAT GGCAGAGTTT
CCATTGGAGC CGCCGATGAG TAAGATGCTC ATCGCGAGCG TTGATCTCGG GTGTTCTGAG
GAAATTTTGA CCATCGTCGC CATGCTGAGC GCTCAAAATA TCTTCCACCG CCCGAAGGAA
AAGCAAGCGC AGGCGGACGC GAAGAAGAAC AAGTTTTTCC AAGCCGAAGG CGACCATTTG
ACATTGTTGT CGGTGTATGA GGCGTGGAAG GCTCAAGGTT TCAGCGAACC TTGGTGTTAC
GAGAACTTTT TACAGGCGCG TTCGATGAAG CGCGCGCAAG ACGTCCGTAA ACAACTTCTC
ACCATCATGG ATCGATACAA GCTCGGCACC ACGAGCGCTG GACGCAACTA CAACAAAGTT
CGCAAAGCCA TCTGCTCGGG TTTCTTCTTC CACGGCGCCA AGAAAGATCC GCAGGAGGGT
TACAAAACCA TCGTCGAGCA GACTCCGACG TACATTCACC CCTCGAGCGC GCTCTTCCAA
CGTCAGCCCG ACTGGGTCAT CTACCACGAG CTCGTGCTCA CGACCAAGGA GTACATGCGT
GAAGTCTGCG CCATCGACCC CAAATGGCTC GTCGAGCTCG CTCCGCGCTT TTTCAAGCTC
AGCGACCCTC GCCATCTGAG CAAGCGCAAA AAGAGCGAGA AAATCGAGCC CCTGTACGAT
CGCTACAACG ATCCGAACGC GTGGCGATTG AGCAAGCGTC GCGGCTGATC CGCGCCCGCG
CGA
 
Protein sequence
MAALAKLAQL SLVNKVTKEL ENHLGIADKT LSEFVIALTD ENDTPVTFRK ALGAVGAEVD 
DAFAESLLGL IQRMRKKPSG TARGTNAATA GSAGTSGRSS GDANFPGLAV RDDDSHAKAL
TRELYGDRDA LANAADDPSK SERANEREVS HSLASAQTKS DEKTRDGEPS VGGVYRGRVT
NVMDFGAFVE LVEFKRKAEG LVHVSAITDR HLKSAKDGAR RGESVFVKVL NRNGNKISLS
MKDVDQISGK ETARATFGGN RSNPAPRNLP PPGPSAAGGM ASLKGLSGID ASTLDEANTA
SRKRPAKRLS SPELWEARQL IASGVLKVQD YPQFDPENDG MLSYEEEAEE EVEIEINEDE
APFLQGQTAA STGDVSPIKI VKNPDGSMQR AAMTQATLAK ERRELRDQQQ RANTEAEGQV
AARPWEDPMR RQGDASLTEE ARQYGGSRGG RDMPAWKAKS MGQGQRMGQP QTMPIHQLRQ
TLPIYKLRDQ LIQAVNENQI LVVIGETGSG KTTQMTQYLA EAGYTSRGRI GCTQPRRVAA
MSVAKRVAEE YGCRLGEEVG YAIRFEDCTS QDTVIKYMTD GMLLREALLD DLLSQYCVIM
LDEAHERTIH TDVLFGLLKK CCAKRKDLKI IVTSATLDAE KFSTYFFDCP IFTIPGRTFP
VEVLYTKAPE SDYLDAALIT VMQIHLTEPE GDILLFLTGQ EEIDAAAEIL FDRMRALGPA
VPELHVLPVY SALPSEQQTR IFEPAPPGSR KCVIATNIAE ASLTIDGIFY VVDPGFSKQK
VYNPKISMDS LIVAPISQAS ARQRAGRAGR TGPGKCYRLY TESAFKNEML PTSVPEIQRT
NLSMTVLTMK AMGINDLINF DFMDPPPPAT LVTALEQLYN LGALDEEGLL TRLGRKMAEF
PLEPPMSKML IASVDLGCSE EILTIVAMLS AQNIFHRPKE KQAQADAKKN KFFQAEGDHL
TLLSVYEAWK AQGFSEPWCY ENFLQARSMK RAQDVRKQLL TIMDRYKLGT TSAGRNYNKV
RKAICSGFFF HGAKKDPQEG YKTIVEQTPT YIHPSSALFQ RQPDWVIYHE LVLTTKEYMR
EVCAIDPKWL VELAPRFFKL SDPRHLSKRK KSEKIEPLYD RYNDPNAWRL SKRRG