Gene OSTLU_51248 details


Gene Information

Locus tag: OSTLU_51248
Symbol:
ID: 5004986
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009367
Strand:
Start bp: 370104
End bp: 372402
Gene Length: 2299 bp
Protein Length: 722 aa
Translation table:
GC content: 59%
IMG OID: 640420407
Product: predicted protein
Protein accession: XP_001421130
Protein GI: 145353672
COG category: [L] Replication, recombination and repair
COG ID: [COG0323] DNA mismatch repair enzyme (predicted ATPase)
TIGRFAM ID: [TIGR00585] DNA mismatch repair protein MutL
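
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that the start and end coordinates are 1-based and inclusive, and that the coding sequence ends with a single stop codon. The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 370104
end_bp = 372402
protein_len_aa = 722

# Assumption: coordinates are 1-based and inclusive,
# so the span covers both end points.
gene_len_bp = end_bp - start_bp + 1

# Assumption: the coding sequence carries one stop codon
# in addition to the 722 codons that encode the protein.
cds_len_bp = (protein_len_aa + 1) * 3

# Bases within the gene span that are not accounted for by the
# coding sequence. The record marks the gene as spliced, so these
# would plausibly be intronic.
non_coding_bp = gene_len_bp - cds_len_bp

print(gene_len_bp, cds_len_bp, non_coding_bp)  # 2299 2169 130
```

Under these assumptions the computed span matches the listed gene length of 2299 bp, and the gene is 130 bp longer than a 722 aa coding sequence would require.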


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.0753689
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCGCG GTGACGGCGC GCGCGACGGC GCGTCGACGC CGCGCGCCAT CGGGCGCCTG 
CCGAGCGACG TCGTCAATCG CGTCGCCGCG GGAGAGGTGC GTCGAGGGCC GGCGCGCGAC
GGCGCGCGCG ATGAGAGATT GAATGTTTGA ACGTCAGCTC GACGACGGAC GACGCGAACG
CGCGACTGAC GACGCGAGAC GACGCGACGA CCGAACGCGC GCGCAGGTGA TCCATCGACC
GTCGAACGCG CTGAAAGAGC TGGTGGAGAA CTCGTTGGAC GCGGGCGCGA AGTCGATCGC
GGTGACGACG AGGGAGGGCG GGAATAAACT GTTGCGAGTG CAAGACGACG GACACGGAGT
GCGAATAGAG GACTTGCCGC TGCTGTGCGA GCGACACGCG ACGAGTAAGA TTGAAAAGTT
TGAGGATTTA GCGCGATGCG AGAGCTTTGG GTTTCGAGGA GAGGCGCTGG CGAGCATGAG
CTACGTGGCG CACGTGTCGG CGACGACGAT GGCGGCGGGG GCGACGCACG CGACTCGAGC
GACGTATACG GATGGGAAGA TGGATGCGGA GGGGGCGAAA CCGATCGCGG GGGTGTTAGG
AACTACGATT AGCGTGGAGA ACTTGTTTTA TAACGTCGTG ACTCGAAGGA AGGCGTTGAA
GAGCGCGTCA GAGGAGTACT CGAAAGTGCT CGAGGTGTTG CAGAGGTACG CGGCGTTGCG
AACGGATGTG GCGTTCACGT GTCGGAAGCA CGGTGAGTCG CGAGCGACGT TGCACACTCC
CGTGGCGCAA TCGCGCGTCG AGCGGTTGCA GGCGATTTAC GGTCCCACGG TGGCGAGAGA
TTTGAAGAAG CTCGACTTCG ACAGCGAGCT GTCCAAGAAA AAGTTTGATT TCAAGCTGCA
AGTGGACGGT TTAGTGAGCG GTGGGAATTA TCATTCAAAG AAGACGACGT TCATTTTGTT
CATCAATTCG CGTTTAGTGG AGTGCGCGCC GCTCAAGCGC GCGTGTGAGT CGGTGTACGC
GGCGATACTC CCCAAGGCTG AGAAGCCGTT TGTATTCATG CACCTCCGCC TGCCGTTTGA
AGACGTCGAC GTCAACGTGC ATCCCACGAA ACAGGAGGTG CACTTTCTGC ACCAAGAAGC
CATTGTGGAG TTGATTCAGT CCAAACTAGA GAAGATTCTT CTCGCGACGA ATTCGTCGCG
AACATTCACC GTGCAAACAC TGCTTCCTGG CGCGGAGAAA CTGGCAAAGA AGGATGACGA
AAACGACGCC GAGCGAAGCG GCGACAAGGA AAATAGCGAA AAAGCGGACG AACCGCCGGC
GTCGCAGGCG AAGACGATGC GGACACAGCG CGAACGCGCG GGTGGTGATC ACAAGCTCGT
TCGCACGGAT GCGAATTTAG CAGCGGGGAG TTTGGACGCG TACTTGCAGC GAGCGATGAA
TTCCGAGGGA CGCGAACACG AGAAAATAGA AGAGGTTCGA CGCGCGGTGA GAGAGCGTCG
AGGACAGCGC ACGGAACCCG AAGACACGTA CGTGTGCGAG TTGACGTCTA TTCGCCAGCT
TAACACCGAA ATCGCCAATC GCGCGCACAA GGAGCTCGGC GACGTGATTA AAAATCACAC
ACTCGTCGGC GCCGTGGACG CGCGCAAAGG CGTGTGGTTA CTTCAGCACC AAACCAAGCT
CTTCATGGTG GACGCCGTAA AGCTCACCGA GGAAATGTTC CATCAAATGG CTTTGAAGAA
CTTCGCCAAC TTTGGGTACC AATCGCTGCA AGATCCCGCG TCTTTGGCCG AACTCGCGCT
GTGCGCGCTG GAGGATAAAT TCGTCGACGA CGAAGAGTGG GACGCGAGCG ATGGCTCCAA
GGAGGAAGTC GCAGAGAAAA TCGCAGAGAT GCTCGTCGAA AAGGCGGACA TGCTCAAGGA
GTATCTCGGC GTCGTCATCG ACAAGGAACG GCGTCAGATC ACCGGAGTGC CGTCGATGCT
TCCCGGGTAC GCGCCGGAAA TCGGCAAACT TCCCGAGTTC GTCCTCGCCC TCGCCGAAGA
CGTCGATTGG ACGAGTGAAA AAGAGTGCTT CGAAACCTGC GCTCGAGTCA TCGGCGCATT
TTTCGCCATG GACTGCTCTT TCCACGATCC GAAAGCCGAA GAAGGCGACG CCGAGTCCGA
CGCTCGTCGC GTCGCTCGCC TCTGCGTCTT TCCCGCGATG AAGCGCCGTC TCGCCCCGCC
TCGTCGTTTC GCCGACGACG GCACCGTCAT TCAGATCGCG TGCCTCGAGC AGTTGTACAA
AATTTTCGAG CGCTGTTAG
 
Protein sequence
MTRGDGARDG ASTPRAIGRL PSDVVNRVAA GEVIHRPSNA LKELVENSLD AGAKSIAVTT 
REGGNKLLRV QDDGHGVRIE DLPLLCERHA TSKIEKFEDL ARCESFGFRG EALASMSYVA
HVSATTMAAG ATHATRATYT DGKMDAEGAK PIAGVLGTTI SVENLFYNVV TRRKALKSAS
EEYSKVLEVL QRYAALRTDV AFTCRKHGES RATLHTPVAQ SRVERLQAIY GPTVARDLKK
LDFDSELSKK KFDFKLQVDG LVSGGNYHSK KTTFILFINS RLVECAPLKR ACESVYAAIL
PKAEKPFVFM HLRLPFEDVD VNVHPTKQEV HFLHQEAIVE LIQSKLEKIL LATNSSRTFT
VQTLLPGAEK LAKKDDENDA ERSGDKENSE KADEPPASQA KTMRTQRERA GGDHKLVRTD
ANLAAGSLDA YLQRAMNSEG REHEKIEEVR RAVRERRGQR TEPEDTYVCE LTSIRQLNTE
IANRAHKELG DVIKNHTLVG AVDARKGVWL LQHQTKLFMV DAVKLTEEMF HQMALKNFAN
FGYQSLQDPA SLAELALCAL EDKFVDDEEW DASDGSKEEV AEKIAEMLVE KADMLKEYLG
VVIDKERRQI TGVPSMLPGY APEIGKLPEF VLALAEDVDW TSEKECFETC ARVIGAFFAM
DCSFHDPKAE EGDAESDARR VARLCVFPAM KRRLAPPRRF ADDGTVIQIA CLEQLYKIFE
RC
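
Both sequences above are printed in space-separated blocks of 10 residues, 60 per line. A minimal sketch of how such text could be joined into a contiguous string and summarized is given below. The helper names `join_blocks` and `gc_fraction` are hypothetical and not part of any tool behind this record, and only the first line of the gene sequence is used as sample input. Because the record marks the gene as spliced, translating the joined gene sequence directly would not be expected to reproduce the listed protein.

```python
def join_blocks(text: str) -> str:
    """Strip the spaces and line breaks between residue blocks and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First line of the gene sequence, copied from the record above.
first_line = "ATGACGCGCG GTGACGGCGC GCGCGACGGC GCGTCGACGC CGCGCGCCAT CGGGCGCCTG"

seq = join_blocks(first_line)
print(len(seq))                    # number of bases on this line
print(round(gc_fraction(seq), 3))  # GC fraction of this line only
```

Passing the full gene sequence text to `join_blocks` and then to `gc_fraction` gives a value that can be compared with the GC content field in the record.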