Gene OSTLU_37894 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_37894 
Symbol 
ID5003976 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009364 
Strand
Start bp476054 
End bp480632 
Gene Length4579 bp 
Protein Length502 aa 
Translation table 
GC content64% 
IMG OID640419397 
Productpredicted protein 
Protein accessionXP_001420191 
Protein GI145351670 
COG category[L] Replication, recombination and repair 
COG ID[COG1193] Mismatch repair ATPase (MutS family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.525832 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAACGC CGTTGGAGGT CGTGGAGGCG ATCGAGCGGT GCGTGAGCGT GCCGGGGGGG 
GAGGTGCTGG ACGCCGCGAG CGAGACGCTG AGGGCGATTC GCGTCGAGCA ACGGCGGATT
CGAGAGGAGT TACGAACGAT GCTGAACGCG ACGAGCAAGG AGATGGCGAG GAAGAATTTT
GCGGAGCGCG CGCAAATCGT CACCAGACTT GGCAGACAGT GCATACCGAT GAAACTCGGG
AGCGCGGGCG AGCTCCCGGG AGTCGTGCTC GACGTCTCCG GCACCGGGAA CACGGTGTTC
AAAGAGCCGC AAATCGCGGT GCCATTGAAT AATGCTCTGG CCACGCTTTC CGCGAGCGAA
GATGCGGAAA TAGAACGAAT TCTAGTCGAG CTCACCTCAA TCGTGCAAAC GCACGCGGAT
GTTCTGCTCG ACGCGAACGA GGCGTTGACA GAGTTGGACG TGGCGAACGC GCGCGCCCGG
CACGCGGAGT GGTTCGACGG CGCGGAGCCG ACGATCGTGG ACGCCAATCA AGGCATGTGC
GTGCGTGAGC TGCAGCATCC GTTGTTGTTA GAGCGGCATC TGACGCCGTT GCCCAAAAAA
GCCGCCATTG GAGAAGAGGA ACAGGTATCG GCGTTCGGCG AGAACGACGC CTCCGAAGAC
GATTCGGCGC ATTCGCAGCG ACACGTCAAT CGTCGAGATG TGCGCGATGT GGTCGTCCCG
ATTGATTTCA ACGTCGATTC GTCTATCAAA TGCGTCACTA TCACCGGTCC GAATACGGGC
GGTAAAACCG CGTCGTTGAA AGCGATCGGC GTCGCGTGTT TGATGGCGCG CGCTGGTTTA
TATCTGCCAT GCGAATCCGG TTGCGAGATT CCATTCTTTC GTCACGTCAT CGCCGATTTA
GGAGATTCGC AAACCCTCGA GCTCGACGGC GGCTTGTCCA CCTTTGGCGC GCACCTCAAA
GGTCTGCAAC GCATTTTGGA CGCCGCGACC GACGATACAT TGGTCTTGCT GGACGAGCCC
GGGAGCGGCA CCGACCCGGC GGAGGGCGCG TCTCTCGCCG TCGCCGTCCT GAACAAACTC
TCACGCACGT CTCGTTTGAC GATCGCGACG TCGCACTACG AAGAAGTCAA GGAAGCCACA
CTCGCATCCG ACACAGCTCA AGTCGCGGCG GTGGAGTTCG ATCTTCAGTC GCTGCAGCCG
ACGTATCGGT TGCTGTGGGG CGAAACCGGC AAGAGCAACG CGTTGCACAT CGCAGCGGGG
CTCGGATTAG AACCGTGGAT ACTCGCCGAG GCGCGCATCG CGCTAGCCAA GGCGGATGCC
AACGCAGAGG TAGACGCCAG TGGCGCGATC GCGCGGGAAA ATCGCGCGAA GTTGGCCAGC
GCACTTGATG AAGAATGCGA CGTGCAGCTC GCCCGCCGAG CCGCGGCGGC GGCGACGCTC
GAAGAGACGC GCGCGCTGTT CGACGAAGTC AGGAGTAAAT CTGCGCACTT GGATCTGAGG
AAACAAATCA TCAGGGATGA CGCCAACAAT GAGATCGAGC GAAAGATTGA AGAAGCTCGC
GAGTTATTGG CGGCGTGCGA CACGCGAGAG GACATCGACG ATGTCGTCGG CGCGTCGCTT
CCCGCCGGAT GGGTCGTCGA CGCGAGCGGC GAAGCCGTCC CGGGCGACAG CCTTGACAGC
GCGTCCTCGC GATGGATTCC CAAAATTGGC ACGCTCGTCG TCGTCCGTCA GCTAGGGAGC
GCAGAGGCAG AAGTCATCGA AGTCCACCCA GATGCGAACG AAATCACCGT CAAGCTCGGT
CGGATCAGTA CTCGGGTTTC GCTCGCCAGC GGTGTGAGTA AGGTTGACGT GAGCAAAACG
AGTTGGCGAC GATAGCTGTA CTGTAGTAGA AGTATATTTT ACATTCATTG CAAACACACA
ACGCTGCGAC AGTGAAGCGG TCGCGGTTAG TCGGATAGTC GGCGCGAACG GAAAGCCAAC
AGCGCGCCAC GGATTCCTCA GTCGAGTTGT CATGAATCAT CTTCATGAGG ATGGTCAGCA
AAGATTAAGC TATTTGAAAC ATCATCGGAA ACCACGAGGG CGAGAACAGG TTCGATCATC
GCGTGTACGC ACCTAGCACT AAAACGTCGG CGCGATAGTG GACTCTTCCG TGAACACGCC
CTGCTCTACT GCGACGTCCA ATTCTCGAAA CGCCAACACG CGTCTGGAAG ACTCCTTCAA
CGTGCGCGCG ACCCAGTCGT CCTTGTCTTC CCTCAGATCG AACATAATTT TTTCATGAAG
CTTTAAGAGC AAATACACGT CTAGCGCGGC ATACCGTCGA ACGTCATCTG TTAACGGACG
ATCCGCCCAA AGCGTCGATT CCTCGACGGC GTACAACTTC TTCACGCGAA CCTTCAAGTC
ATCAGCCACC GCGGTCTCCG CGCTCGTCAG ATGCGTTCCC GCGCACTTCA CGATCCCCGC
GACGCGATCG ATCATCTTCC CGAGAGCCCG ACGCGCGGCG ACGTCGAGCA CCTGCACGTC
CATCACGTTT TCCAATCGCA CGTCGAATTG ATGAAACAAC GCGTCCGAAT CCATGCGACA
ATCAAACATG AGCTTCATCG GCGCCTCACT CGACTCCAGG ACATCCCGCA ACCCTCCCGC
GTTTCTGTCT CCGAACGCTC GTCCCCCGAG CGACTGCACG TCGATCAAGT AAATCGCGTC
GCGCGTCGCG CACTGCACCA CCGTCACCGG ACCCGTGCGC GACATTCGCA CTCCCTCGCA
ATCAACCGCG ACGACGCTCG ACGCCTTGAT CGTCTCCACG CATGCCGGCA TTTTGATTTC
ATAATTCGTC GCGTCGATGA TTTCGCACCG TTGATCCAGC GTGTTCACGA TCTCGAGGCA
CATCGGATCC GACTTCGCGT TCGTCGGCGC CGCCGGCTTT TTCCGTCGCT TCTTCTTCTT
CTTCGCCGCG CCGCCGGACC CACCGGCTTC CGCGCGCTTA CCGCCGCCCT CGAGCGCGCC
GCGTTCGAGG CCAGACGAAT CATCGCCGTT CGCGCGCTTG AATCCGAGCA CGACGCCGCC
GTACGCGTCG TCGTCGTCGT CGAGCGTTTC GCCCTTCGCC GAGGCGGCGG CTCGGCACGG
CTTACAAAAC CGCGTCGACT GCGCGGTCTG ACGCTTGAAC GTCGACGCGC ACTTGGCGCA
CACGAGCCTG GGTTTGTTGC CCATCCGCGC GACGCCGCGC GCGCGCGCGA GCGCGAGAAC
GCGCGCGGGC GGGCGCGCGC GCGAGAGCGC GGCGCGCGAC CGGGACGCGA CCGCGACCGC
GACCGACGAC GCGCGACCGC GCGCGGACGC AGAGGACGAC GCGCGCGCTG AAACGCGCGC
GGGGATGAGC ATACACCGAA ACGACGCGCG ATCGGCGCGC GAGCGCCGGT GCGCGCGACG
GTGTGACGCC GCGACTCGCG CCGACGCGCG TTCGGCGTCA ACCGACGCCG GCGCCGAGCG
GCGCGCGACA TCGCGTCGAC GTCAAGTCGC AACCGGGATA CGCGCGGCGC GCGTCGGAGA
CACCCAAATC GCCGTAGCAC GCGCATCGAC TGCTCGCGCG CCCTTCGGCG TCCGTCGTGG
TTTCGTTGCG CGGGTATCGC GGTCCCGAGT CGTCCTCGCG GCAGCGCACG TCGACGAAGT
CGCCGTCCGA CGTGGAATCG CAATCCGGGA ACATGCGCGC GCGCTGTTCG CGCGCCGCCG
CACCCGCGTC GTACCGCTCC AACCGCGCTC GAATCTCGAG CGTCAACGCC GTCGCCGCCC
CCGTCTCGGC GTCGTAAAAC TCGCCGTTCG CCAAGACGCC CGCGGCGCGA TACTTGTCGC
GCATGAAATC CCTCCACCCT CGGATCGCCG CCAACTCCCC GTCCGTCAGC CCCGAGACGT
CGCTCACGCA CCCACTCTCG CTGAAGTCTC CCGTCGCGAA CGCGCGGCTC GCGTCGCGTC
CCGCGAAGCA CGTCGCGTAG TCGCCGTCGC CCGCGTAAAA CCGTCGCCCC GCCGACACGT
CGAAGCACTC GCCCAGCACC GCCAGCCACA CGCGCGGCCG TCGTCGCAGC GCCGCGCGCG
CGCGCGTCCA CGTCGACGCG TCGCTTCGCC GCGCGCGATC GACCCTGCGC GCGAGCGCGT
CGGGCGCGAC CATCGCGACG CGCGCGTTCG CGTTCGCGTT CGCGTTCGCG TCGCGCCGAC
GCGCGACGAC CGCGACGACC GCGACGACGA CGGCGAGCGC GACGCGCGCG AGTCGCGCGC
GCGCGTCGTC TCGCGCCATC GCGCGCGCCG TCGCGGCCGC CGGCGCCGTC TGGCGTCGCG
CGTCTGGCGC CGCGCGTCCG ACCGACCGCG TCCGACCGAC CGCGTGTATG CGTAAACAGT
GAGACGTACG ATACCACGAC ATTCGGCATC GAATTCAAAC GACCGCCCCG TTCGCCGCGG
CGCGCGACAC CGCGACGCGC GCGCGACGTC CGACGCGGTC GTCATGCTCC GACTGAAATC
GCGAAGCGTT CTGTGCACGC CGTCCAGCGG GCTGTCCAGA GCGCTGCGCA CGCTGAACGT
CGCGACGCCG AACGCGCGC
 
Protein sequence
MRTPLEVVEA IERCVSVPGG EVLDAASETL RAIRVEQRRI REELRTMLNA TSKEMARKNF 
AERAQIVTRL GRQCIPMKLG SAGELPGVVL DVSGTGNTVF KEPQIAVPLN NALATLSASE
DAEIERILVE LTSIVQTHAD VLLDANEALT ELDVANARAR HAEWFDGAEP TIVDANQGMC
VRELQHPLFR HWRRGTDVRD VVVPIDFNVD SSIKCVTITG PNTGGKTASL KAIGVACLMA
RAGLYLPCES GCEIPFFRHV IADLGDSQTL ELDGGLSTFG AHLKGLQRIL DAATDDTLVL
LDEPGSGTDP AEGASLAVAV LNKLSRTSRL TIATSHYEEV KEATLASDTA QVAAVEFDLQ
SLQPTYRLLW GETGKSNALH IAAGLGLEPW ILAEARIALA KADANAEVDA SGAIARENRA
KLASALDEEC DVQLARRAAA AATLEETRAL FDEVRSKSAH LDLRKQIIRD DANNEIERKI
EEARDGLSRA LRTLNVATPN AR