Gene OSTLU_432 details

Gene Information

Locus tag: OSTLU_432
Symbol:
ID: 5002516
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009360
Strand:
Start bp: 696848
End bp: 699334
Gene Length: 2487 bp
Protein Length: 829 aa
Translation table:
GC content: 60%
IMG OID: 640417937
Product: predicted protein
Protein accession: XP_001418550
Protein GI: 145348213
COG category: [L] Replication, recombination and repair
COG ID: [COG0323] DNA mismatch repair enzyme (predicted ATPase)
TIGRFAM ID: [TIGR00585] DNA mismatch repair protein MutL
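
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the listed gene sequence carries no stop codon (both assumptions are consistent with the numbers in this record, but neither is stated by it). The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = False) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # Drop one codon only if the sequence includes a terminal stop codon.
    return codons - 1 if has_stop_codon else codons


# Values copied from the record above.
length = gene_length_bp(696848, 699334)
print(length)                     # 2487
print(protein_length_aa(length))  # 829
```

Both results match the Gene Length and Protein Length fields of the record.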


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0880854
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.487842
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATCAAACGAA TCGACGACGT CGTCGTGCAT CGTATATGCT CCGGCCAAGT CGTCCTCAGC 
CTCGCGAGCT GCGTCAAGGA GCTCGTCGAA AACGCGCTGG ACGCCGGCGC GACGAACGTG
GAGATTCGAC TGAAAGACCA CGGCGCGGAC GTCGTCGAAG TGTCCGATAA CGGCTCGGGC
GTGCCCAAGG CGTCGTTCGA GGCGCTGACG ACCAAGTACG CGACGAGCAA GTTGAAGGCG
TTCGAAGACT TGGAGACGCT GCGAACGTTT GGATTTCGCG GCGAGGCGCT GTCGAGTTTA
TGTGGGATAT CGGGGGAGTT TAGCGTCACC ACGCGGACGG CAGACGATGC GAGTGGGACG
AAAATCGTGT ACGACGCGAA GGGGGCGATC GTGAGCGAGA GTGTGGTGCC GAGATCAGTG
GGGACGACGG CGACGGTGTG TCGGTTGTTC GAGCCGCTCG CGGTGCGGAG GAAGGAATTT
TTGAGAAACG TGAAGCGCGA GTACGGGAGA GCGTTGCACG TCGTGCAGGC GTATGCGTTG
ATGTCGAAGT CAGTGAGGAT ATTGTGCACG CATCAGAGCG GAAAGTACGG ACGGACGAAT
GTGTTGCACA CGCGCGGGGG CGAGGAAGCG TCAGTGCGGG AAAACGTCGT GACTGTGTTC
GGCGCGAAGA TGGTCGCGGC GATGCAAGAA ATCGACTTTG ATTTGAGTGA TGCCGATGGT
GATGGGTCGT CATCGTTGAC GTGTCGCATC GTTGGGTACG TCTCCAAGGC GCAAAATGGA
TGTGGTCGCG CGGGCACGGA TAGGCAATTT TATTACGTCA ACGGCAGACC GGTGGATTTA
CCGCGTGTGG CCAAGGTTCT TAACGAGACT TACCGTTCGT TTAACCCCAA CCAAGCGCCC
ATGGCGGTGT TAGACGTTCA ACTCCCCACG GACTCGTATG ACGTGAATGT GACGCCAGAC
AAGCGCAAAG TGATGCTACA CCAAGAGCAA GAATTGCTAA CCAAGATGAA GGAAAAGCTC
ACGGAGGCAT TCGCGCCGAG TCGATACACG TACGCCGTAT CGCAGGCGCC ACTAAGTAAC
ACGAAGAAAC GTCCCTCACT TTCGACATCG TTTGAAGGCG ATGACGGTGA CGACGTTATG
GAGGATGAGC ACGACGACGA AGAGTTGACG CCAACCGAGT GCATGGACGA AGAAACATTT
GAAACACTCT TGCTCACGGC AAAGGAGCCT GCCGCGAACG ATACTCGTGG AAGAGGAAAG
AAGCGTGACG CGAGCTCTGC GCAGAAGGGT TTACAAAATT TCGGCTTCAC GCGCGAAACC
ACGGCGGTGG CAATCGGTGG CGGTTGGACG ATGGCCACGG CGGACGGCGC AGGCGGCGAG
ACCGAGATGC GCGATGCTCC GGCGATTGTC GCGTTCGAGG AAGACGTCGT AAAGAAACCA
AAGGTTGGGG ACAGTCAATC CGACGACACG CGAACGAAAG AACCCGATAC CGAGGACGAA
CGCGCCAACG TGGGACACGA GCAAGTGACG TTTGACGACG TCGTGGTCGC CGCAGTGGTG
GAAGAACCTC GAGGACCAGA TTCAACGCGC GAAGACTCCA GTGGCGGCGC TTTGGCGTTT
TCGATGGAGA CGATGCGTGC GCGAAGGCGA AACGTGCGCA GCGAGGTCGT CACGACGGTG
GACGAAGCGT CCAAATCGAA GTCGGAAATC GCGTTCGCCG CCGCGCGAAT TCCCGCCGTG
GATGGCGAAA CGGAACCGTC TCACGCGACG CACGCGGCGG CGGCGAGCGA ACTTGAGCGC
GTGTTCAACA AAGCCGATTT CGCCAAAATG CGAATCGTGG GCCAGTTCAA TCTTGGTTTC
ATCCTCGCCG TGCTCGGCGA CGACTTGTTC ATCGTCGACC AGCACGCGAG CGACGAGATT
TACAACTTTG AGCGCTTGCA ACGCACATCG ACACTGACGC GGCAGCCGCT CATACACCCG
GTTCCGCTCG ACTTGACCGC GAGCGAGGAA CAGACGGTGT TGCAAAACAT GCCCGTATTC
TTGCAAAATG GCTTTGGGTT TTGCGACGTC GCCGAGACCG TTCCCGGCGC GGACATGAAC
AACTCCTCCA TCGATCCCAC GGCGAGATGC GGCGCGCTGC GACTGAACGC GGTTCCATTT
CTCAAGAACG TCGCGTTCGA CAAGTCCGAC GTCCAAGAGC TCGTGTCCAT GCTCGACCAA
GGCCAACATT CTCTTCCCTC CAAGAGCCAA CTCTCCATCG GCCTCGCGCG AGAGGACGCC
GCCGCCGCGC GGTCGCGACG GGACGCCTCA CCGCGCGTGC TCCGCCCCTC CAAAACCCGC
GCCGCGCTCG CCATGAAGGC CTGTCGCTCG TCCATCATGA TCGGCGACGC CCTGGACGCG
CGTTCGATGC GCCGAGTCTT GCGCAACCTC GGCGCGCTCG ACGCGCCGTG GAACTGCCCG
CACGGCCGCC CGACGATGCG CCACGTC
 
Protein sequence
IKRIDDVVVH RICSGQVVLS LASCVKELVE NALDAGATNV EIRLKDHGAD VVEVSDNGSG 
VPKASFEALT TKYATSKLKA FEDLETLRTF GFRGEALSSL CGISGEFSVT TRTADDASGT
KIVYDAKGAI VSESVVPRSV GTTATVCRLF EPLAVRRKEF LRNVKREYGR ALHVVQAYAL
MSKSVRILCT HQSGKYGRTN VLHTRGGEEA SVRENVVTVF GAKMVAAMQE IDFDLSDADG
DGSSSLTCRI VGYVSKAQNG CGRAGTDRQF YYVNGRPVDL PRVAKVLNET YRSFNPNQAP
MAVLDVQLPT DSYDVNVTPD KRKVMLHQEQ ELLTKMKEKL TEAFAPSRYT YAVSQAPLSN
TKKRPSLSTS FEGDDGDDVM EDEHDDEELT PTECMDEETF ETLLLTAKEP AANDTRGRGK
KRDASSAQKG LQNFGFTRET TAVAIGGGWT MATADGAGGE TEMRDAPAIV AFEEDVVKKP
KVGDSQSDDT RTKEPDTEDE RANVGHEQVT FDDVVVAAVV EEPRGPDSTR EDSSGGALAF
SMETMRARRR NVRSEVVTTV DEASKSKSEI AFAAARIPAV DGETEPSHAT HAAAASELER
VFNKADFAKM RIVGQFNLGF ILAVLGDDLF IVDQHASDEI YNFERLQRTS TLTRQPLIHP
VPLDLTASEE QTVLQNMPVF LQNGFGFCDV AETVPGADMN NSSIDPTARC GALRLNAVPF
LKNVAFDKSD VQELVSMLDQ GQHSLPSKSQ LSIGLAREDA AAARSRRDAS PRVLRPSKTR
AALAMKACRS SIMIGDALDA RSMRRVLRNL GALDAPWNCP HGRPTMRHV
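
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1), because the Translation table field of this record is empty; the `translate` helper is hypothetical and ignores the spaces and line breaks used in the listing.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
# Assumption: the record does not state which translation table applies.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, ignoring whitespace in the input."""
    cleaned = "".join(dna.split()).upper()
    if len(cleaned) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    # Codons containing unexpected characters are reported as 'X'.
    return "".join(
        CODON_TABLE.get(cleaned[i:i + 3], "X")
        for i in range(0, len(cleaned), 3)
    )


# First three blocks and the last line of the gene sequence above.
print(translate("ATCAAACGAA TCGACGACGT CGTCGTGCAT"))  # IKRIDDVVVH
print(translate("CACGGCCGCC CGACGATGCG CCACGTC"))     # HGRPTMRHV
```

The two outputs match the first ten and last nine residues of the listed protein sequence.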