Gene OSTLU_40782 details


Gene Information

Locus tag: OSTLU_40782
Symbol:
ID: 5002498
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009360
Strand:
Start bp: 425163
End bp: 428504
Gene Length: 3342 bp
Protein Length: 1113 aa
Translation table:
GC content: 60%
IMG OID: 640417919
Product: predicted protein
Protein accession: XP_001418476
Protein GI: 145348063
COG category: [L] Replication, recombination and repair
COG ID: [COG0249] Mismatch repair ATPase (MutS family)
TIGRFAM ID:
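
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS that ends in a stop codon. Both are assumptions, although the record lists the gene as not spliced and the gene sequence ends in TGA. The function names are illustrative, not part of any database API.

```python
# Values copied from the Gene Information record.
START_BP = 425163
END_BP = 428504


def gene_length(start, end):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Amino acid count for an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 3342, matching "Gene Length"
print(protein_length(length_bp))  # 1113, matching "Protein Length"
```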


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.119207
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.227387
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGATCG AGTCGAGCGA TGACGATGGT GGGAGCGATT TCGACGCGGA GGCGGAGGAA 
GAGGACGACG ACGACGAAGC GTTTGAGGCG GATTCGAGCG AGGCGTCGAG CGAAGACGAG
GAAGAGGAGG ACGACGACGA TTTCTCAGAG AGCGAGTCAA CGCCGGTGAA GAAGAAGAAG
ACGACCAAAG TGGCGGTGAA GAAGGTGGTG CCGACGTCGA CGACGAATGG ACGAAAGAAG
ACGACGGTGA ATACGACGTC GGCGCCGACG AAGGCGACAG AGGCAAAGGA TTTGAACGCG
CAGGACGTTC TCGCAGGCGT CGGTCCGGAG CAATACGACG CGCGAGATCG GTTGAAATTT
CCATTTATGC AACCGGAGAA GATTAAGGAT GCCGATGGTC GGCGTCCTGA CGATCCGGAC
TACGATCCGT CCACGTTGCT GTTGCCGTCG ACGTTTCCAA AAATGCGCGA CGCCAGCGGA
GTGCAGTGGA CGGTTTCCCC CGGGCAGGCG CAGTGGTGGA AGTTCAAGGC GGCGAATTTC
GATTCTGTAT TGTTGTTCAA GATGGGCAAG TTTTATGAAA TGTTCGAAAT GGACGCGCAC
ATCGGTGTGC GAGACTTAGG ATTGATGTAT ATGCGCGGCG AGCAACCGCA CGCTGGATTC
CCAGAGAAGA ATTACGCGAT GCACGCCGAG CAGCTCGCTC GCAACGGTCA CCGCGTCGTG
TGTATTGAAC AGACAGAGAC ACCAGCGCAG CTTGCCGAGC GAAAAAAGAA AGACAAGACG
TGCAAGGACA CTGTGGTTCG CCGCGAAATG GTTCAAGTGT TGACAAAAGG TACGATGGTT
GACACCGGTA TGTTGAACTC TTCGCCGGAT GCTGCCTTCG TTTGCTCTAT AATAGACGGC
TGCGAAGAGG AGGACGGCGA AGGTTGGGTC GGTTTGTGCG CTGCAGACTG CGGTACGGGT
CGTTTCCTCG TCGGTGCGTG GCGCGACGAC GAAGGTGCAA GCTGTTTGCG CACAGCACTT
GCGGAGCTGC GCCCGGTTGA AATCTTAGTT CCGCCCACTG GCTTGAGCGC GCGAGCGAAG
ATGGCAGTTT TGGATATGTG CTCACACGCT CAGCAGCGTA CGTTTAAGTC GACGAGTGCG
AACGAAGCGC TGGAGGACGC CGAGGCGGAG GGATACTTCA AGACGCTTAA GACTGGTTTG
CCAGAAGCCA TTAAAGAAAT GCGAGACACT GCATGCCATC CTGCTCGCGA ATGCGGTATC
GGAGCGTGGG GCACGGTCGT TGCGTACTTA CGCGCCGCGC TGATAGACGC CGATTTAGTG
CCACAAGGGC GAGTAGAGTC CCTGCACACA ACAGACGCCG GCGCGCGGGA GCACTTGGCG
CGATGGGCAC ACTCTACGCA TGTCGCCATG GATGCCGCTG CACTTTCTGG GCTTGAGGTG
CTCGAGAATA CCGCAGGTGG CTCCGCGGGT ACGCTTTTGG CGTCGCTCGA TCGGTGTGTG
AGCGGGCCCG GTCGTCGTTT GTTGCGCCGT TGGGTGTGTC GCCCTTTGAC AAGCGCGTCA
GCTATTCGAG CTCGACAAGT CGCCGTCTCT ATGATGCGGG GATGCGGGAT CGAGGCCACA
GGGATTGCGC GCAAGCTTTT GCGCGCCGCG CCAGACGCGG AACGCGCCAT CTCGCGCGTT
GTCGGATCGA GCGGCGAGAA AGGTCGATCC GCCTCGCACG TCGTCTTGTA TGAAGATGCG
GCGCGAGCAA AGTTGAACGA TTTCCTCGCA GCGCTCGAGG GTATTCGTGC GGTTCGCGAC
GCTACAAAAG CGATCGCGGC GTGCGTTGAT GCATGTGAGA AATCTGACGT GCTTCGCGCA
TTGTGCATCG TAAACGACGC CGCGGCGACG CGCGAGGACG TGTTCACCGC AGTCGGCGGC
GTCGCGATGC CGGATCTGAG CGCGCTGGAC GAAATGGAGT CTGCGTTCGA TTGGAACGCC
GCCAAATCGA GTGGGAGAAT CGAGCCGGCG CAAGGCGTCG ACGCCGATCT GGACGCTGCC
GAGGAGCAAC TTACAGCAGC GGACGCGGAT CTGGCGTCGT GGTTGGAAGA GGCGCGCGGC
GAGCTCGGTG GTCACAAAAC GGAGGTTTGT TTCGTGAATG CAAACAAAGA TACGCATCTT
GTCGAAGTTC CTGACCGCCT CGCGTCCAAG GTCCCTCATC ATTGGGTGCG TGAAGGTAAG
CGCAAAGGAT ACGAACGGTT CACGTGCGAT GATTTGGTAC CCTTGCGCGC GAAGCGAGTT
GCGGCGGAGG AAGCGCGCGA GGACGCGCTC GCCGGCGTGT TCCGTCGAAT CGTCGCAAAG
TTTTGCGAGA ACGCTTCGGA GTGGCAAGCG GCGGCAAGTG TTGGTGCAAT CATAGATGTT
CTAGCCTCTT TGGCTGTCGT GAGCGAGGAA ATGTACGCTT CTTGTGGAGC AGTTTGCACC
CCGAAGGTGC ACCCGCAACC GCGAGACGGC GAGCCCGCGA CGCTTGAGTC CGTCGGCCTG
TCGCACCCGT GCGCGTCTTC TCTCGCGCGC GCGTTCGTTC CGAACGACGC CAGGCTTGGT
GGTAAACATC CGGGTTTCTG TCTCATCACT GGTCCGAACA TGGGTGGCAA GTCCACGTAC
CTCAGGCAAG TCTGCTTGGC CGCGATCATG GCGCACGTGG GCGCCGACGT TCCCGCGGCC
AAGTTTGAGA TGACAGCTAT GGATGCGGTT TTTGTGCGCA TGGGCGCCAA AGACAATTTA
GCGGGTGGGC AGTCGACATT CATGGTCGAG CTCAGCGAAA CTGGAGCGAT GCTTCGTCGA
GCGACGACGA ATTCCCTAGT CGCGCTCGAC GAACTCGGCA GAGGAACCGC CACCGCGGAC
GGTACAGCCA TCGCGTGTGC GGTTGCGTCG CATCTCATCG ACAAGCGCTG CCGTACGCTC
TTCAGCACGC ATTACCATAG ACTCGCCGAC GACCACGCGC GCGACCCTCA CGTGGCGTTG
GCACACATGG CGTGTCGAGT CGAGACGCCG AGCGGAGCAG GTGTCGAGAC GCTCGGACGC
GAAACGGTCA CCTTTCTGTA CACCCTCGCG AGCGGCAACT GTCCGCGAAG CTACGGTGTC
AACGTCGCGC GTCTCGCTGG TTTACCCGAG AGCGTTTGCC TCGCCGCCGC GCGTCGCGCG
GCACACCTCG AGGCTGGCCA ACTGGAGAAG CTCGTCGGCG ACGAGCGCAG TCGCGAGCGC
GTCGTCGACG CGTGCCGCGA AGTCCTCCGC GACGTCTCCG CCGCAGACTT GACCGAACCA
AACGTTCGTC GCGCGCAAGA TAAAGCCCGA ACCGCCCTGT GA
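
The gene sequence is printed in space-separated blocks of 10 bases, so the blocks have to be joined before any computation. As a minimal sketch, the helpers below strip the whitespace and compute the G+C fraction, which is one way a figure like the reported 60% GC content could be recomputed. The helper names are hypothetical, and this is not necessarily how the database derived its value.

```python
def clean_sequence(text):
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence; paste the full sequence to check the whole gene.
first_line = "ATGGTGATCG AGTCGAGCGA TGACGATGGT GGGAGCGATT TCGACGCGGA GGCGGAGGAA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 3))
```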
 
Protein sequence
MVIESSDDDG GSDFDAEAEE EDDDDEAFEA DSSEASSEDE EEEDDDDFSE SESTPVKKKK 
TTKVAVKKVV PTSTTNGRKK TTVNTTSAPT KATEAKDLNA QDVLAGVGPE QYDARDRLKF
PFMQPEKIKD ADGRRPDDPD YDPSTLLLPS TFPKMRDASG VQWTVSPGQA QWWKFKAANF
DSVLLFKMGK FYEMFEMDAH IGVRDLGLMY MRGEQPHAGF PEKNYAMHAE QLARNGHRVV
CIEQTETPAQ LAERKKKDKT CKDTVVRREM VQVLTKGTMV DTGMLNSSPD AAFVCSIIDG
CEEEDGEGWV GLCAADCGTG RFLVGAWRDD EGASCLRTAL AELRPVEILV PPTGLSARAK
MAVLDMCSHA QQRTFKSTSA NEALEDAEAE GYFKTLKTGL PEAIKEMRDT ACHPARECGI
GAWGTVVAYL RAALIDADLV PQGRVESLHT TDAGAREHLA RWAHSTHVAM DAAALSGLEV
LENTAGGSAG TLLASLDRCV SGPGRRLLRR WVCRPLTSAS AIRARQVAVS MMRGCGIEAT
GIARKLLRAA PDAERAISRV VGSSGEKGRS ASHVVLYEDA ARAKLNDFLA ALEGIRAVRD
ATKAIAACVD ACEKSDVLRA LCIVNDAAAT REDVFTAVGG VAMPDLSALD EMESAFDWNA
AKSSGRIEPA QGVDADLDAA EEQLTAADAD LASWLEEARG ELGGHKTEVC FVNANKDTHL
VEVPDRLASK VPHHWVREGK RKGYERFTCD DLVPLRAKRV AAEEAREDAL AGVFRRIVAK
FCENASEWQA AASVGAIIDV LASLAVVSEE MYASCGAVCT PKVHPQPRDG EPATLESVGL
SHPCASSLAR AFVPNDARLG GKHPGFCLIT GPNMGGKSTY LRQVCLAAIM AHVGADVPAA
KFEMTAMDAV FVRMGAKDNL AGGQSTFMVE LSETGAMLRR ATTNSLVALD ELGRGTATAD
GTAIACAVAS HLIDKRCRTL FSTHYHRLAD DHARDPHVAL AHMACRVETP SGAGVETLGR
ETVTFLYTLA SGNCPRSYGV NVARLAGLPE SVCLAAARRA AHLEAGQLEK LVGDERSRER
VVDACREVLR DVSAADLTEP NVRRAQDKAR TAL
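
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The record leaves the translation table field blank, so this sketch assumes the standard genetic code. It also assumes translation starts at the first base and stops at the first stop codon. The `translate` helper is illustrative only.

```python
from itertools import product

# Standard genetic code (an assumption; the record does not name a table).
# Codons are ordered TTT, TTC, TTA, TTG, TCT, ... with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence.
print(translate("ATGGTGATCG AGTCGAGCGA TGACGATGGT"))  # MVIESSDDDG
# Final four codons of the gene sequence, ending in the TGA stop codon.
print(translate("ACCGCCCTGT GA"))  # TAL
```

Both outputs match the first and last residues of the protein sequence listed above.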