Gene OSTLU_43144 details

Gene Information

Locus tag: OSTLU_43144
Symbol:
ID: 5005428
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009368
Strand:
Start bp: 629151
End bp: 630843
Gene Length: 1693 bp
Protein Length: 542 aa
Translation table:
GC content: 56%
IMG OID: 640420849
Product: predicted protein
Protein accession: XP_001421519
Protein GI: 145354496
COG category: [L] Replication, recombination and repair
COG ID: [COG0420] DNA repair exonuclease
TIGRFAM ID: [TIGR00583] DNA repair protein (mre11)
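
The gene length above follows from the start and end coordinates. The sketch below shows the arithmetic, assuming the coordinates are 1-based and inclusive (the record's own numbers are consistent with that reading). The function name gene_length is hypothetical. The last value compares the genomic span with three bases per residue of the 542 aa protein, assuming the span carries no stop codon; the remainder is plausibly intronic, since the gene is marked as spliced.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature from 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
start_bp, end_bp = 629151, 630843
protein_aa = 542

length_bp = gene_length(start_bp, end_bp)   # genomic span of the gene
coding_bp = protein_aa * 3                  # bases needed to encode the protein
noncoding_bp = length_bp - coding_bp        # bases in the span not accounted for by codons

print(length_bp, coding_bp, noncoding_bp)   # prints: 1693 1626 67
```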


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.275751
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.938333
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGACCGG CCGCGGGGGT GAAACCGCCC GATGCGAACA CGCTGCGCGT GCTCATCGCC 
ACGGACACGC ACCTGGGCGC GCACGAGCGC GATCCGATTC GAAAAGATGA CGCGTTTTTA
GCGTTTGAAG AAATCTTCGA TCACGCGAGA AAACAACTCT GTGATTGCGT GTTTCTCGCG
GGAGACGTGT TCGACGTGAA TAAACCGAGC CGAGAGACGC TGGTGCGGTG CATGGACGCG
CTGCGGGAGG CGACGCGAGG GAATAAAGGG ATCGAGATTG AAGTTTTGAG CGATGGGAAG
GAAAACTTTC CGAGTCGAGG TGCGCGCGAA GCGAGGCGAA GACGAGGTCG AAACGATGAC
TGACGGCGCG GATGCGGGAA AATCAGGCAT GGCGAATTAT GAGGATCCGA ATTGTAACGT
GTCGTTGCCG GTGTTTAGCA TACACGGGAA TCACGACGAT CCCGCGGGGG AGGCGAATTT
GAGCGCGATG GACGTGCTCG CGAGCGCGGG ATTGGTGAAT TATTTCGGCA AGCACGCGCT
CGGTGGGGGG GGAGCCGGTC GCGTGGACTT GAAGCCGGTA TTGTTACGTA AAGGACAGAC
TAAGGTGGCG CTGTACGGGT TGGGATACAT TCGCGATAAT CGTTTACATC AAATGTTTAG
CGTCAAGGGA TGCGTGCGAT GGCATCGACC GGCGGAGACG GAGGATTGCT CGTCGAGTTC
GTGGTTTAAC GTGATGTTGA TTCATCAAAA TCGAGCCGCG CATTCGAAGA ATGCGATTTC
CGATCGTTAC TTGCCGAGTT GGTTGGATTA CGTCGTTTGG GGGCACGAGC ACGAGTGTTT
AGTGGAGCCA ACCGAGAGCG CGCAGGGATT TCACGTGTCG CAACCGGGAT CGAGCGTGGT
GACGTCTTTG ATTGAAGGCG AGGCGAAGGA GAAGAAGATT TGCGTGCTCG AAGTGCGAAG
TGATCCGGAG AATCCGAATA GCGCGCCATT CTGGCGCACG ACGCCCATCA CCTTGCGAAC
GACGCGACCG TTCGAGTTTG AGCAAATGAC GTTGGCGAAC ACGCCCGAGC TCGAAGGCGC
GGATGCCCAA GGCGTGGCGA CGTATCTGGA GAACCGCGTG AACGCCATGA TAGTCCGGGC
GGGGCGCAAG CATAGAGAAC GACACGCGAA AAATGGGAGA GACGATGTCG ACATGCTCGA
CCGCTTGAAT TTGCCTTTGA TTCGTCTGCG CGTGGATTAC TCGGGCGGCT TTAGCACCAT
CAATCCGCAG CGCTTCGGTC AAAAGTTTGT CGGCAAGGTG GCGAATCCGC ACGATGTTTT
GCTCTTTCAT AAATCTCAAA AGAAGCAACG TCGCGATGGC GTGGACGTGG ATGAAGACAT
GATCGATGAG GAGGCGGCGG CGTTGGAGGA GGAAGACGCC CTCGCCGATG GCATGCTCGA
GAATCAACGA CGAATCGATC GACTCGTGCG CGAACACTTG TCGACGAGCG ACGGTTTACA
ACTTCTCACA CCTAACGATC TCTCCGCCGC GCTCGACGAT TTCGTCAATC GCGACGAAAA
GGCGGCGATT TCCAAGCTTT GTCAAACGCG CTTAAAGGCG GTGCAAACAT CGGTGAATGC
GGATGATCAA GAAAACACCG ACGACGTCGA TCGATTGACT TCGAAGATTT ACGAAGCCGT
GAAGGTGCAG TTA
 
Protein sequence
MRPAAGVKPP DANTLRVLIA TDTHLGAHER DPIRKDDAFL AFEEIFDHAR KQLCDCVFLA 
GDVFDVNKPS RETLVRCMDA LREATRGNKG IEIEVLSDGK ENFPSRGMAN YEDPNCNVSL
PVFSIHGNHD DPAGEANLSA MDVLASAGLV NYFGKHALGG GGAGRVDLKP VLLRKGQTKV
ALYGLGYIRD NRLHQMFSVK GCVRWHRPAE TEDCSSSSWF NVMLIHQNRA AHSKNAISDR
YLPSWLDYVV WGHEHECLVE PTESAQGFHV SQPGSSVVTS LIEGEAKEKK ICVLEVRSDP
ENPNSAPFWR TTPITLRTTR PFEFEQMTLA NTPELEGADA QGVATYLENR VNAMIVRAGR
KHRERHAKNG RDDVDMLDRL NLPLIRLRVD YSGGFSTINP QRFGQKFVGK VANPHDVLLF
HKSQKKQRRD GVDVDEDMID EEAAALEEED ALADGMLENQ RRIDRLVREH LSTSDGLQLL
TPNDLSAALD DFVNRDEKAA ISKLCQTRLK AVQTSVNADD QENTDDVDRL TSKIYEAVKV
QL
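
The sequences above are printed in blocks of 10 characters, 60 per line. The sketch below is one way to strip that layout before checking a length or a GC fraction. The helper names clean_sequence and gc_fraction are hypothetical, and the example uses only the first line of the gene sequence, so the GC value it prints is for that line alone, not the 56% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C characters in a nucleotide sequence (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGCGACCGG CCGCGGGGGT GAAACCGCCC GATGCGAACA CGCTGCGCGT GCTCATCGCC"

seq = clean_sequence(first_line)
print(len(seq))                      # number of bases on that line
print(round(gc_fraction(seq), 3))    # GC fraction of that line only
```

Running the same two helpers on the full gene sequence should give 1693 bases if the record is internally consistent.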