Gene OSTLU_28688 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_28688 
Symbol 
ID4999580 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009355 
Strand
Start bp143477 
End bp146602 
Gene Length3126 bp 
Protein Length936 aa 
Translation table 
GC content61% 
IMG OID640415001 
Productpredicted protein 
Protein accessionXP_001415744 
Protein GI145341286 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0163307 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGAACG AACCCGACGC GCGCGCCGCG ACGGCGACGG GCGCGCTCGC GCTCGACGAC 
GCCGCGAGCG CCAAGTTCGT GCGGTTTTAC CGCGGGCTTC CGAGCGAGAC GGCGCGCGTG
GTGCGGTTTT TCGATCGTAA AGATTGCATC AGCGCGCACG GCGATGACGC GATGTACATC
GCGCGGGCGT TCTATAAGGT GCGTCGCGAG CGCGCGGGCG CGCGGGTGCG ATCTTCGAAC
GCGTCGGCGC GCGCGCGGGG CGCGGGGCGC GCGCGGAGGC GACGAGAACG CGGATGATGA
TTGGTGATTT TTGAATATCG GCGCGCGGCT GTCGCCGAGA CGCGAAATCG GCGACGAGCG
CGGCGGCGCG ATCGTCGAGG GTGCGCTCGA GCGATGCGCG GGCGAGATGA CTGACGCGAA
GCGTTTGACC GTTTGGTCCA CTTTTGACGT CGGAGCAGAC GACGAGCGTG ATCAAGACGA
TGGGATCGGG CGACGACGCG TTGCCGGGCG TCGCGCTCAA CCGATCGATG TTCGAGAGCG
CGCTTCGCGA ACTTTTACTC GATGGTGACG GCGCGCGCGT GGAGTTTTAC GAAGAGAGTA
AACCGAGCGG GACGTGGACG TGCGTGAAGT CGGCGTCGCC GGGGAAATTG CAAGCGTTCG
AGGATGAGCT GTTTCGGTCG AACGAGATGT CGGACGCGTC TGTGGTGTGC GCGGTGCGCG
TGGCGAACGG GAACGTGGGC GTGGCGTACG CCAACACGAC GACTCGGGAG CTCGGGGCGT
GCGCGTTCGT GGATGATGAG CAGTATTGCA CGCTGGAAAG CGTGTTATGT CAGATTGGAG
TCAAAGAGTG CGTCGTGCCC AAGGAAGGGA CCGAGACGCC GGAGGGGCGT CGATTGCGAG
ACGTCGTCTC GCGATGCGGT GCGCTCGCCA CGGAGCGTCA GGCCCGCGAT TTCGACGCGC
AAGACTTGGA GAATGATTTG GGGCGACTCG TGCGAGGAAA CGTTGAGGCG CATCGGGCGG
TGATTGATCA ATCGCACGCC GCAGCGTGTC TCGCGGCGGT TTTGCGATTC AGCGAAATGT
TGGCGGATAG CGCCAACCAC GGTCGATGCA CGCTGAGTAT GTACGACACC GGTCGGTACA
TGCGACTCGA TGCTTCGGCA CTGCGCGCTC TGAATGTTCT CCCTGAGCGC TCTGACGGCC
CGAGCAGCTT CAGTTTGTAC GGTTTATTGA ACAAGTGCCG AACGCCGATG GGGCGCCGTT
TGCTCTCGCG ATGGCTGAAG CAGCCGCTCG TGGACGTTAA CGAAATCGCC ACGCGACACG
ATGTCGTGAA CGAATTCGTC ACCAATGCAG AGGTTCGTGA CGCCCTGCGC GGTGCGCACT
TGCGAGCGTT GCCGGACATC GAACGGATCA CTCGTAAGCT CGAACGCCGC AAGGCGTCGC
TCATGGATTT GTGCCGATTG TACCAAGCGA GCGCGGCGCT TCCACACATG GCCGAGGCGC
TCGAACGGTG CGAGGGACGT CATGGCGATT ACATTCGCAA GAAGTATGCG GAAGAACTCA
AAAAGCTGAG CGCCCCGAGT CATCTCGGCC GGTTCGAGGC GCTCTTAGAA GCGGCAGTGG
ATCTGAGCAA GATTCCCGAC GAGTACGTCA TTTGCGCTTC GTACGACGCC GAGCTGGGCG
AGTTGCAAAA ACAGAAGGAT ACACTGGAGA AGCAAATCCG TGATGCGTTT GCGGATGCGA
GCGATGATTT GGGCATGGAA CGCGATAAGC AATTAAAGTT AGAACACAAC AACATGCATG
GTTGGTTCAT GCGATTGACG AAGAAAGACG AAACGAGCGT GCGAAAAAAG CTCAGCGTGA
GTTATCAGAT TCTGGAAGCG AAGAAGGATG GCACGAAGTT TACGAACAAG AAGATTCGTG
GTCTTTCGGA GCAGCGTGTA TCTCTCGACA GAAGCTACGA CGCGAAGCAG CGACACTTGG
TCGATCGCGT CGTCGACGTC GCGGCGACGT TTTCCGAAAT CTTTCTGAGC GTTTCGGCGA
TGACGGCGGA AATCGACGTC CTCGCGTCTT TCGCTGAGGT CGCCGTCAGC GCGCCGGTGC
CGTTCGTGCG CCCGATTATG CACGAGAAGA CGTCTGATAC GATCCACCTG GAGAACTCGC
GCCATCCAAA CGTCGAAGCG CAGGACAACG TGCGATTCAT CGCCAACACG TGCTCGATGA
AGAAGGGCGA GTCTTGGTTT CAAATTATCA CTGGCCCAAA CATGGGCGGT AAGAGTACAT
TTATTCGCCA AGTTGGCGTG TGCGTGCTCT TAGCGCAGGT TGGAAGCTTC GTGCCGTGCG
ACGACGCCGT GATTGCTGTA CGAGATGCCA TCTTCGCGCG CGTCGGCGCC GGCGACTGTC
AACTCCGCGG CATTTCCACC TTCATGGCGG AGATGTTAGA AACCGCCGCC ATCTTGAAGG
CCGCGACTTC ATCGAGTCTC GTTATCATCG ATGAACTCGG CCGCGGAACG AGTACATACG
ACGGATTCGG TCTCGCGTGG GCGATAAGCG AGCACATCGT CAATGAGATT CAAGCGCCGT
GTCTGTTTGC CACTCACTTT CACGAGCTCA CCGCGCTCGA AGGCCCGAGT GGCGTCTCCA
ACTTTCACGT CGAGGCGCTC ATCGACCAGG AGAGCCGAAA GCTCACCATG TTGTACCAAA
TCAAGCCGGG GGCGTGCGAT CAGTCGTTCG GGATTCACTG CGCCGAGTTC GCGCGGTTCC
CCGAAGAAGT GCTCAAAATC GCTCGCGCAA AGGCGGACGA GCTCGAAGAC TTTTCCAAAT
CCGGCGCCGA GCGCGCCGTC GCCGACATCT CCGACCCCAA GCGCCAACGA ACCGATGAAC
CCGGTGTATC CGACGACATG GCCCGCGGCG TCGTCCGCGC TCGCCAATTC CTCTCCGACT
TCGCCGCCGT CCCTCTCGAC CGCATGACGC CCGCCGAAGC CGTCGCGCGC GCTCGACAAC
TCAAATCCGA GCTCGAGACC GACGCCAAGC ACTCCCCTTG GCTCCTCGAC GTCCTCTCCA
ACGCCGCGTG ATCACTCAGC CGTCCGCGAT CCGCGCGCGC GTCGCGTTCG ACCGCGTCGC
GCCCAG
 
Protein sequence
MSNEPDARAA TATGALALDD AASAKFVRFY RGLPSETARV VRFFDRKDCI SAHGDDAMYI 
ARAFYKTTSV IKTMGSGDDA LPGVALNRSM FESALRELLL DGDGARVEFY EESKPSGTWT
CVKSASPGKL QAFEDELFRS NEMSDASVVC AVRVANGNVG VAYANTTTRE LGACAFVDDE
QYCTLESVLC QIGVKECVVP KEGTETPEGR RLRDVVSRCG ALATERQARD FDAQDLENDL
GRLVRGNVEA HRAVIDQSHA AACLAAVLRF SEMLADSANH GRCTLSMYDT GRYMRLDASA
LRALNVLPER SDGPSSFSLY GLLNKCRTPM GRRLLSRWLK QPLVDVNEIA TRHDVVNEFV
TNAEVRDALR GAHLRALPDI ERITRKLERR KASLMDLCRL YQASAALPHM AEALERCEGR
HGDYIRKKYA EELKKLSAPS HLGRFEALLE AAVDLSKIPD EYVICASYDA ELGELQKQKD
TLEKQIRDAF ADASDDLGME RDKQLKLEHN NMHGWFMRLT KKDETSVRKK LSVSYQILEA
KKDGTKFTNK KIRGLSEQRV SLDRSYDAKQ RHLVDRVVDV AATFSEIFLS VSAMTAEIDV
LASFAEVAVS APVPFVRPIM HEKTSDTIHL ENSRHPNVEA QDNVRFIANT CSMKKGESWF
QIITGPNMGG KSTFIRQVGV CVLLAQVGSF VPCDDAVIAV RDAIFARVGA GDCQLRGIST
FMAEMLETAA ILKAATSSSL VIIDELGRGT STYDGFGLAW AISEHIVNEI QAPCLFATHF
HELTALEGPS GVSNFHVEAL IDQESRKLTM LYQIKPGACD QSFGIHCAEF ARFPEEVLKI
ARAKADELED FSKSGAERAV ADISDPKRQR TDEPGVSDDM ARGVVRARQF LSDFAAVPLD
RMTPAEAVAR ARQLKSELET DAKHSPWLLD VLSNAA