Gene OSTLU_25194 details


Gene Information

Locus tag: OSTLU_25194
Symbol:
ID: 5004282
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009365
Strand:
Start bp: 69442
End bp: 73251
Gene Length: 3810 bp
Protein Length: 1266 aa
Translation table:
GC content: 60%
IMG OID: 640419703
Product: predicted protein
Protein accession: XP_001420405
Protein GI: 145352119
COG category: [L] Replication, recombination and repair
COG ID: [COG0419] ATPase involved in DNA repair
TIGRFAM ID:
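
The start and end coordinates listed above can be cross-checked against the listed gene length with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (the record does not state its convention, so this is an assumption); the function name `gene_length` is illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# Start bp and End bp copied from the record above.
length = gene_length(69442, 73251)
print(length)  # 3810
```

Under that assumption the result agrees with the listed Gene Length of 3810 bp.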


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.551119
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGACGA CGCAGGGCGC GAGGAAACTG TCGTCATCGA TCGCTAACAA TCATTCTGGC 
GGGAAAACTG CCATCGCGCG TGCGGCGCGC GCGCGCGCGT TGACGACGCC CGCGCGAGGC
GGACGCGGGC GTGTGATCGA TCCGCGCGTC GTGCGGGCGG CGCCTGCGCG CGCGGTGGCG
GACGTCGGCG GCGAGGGTGG GGGCGCGACG ATTCGCGCGA GCGAGCTGTC GCCGGCGCAC
TCGAGCGTCG AGCGATGGAT CGTGTTCAGT GACTTGCACG TCAGTAAACG CACCATGGAA
ACGTGTCGAC GGGTGCTCGA GCGCGTGCAC GCAGAGGCGC TGAAACGCGA GGCGGGAGTG
GTATTTTTGG GAGACTTTTG GCACGCGCGC GGAGCGATTC CAGTCGAACC GCTCAACGAG
GCGCTGGCGT TGATGTCGAG CGCGAAGTGG ACGGCGCCGA CGATCATGAT TCCGGGGAAC
CACGATCAAG TCACCGCCGG AGGGTTATCT CACGCGCTCA CGCCGCTGGC GAAGGCTAAT
CCGAACATCG TCGTCTTTGA CGGGCCGACG CTTTACGGCG GCGCGTTGTG GTTGCCGTAT
CGCCGAAACT CGGATGAACT CAAGCGCGCG ATCGAGGACA CGCGAGGCGA GTTCAACGCG
ATATTTTGTC ACGCCGATGT CGTCGGAGCT AGCATGAACG AGACGTTCCA AGCGCGAGAT
GGATTAGACC CCGCGCTCTT CGGCGGCGCG AACACGTACA CGGGACACTA TCACAAACCG
CACGTCGTTC CGAACACAAA CATTACGTAC GTCGGGTCGC CATACGAAGT CTCGCGGTCT
GAAGCTGGAC AAAAGAAAGA GTTCATCGTT CTTGACTCGC AGACGTGGGT CGAAGGCGCG
AATGCGCGCG TGAGTTTGGA CATCGGGCCG AAACATTTCG CCGTCGAGGG CGTCGATGCG
AGCGCGCCGC CGACGGCGCG TCCGGGAGAC ATCATCCGCT GGACGCTTCC GATCGAAGCC
ATGGATGCGG TTTCCGACGC CGTACCGTCA GTGGTGCAGA AAGCTCGGGA CCAGGGGTTC
ATCGTAGAGG TGTGCTACGT CACCAAGGAT CTCACGGCGC GCATACCAAA GGCGGAGGAG
CTCGGGCCCG CGGGACTGTT CGATGCATAC GCCGTCGCCA GCGAGATGGC ACCGAGCGTC
AGCGCTTTCG GGCGCAAAGT TTTACAAGAA GTCGCCGCGT CTGATGATGC GCCCGAAACG
CGCCGACAAT CGAAAGGTGT GAGCGTTTCT TTTGAGACGG TGGAAGTAGA GGGATTTGGA
ACGTTTCAGT CGGCGACGCG TTACCCGCTC GGCGCCAGAG GAGTGTGCGT GGTCGTGGGA
GAGAACAAAT CCGACACGTG TTCGGATTCT AATGGTGCGG GGAAGACGAC GTTAGTGATG
TCCCCGATGT GGGCGCTTAC GGGACAGAGT GATTTGCGAA TCGACGGCGC GGGATCGGGC
AAGTCGCTCA CAAAGTCGGA CGTCGTCAAC GATTCGTCGA AATTCGGCCG CGTTCGCTTA
GAAGGGTTCT TGAACGGTGG CACGCCGTTT TGGGTGGAAC GTAAAGTGAA CAGAACGAAA
TTAGTGAGCT TAAAGTACGC CATTGATGGC GAAGAAAAGA CGATGGCTGA GTCAAAACTG
ACGCAGCAAG GCTTGAACGA TGATCTCGGC GCCGACGTCA TCGCGAACAC GACGTTTCAC
GGTCAGCACA CTGTAGGAGC TTTGCTCGAC GCGAACGACG CTTCGTTGAA AGCGGCGCTG
GGTAAACTCG TCGAGGCGGA CACGTGGACG AAGGCGAAGG ACATTTCTCG CAAGCGAGTT
ACCGAGGCGA GAGGAAACGT CAACGCCATC GCCGCCGAAG TCAAGGCGCG GGAAGAGTAC
ATCGCTCGAA CGCGCCTTCG ACGCGATCAA GCGCACGTGG AGAGTGAGAA GTGGGAATTC
GAGTACAGGC GCCGCGTGAG CGAGCTTGAG ACGAGCTCGA GCGCGGTTTC AACAACCTTC
ACGAAGTATT TAGCGAGGAC AAATCATTTC CTCCAACGAT TGAGTCGCGC GAGCGAGGCA
CTTGAAGTGA CGTCGAGTCA CGCCGAACGC GTGCTAGACT CGTCACGAAA TGACTCTGAT
GATGCGGCAC GTCGCTTTGA AGTGAAAGAA TCCGAGTTCG AGCACAAAAC GTCCGTCATC
GAAGGCGAAA TCGAACGACT TAACGGTGTC GTACGCGAGT GGCAATCCAA GGAGGCGAGC
GCGGGCGCCA TCGGACGTCA AAGTCACGCC GCAGTCGGTA TGTTCGCGGG CGCTGGAGGC
ACACACTCGC ATCCAGATGG TGTCGGTACG TGCGATCGTT GCTTACAAGC CATCGACCCG
ACGCATCACA AAGAAACGCT CATAAAATTG AAGGATGAAG CGCGAAAGAC GGCGATTGAG
CACGGGAACG CCATCGAGCA ACTCAAGCGC GCCACGGCGG CTCTGAACAA GGCAATGGAT
GATAGACGCA ATCTCACCGA AGCAGCCGCG CTTGCACGCA AAGCCGAACG CATGCGCGCG
GAAACCACCG CGACAGCTTC CGCAAACGCG ACAGATCAGT TGCGACATTC TCAGAGGTCG
TTGAGCCTCA TCGGCAATGC GGTGGCGAGG GCGGAAATGT TGTTAAACTC AGCGCCCGAG
GACGTCGTCC ACGAAGCGCT GACAATGTCA ATCGATGATG CATTTGCGAT GAGCGAGTCG
ACGATCAGTA TATCCGGCGC GTCCTCGAAC GTGCTTGCGG GCCTCGATCC CGACTTGACG
CGAGGATCGA TGTCGTCTGC GTCTTTGTCG AACGCCTCTC CGCAAATCCA GCTCCTTCGC
GACGAACAAA TGAACGTCGC CGACGCGGAT TTCGTTCGAC AGTTGGTCAA AGGTGCAGAA
GACGCCATCA TGGATGGCGA ACGCGCCGGG CGTGACGCGG CACGACGCTT GCAAGAGCTG
AATGAGTTCT CGCGTTCGGC GCGTTCAAAT CCACACACGA GCGCGCTCGA AGAGCTCGAC
GCGCAACTCG CGGGTGAAAG CGAATCTCTC GGCGCGAGAA TCGAATCCTT GAATGGCGCA
AAAGAGCTCC TTGGCGTCGC TCAAGCCGCA GACACAGCGT TTAGCACGAA GGGCATTCAG
AGTTATCTGT TTGAAGGCGC GCTCGGAGAT TTGAGCGCGC GCGTGGGTCA ATACATGGAC
GCCCTCACCG GCGGTGCCTT GACGCTCGAG CTTCGTCCCG CGGGAGCGGC GTTCACCGGG
GACGATAACG ACGTCGTGCA GAGCGACGTC GACGCGGACG AGACGGCGAC GAGAAAGACG
AAGAGCAAAT CGGCCAAGGC GCCCGCGAGC GCGAGCGCCG CCGAACGCAT CGAGCGCGTG
ATCCACGCCC GTCGCCCCGA CGGCAGTTTG ATCGCCCGTT CGCTTCGACA ATTATCTGGC
GGAGAACGCC GACGCGCCGC TTTGGCGCTC GCTTTGGCGT ACGCCGATCT CGCCTCGGAG
CGATGCGGCG TCGCGTGCGA CGCCCTCGTC CTCGACGAAG TTCTCCAGCA CCTCGACGCC
GAGGGCATCG CGCGAGTCAC ATCCCTCCTC CGCGCGCTCC CTAAGCGTAC GGTATTACTC
ACGAGTCAAG CCGACAGCGC GACCGCGCAC TTATTCGACG TCGTTGACAA GGTTGTCAAA
TCCGACTTCG GCTCGGGCGT CGTCGTCAGC GCCGGCGACG ACGCCGATCT CGTCGACCTC
GCCGCGCGCG CCTCCGCCTA AGAATTACTA
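
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be cleaned before any statistic such as GC content can be computed from it. The following is a minimal Python sketch of that step; the helper names `clean_sequence` and `gc_percent` are illustrative, and only the first line of the sequence is used here as a sample.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used as a small sample.
sample = "ATGACGACGA CGCAGGGCGC GAGGAAACTG TCGTCATCGA TCGCTAACAA TCATTCTGGC"
seq = clean_sequence(sample)
print(len(seq))
print(round(gc_percent(seq), 1))
```

The sample line alone is not expected to match the listed GC content; the comparison is only meaningful when the full sequence is passed in.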
 
Protein sequence
MTTTQGARKL SSSIANNHSG GKTAIARAAR ARALTTPARG GRGRVIDPRV VRAAPARAVA 
DVGGEGGGAT IRASELSPAH SSVERWIVFS DLHVSKRTME TCRRVLERVH AEALKREAGV
VFLGDFWHAR GAIPVEPLNE ALALMSSAKW TAPTIMIPGN HDQVTAGGLS HALTPLAKAN
PNIVVFDGPT LYGGALWLPY RRNSDELKRA IEDTRGEFNA IFCHADVVGA SMNETFQARD
GLDPALFGGA NTYTGHYHKP HVVPNTNITY VGSPYEVSRS EAGQKKEFIV LDSQTWVEGA
NARVSLDIGP KHFAVEGVDA SAPPTARPGD IIRWTLPIEA MDAVSDAVPS VVQKARDQGF
IVEVCYVTKD LTARIPKAEE LGPAGLFDAY AVASEMAPSV SAFGRKVLQE VAASDDAPET
RRQSKGVSVS FETVEVEGFG TFQSATRYPL GARGVCVVVG ENKSDTCSDS NGAGKTTLVM
SPMWALTGQS DLRIDGAGSG KSLTKSDVVN DSSKFGRVRL EGFLNGGTPF WVERKVNRTK
LVSLKYAIDG EEKTMAESKL TQQGLNDDLG ADVIANTTFH GQHTVGALLD ANDASLKAAL
GKLVEADTWT KAKDISRKRV TEARGNVNAI AAEVKAREEY IARTRLRRDQ AHVESEKWEF
EYRRRVSELE TSSSAVSTTF TKYLARTNHF LQRLSRASEA LEVTSSHAER VLDSSRNDSD
DAARRFEVKE SEFEHKTSVI EGEIERLNGV VREWQSKEAS AGAIGRQSHA AVGMFAGAGG
THSHPDGVGT CDRCLQAIDP THHKETLIKL KDEARKTAIE HGNAIEQLKR ATAALNKAMD
DRRNLTEAAA LARKAERMRA ETTATASANA TDQLRHSQRS LSLIGNAVAR AEMLLNSAPE
DVVHEALTMS IDDAFAMSES TISISGASSN VLAGLDPDLT RGSMSSASLS NASPQIQLLR
DEQMNVADAD FVRQLVKGAE DAIMDGERAG RDAARRLQEL NEFSRSARSN PHTSALEELD
AQLAGESESL GARIESLNGA KELLGVAQAA DTAFSTKGIQ SYLFEGALGD LSARVGQYMD
ALTGGALTLE LRPAGAAFTG DDNDVVQSDV DADETATRKT KSKSAKAPAS ASAAERIERV
IHARRPDGSL IARSLRQLSG GERRRAALAL ALAYADLASE RCGVACDALV LDEVLQHLDA
EGIARVTSLL RALPKRTVLL TSQADSATAH LFDVVDKVVK SDFGSGVVVS AGDDADLVDL
AARASA