Gene OSTLU_28121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_28121 
Symbol 
ID5006062 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009370 
Strand
Start bp261784 
End bp264638 
Gene Length2855 bp 
Protein Length897 aa 
Translation table 
GC content60% 
IMG OID640421483 
Productpredicted protein 
Protein accessionXP_001422022 
Protein GI145355547 
COG category[L] Replication, recombination and repair 
COG ID[COG0188] Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), A subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.0542452 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0190346 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CGCGGTGCAT GTTGACGCGG AGCGCGCGGT CGTTCGGCGC TTCGAGCGCG TCCGTCGTCG 
CGCGCGCGTG TCGAGCGATG GCGACGACCG CGCGCGACGT CGCGCGGGCG CGAGGCGCGA
CGACGCGCGA CGCGCGCGAG AGAAGTTGGC GAAGACGCGC GATGGGTTCA AATTTCAAAC
CGACGACGAC GCGAACGACG CGCGCGACGG CGCGACGCGC GACGCGAACG TACGCGACGG
CGACGGGAAG CGCGGAGGAG AAGATCATCG ACGTTGAACT GGCGAGCGAG GCGAAGACGT
CGTACCTGTC GTACGCGATG AGCGTGATCG TGGGGCGAGC GCTGCCGGAC GCGCGCGACG
GGCTGAAGCC GGTGCATCGA AGGATCCTGT ACGGCATGCA CGAGCTGGGA TTGCGGGCGG
ATAAACCGCA CCGAAAGTGC GCGAGAGTGG TGGGAGACGT GCTGGGGAAG TATCACCCGC
ACGGGGACGG ATCGGTGTAC GAGGCGCTGG TGCGGTTGGC GCAAGATTTT TCGATGTCGG
CGCCGTTGGT GGACGGACAC GGGAACTTTG GGTCGTTGGA CGACGATCCG CCGGCGGCGA
TGCGTTACAC GGAGTGCCGA TTGAATAAGT TGGCGGAGAA GGGGTTGTTG GCGGACATCG
GGAACGAGTG CGTGAATTTC ACGGAGACGT TTGACGGGAG TCAAACGGAG CCGGAGGTGC
TGCCGGCGCG GGTGCCGAAT CTTTTGATCA ACGGCTCGAG TGGGATCGCG GTGGCGGTGG
CGACGAACAT GGCGCCGCAT AATCTCGGTG AGTCTGTCGA TGCGCTGTGC GCGCTGGCGA
AAAACCCGGA TTGCTCGTTG GACGAACTCA TGGCGTTGCT GCCGGCGCCG GATTTCCCCA
CGGGCGGGGT GGTGACGAAT AAGAGCGGGA TGAAGGAAAT TTACGAAACC GGCAAGGGCG
GGGTGACGCT TCGCGGGCGG GCGACGATCG AGCGCGTGTC GGCGGCGCGC GGTTCGCTGG
ATAAGGACGC GGTGGTGATC AGCGAGATTC CTTACCAAAC CAACAAGGCG AGGTTGGTGG
AACAAATCGC CGACCACGTC AACGGGCGCA CCATCGACGG CATCAGTGAT ATTCGCGACG
AAAGCGATCG CGATGGCATG CGCGTCGTGA TCGAGATTAA GCGTGGATAT GATGCAGCGA
GCGTGCTGGA GGAGCTTTAC GCCAAGACGA AGCTCGAAGT GAAGTTTTTT GTGAACAACG
TCGCGCTCAT AGACAACAAG CCGACGGTGA TGCCTCTTCG CCAGATTCTC GACGAGTTCA
TCAAGTTTCG CGTCGATACG ATCGAGCGAC GGACGAGATT TATGCTCTCA AAGGCGCAAG
ATCGCAAGCA TCTCGTTGAA GGCTTCTCGA TCGTGCTCGC CGACGCGGAT GGAGTGGTGA
AGATTATCCG AAAATCGAAA GACGGCCCGT CGGCGTCGAA AAAGTTGCGC GAATCGCACG
GTTTGTCCGA CATCCAAGCC GACTCGATTC TCGCCATGCC GCTTCGTCGA TTAACCGGAC
TCGAGGCGGA TAAGTTAGAC GCCGAGCTCA AGGAGTTGAA CGAGCAGATC GCACACTTCC
AAGGTTTGTT GAGCAACAAG TCAAAGGTCA TCGACGTCCT CGTGCAAGAG GCGATGGAGG
CGAAAGAGGC ATTCGCGCGT CCTCGACGTA CGTCCGTCGA GCAAATCGAA TCTTTAAGCG
GCGTCGAAGA CGATTCGCCA CCAAAGGATA ATATTTTGAC CCTCTCTGAG CGTGGATACG
TCAAGCGCAT CTGCCCGAAG AACTTTGGCG CGCAGAATCG AGGCACTCGA GGGAAGCGCA
TGAGCAAGCT CCGCGCCAAC GATGAGCTTT CCAAAGCCAT GCACTGCAAA GACAGCGATC
AAATATTATT CTTCTCCGAT CGAGGGCGAG TCCAAAAGCT CAGTGCGAAG GCGATTCCGC
AATCCGAACT GAACACGATA GGAGTTCCAG CGACCAGTCT GTTGAACACG TTCGCAAAGC
GCAACCAAAA CGTCACCGCC ATGTTGTCGA CGAACATGAA ACAGGGTGAA GTAGCGGACG
ACCAAGTGGT GGTGATGTTA ACAAGCCAAG GTAAGGTATC CGTGGCGTCC GCGGCGTCCA
TGCTCGGGCA CAAGGGTAAG AAGGTGATCA CGCTCGACAA GGGCGATAGG TTGCAGCAAG
TGATGTTCGC GCGCACGTCC GACCATCTCT TCATCACAGG CACCGGGAAA GCTGGAAAAG
GGCTCATCCT TCACTGCCGT GTCGGAGACT TCCGAGTCGT GAAGTCGGCG TGCAGGCCAA
TCTCGGGCAT CAAAACCATG GGCGAGAAGA AGGTGGCAGA AGCCGTCGTC GAAGACGCGG
GCGACGACGA AGACGACGAC GAAGACGACG GCGAACTTTT GCCGCCGAAA ACTGTCGGTA
TGGCTATCGT CCCAGGTGAA CGCATGGTGT CCGCGAGCGA AGAATTCGGT CCGTTTATCT
TATTTACGAC GAAGAAAGGT AAAGGCAAAG TGGTCGCCGC GAACTCGTAC CGCCTGCTCG
GCCGCGGTCG CTCGGGCGTC ATGTGCATGA AATTCAAAAA GGGTGACGAC GACGCCCTGG
CCACCATCAC TCTCGTCGAC CGCATCGGCG ACGACGTCAC GGATGAAGTA TTGCTCTCCA
CCACGGGCGG AATCTCCAAC CGCATCGCCG TCAACGATTT ACCCAAGCGC TCGGATCCCT
TGGCTCTGGG CGCCGCCATC ATCAAGCTCG ACGCCACGGA CGCCCTGAAA TCCGCCAATT
TACTCCCGAG CGAAGTCGCG AGCGAGCTCG CGTGA
 
Protein sequence
MGSNFKPTTT RTTRATARRA TRTYATATGS AEEKIIDVEL ASEAKTSYLS YAMSVIVGRA 
LPDARDGLKP VHRRILYGMH ELGLRADKPH RKCARVVGDV LGKYHPHGDG SVYEALVRLA
QDFSMSAPLV DGHGNFGSLD DDPPAAMRYT ECRLNKLAEK GLLADIGNEC VNFTETFDGS
QTEPEVLPAR VPNLLINGSS GIAVAVATNM APHNLGESVD ALCALAKNPD CSLDELMALL
PAPDFPTGGV VTNKSGMKEI YETGKGGVTL RGRATIERVS AARGSLDKDA VVISEIPYQT
NKARLVEQIA DHVNGRTIDG ISDIRDESDR DGMRVVIEIK RGYDAASVLE ELYAKTKLEV
KFFVNNVALI DNKPTVMPLR QILDEFIKFR VDTIERRTRF MLSKAQDRKH LVEGFSIVLA
DADGVVKIIR KSKDGPSASK KLRESHGLSD IQADSILAMP LRRLTGLEAD KLDAELKELN
EQIAHFQGLL SNKSKVIDVL VQEAMEAKEA FARPRRTSVE QIESLSGVED DSPPKDNILT
LSERGYVKRI CPKNFGAQNR GTRGKRMSKL RANDELSKAM HCKDSDQILF FSDRGRVQKL
SAKAIPQSEL NTIGVPATSL LNTFAKRNQN VTAMLSTNMK QGEVADDQVV VMLTSQGKVS
VASAASMLGH KGKKVITLDK GDRLQQVMFA RTSDHLFITG TGKAGKGLIL HCRVGDFRVV
KSACRPISGI KTMGEKKVAE AVVEDAGDDE DDDEDDGELL PPKTVGMAIV PGERMVSASE
EFGPFILFTT KKGKGKVVAA NSYRLLGRGR SGVMCMKFKK GDDDALATIT LVDRIGDDVT
DEVLLSTTGG ISNRIAVNDL PKRSDPLALG AAIIKLDATD ALKSANLLPS EVASELA