Gene OSTLU_41359 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_41359 
Symbol 
ID5002409 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009360 
Strand
Start bp361938 
End bp364538 
Gene Length2601 bp 
Protein Length840 aa 
Translation table 
GC content59% 
IMG OID640417830 
Productpredicted protein 
Protein accessionXP_001418460 
Protein GI145348029 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0907147 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGGACGG CGCGCGCGCG CGCGGGCGGC GTGCGACGGC TCGTCGCGAG CGTCGGCGCG 
CGCGCGACGC GCGGGATCGC GTGGGGAGGA CGCGAGGGTG GATGGGGCGA GACGACGAAG
ACGGCGGCGA CGGCGCGCGC GGTCGCGAGC GGCGCGACGA CGACGACGAC GACGACGACG
ACGACGACGA CGAACGGGAA ACGGACGCTC ATGATCGTCG AGTCTCCGGC CAAGGCGAAG
ACGATCGAGA AGTATTTGGG TGCGAACGCG AAAGTGCTGG CGAGCTACGG GCACGTGCGT
GATCTGGTGA ATAAACAGGG AAGCGTGCGG CCGGAGGAAT CGTTCGCGAT GACGTGGACG
TCGACGTCGA GGCAAAAGGC GGCGATGCGA GACATCACGG AGGCGTTGAA GAAGACGGAT
GTGCTGCTGC TCGCGACGGA TCCAGACCGA GAGGGTGAGG CTATCTCGTG GCACTTGCTC
GAGGTGCTTC GGGAGAAGAA ATTGTTGCGC GACGACTTGG ACGTCAAGCG CGTGACGTTT
GGGGAGATTA CGAAAACAGC GGTGCTCGAC GCGGTGGGTT CTCCGCGGGA CATTAACGTT
CCGATGGTTG ATGCGTACAT GGCGCGACGC GCGCTGGATT ATCTATTCGG ATTCACGCTC
TCTGGGCTAC TGTGGCGTAA GTTGCCGTCG AGCGTGAGCC TGTCCGCGGG TCGCGTGCAG
AGCGTGGCGC TGCGATTAGT GTGCGAAAGA GAGGAAGAGG TGGAGGCGTT CGTGAGCGAG
CCGTATTGGA CGGTGAAGGC GAGTTTAAGT TCAAAAGACG GCTCTACGTT TGACGCCGCT
TTGACGCACG TCGATGGCGC TAAGCTTGGA AAGTTTACCA TCGCTTCGAA CGACGAAGCC
GAGGCGTACG CCAAGCGCGT GCGAGAATGC GATACGCTGC GGGTGGTGGA GGTGAAGCAC
AGCGAAGCCA AGCGTACGAG TGGACCACCG TTCACGACGT CGACGATGCA GCAGGAAGCG
AACAAGCAGC TCGGTTTCGG TGCGTCGAGG ACGATGTCGG CGGCGCAACG ACTGTACGAG
GGCGCGGGCA CGGGCGAAGG TTTGATTACG TACATGCGCA CAGATGGCAC GTACGTCGCC
CCACATGCGA TTGAAAGCTT GCGTACGACC GCTGGTGAGC TTTTTGGTGA TGAGTACGTG
CCTGAATCGC CCAGATACTT TAAGAAGAAG CAAAAGAATG CGCAAGAGGC GCACGAAGCG
ATTCGACCTA CCAAGGCTGG ACGGCTTCCC GCGCAAGTTG CGCGGCAAAT AGGACCAGGC
AGCGATGAAG CGAGACTTTA TGCGTTGATT TGGGCGCGCA CTATGGCGTC ACAAATGTCA
CCTGCGCTTA CCGATCGCAT AGCGGCTGAA ATTGCCTCCG CAGAGAACGA CTTAAAGCTC
AAGGCGAACG GACACAGGCT CAAGTTTCCG GGATTCCTTG CGGCGTATCG AACGTCCAGA
CCCGAAGCTG CGCCACCTTC AGACGACGCG TGGTTACCTA TTCTTGGTGA AGGCGAAGGT
TTGTCTGTGA AAACTGACGG TGAGAGTCTC GGGTGCGAGG CGACAGAGCA CAAAACGTCG
CCGCCGCCTC GCTACACGGA TGGTTCCATC GTCAAGGCGC TCGAAGAACG AGGCATCGGG
CGACCGAGCA CGTACGCACC GATTTTAAAG GTGCTCGCGC AGCGTGAGTA CGTCGCAAAG
CAAGGTGCCG CACTCGTTCC CACGACTCGC GGTCGTCTAG TCTCTGCGTT TCTCACGAAT
TACTTTGAGA CGTACGTCGA TTACGGCTTC ACGGCGGATT TGGAACACAA GCTTGACGAC
ATCACGAGCG AACAAGTACA ATGGAAACCG TTACTCGAGG AGTGGTGGAC GCCGTTTAGG
GATAAAATAT CGTCGCTTTC GGAGCTGCGC GTGAGCGAAG TCATCGACGC TCTCGATGAG
AAGCTCGGGC AGCATCTCTT CGGCGAGGCG ACGTATGACG GCCACGTGCA CATCAACCCG
GATCAAGTCG CCGAATTAAG CGATTCCGAC GACGTCGTGA GCGAAGCACG GAGATGCCCG
AGCTGTAAAA TTGGCAGACT TGGTTTAAAG CCATCAAAAG CTGGTGGATT CATTGGGTGC
AGTCGCTACC CTGAATGCGG CTTCACGCAC CCGTTGCATC CGATTCGAGG CGCCATCGTG
AGCGACACGG ATGATCCAGA TTTCGTCGCC ACCGTCGAAG ATCCCGACGC AGTGATGTAT
CCCAAGGTGC TTGGCGTTGA TCCAGCGACC GGAAAAGAAA TTTCTCTCCG ACTTGGTCCT
TATGGCCCAT ATTTACAACT CGGCGTTCAA GAGCTCGCGG CGGCGACGCC GGCAGAGGAT
GGGAAGAAGG CAAAGAAACC GAAGAAACCG CCTGCACCTC GTCGCGTCGG AGTGGCAAAC
ATCGGCAAGG ACGTCAACAA GATCACTCTC GCGGAAGCCA TCGGGATGTT TGAATACCCC
AAAGTGCTTG GTGTGCATCC GGTGACGCAG GCGCCCGTTT CGCTCAACAT TGGCCCATTT
GGTTGGTACG TCGCGTCCGA A
 
Protein sequence
MWTARARAGG VRRLVASVGA RATRGIAWGG REGGWGETTK TAATARAVAS GATTTTTTTT 
TTTTNGKRTL MIVESPAKAK TIEKYLGANA KVLASYGHVR DLVNKQGSVR PEESFAMTWT
STSRQKAAMR DITEALKKTD VLLLATDPDR EGEAISWHLL EVLREKKLLR DDLDVKRVTF
GEITKTAVLD AVGSPRDINV PMVDAYMARR ALDYLFGFTL SGLLWRKLPS SVSLSAGRVQ
SVALRLVCER EEEVEAFVSE PYWTVKASLS SKDGSTFDAA LTHVDGAKLG KFTIASNDEA
EAYAKRVREC DTLRVVEVKH SEAKRTSGPP FTTSTMQQEA NKQLGFGASR TMSAAQRLYE
GAGTGEGLIT YMRTDGTYVA PHAIESLRTT AGELFGDEYV PESPRYFKKK QKNAQEAHEA
IRPTKAGRLP AQVARQIGPG SDEARLYALI WARTMASQMS PALTDRIAAE IASAENDLKL
KANGHRLKFP GFLAAYRTSR PEAAPPSDDA WLPILGEGEG LSVKTDGESL GCEATEHKTS
PPPRYTDGSI VKALEERGIG RPSTYAPILK VLAQREYVAK QGAALVPTTR GRLVSAFLTN
YFETYVDYGF TADLEHKLDD ITSEQVQWKP LLEEWWTPFR DKISSLSELR VSEVIDALDE
KLGQHLFGEA TDSDDVVSEA RRCPSCKIGR LGLKPSKAGG FIGCSRYPEC GFTHPLHPIR
GAIVSDTDDP DFVATVEDPD AVMYPKVLGV DPATGKEISL RLGPYGPYLQ LGDGKKAKKP
KKPPAPRRVG VANIGKDVNK ITLAEAIGMF EYPKVLGVHP VTQAPVSLNI GPFGWYVASE