Gene OSTLU_33842 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_33842 
Symbol 
ID5000903 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp660702 
End bp662624 
Gene Length1923 bp 
Protein Length641 aa 
Translation table 
GC content56% 
IMG OID640416324 
Productpredicted protein 
Protein accessionXP_001417013 
Protein GI145345003 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCGCC AGAGCGCGCG CGGAGCGATC GGCGCGCGCG TGTTGAACGT GGCTGAGAAA 
CCATCGGTGG CGAAAGAGAT CTCGCGCGTG CTGTCGAACG GACGCGCGAG CGCGCGCGAA
GGGACGTCGA GGTATAATAA AGTCTGGGAA TTCCCGTACG AAGTGCGCGG TCGGCGGGTG
ACGATGGTGT TCACGTCGGT GACGGGGCAC TTGAGTAATT TTGAATTCGC CGACGACAGG
CACAGACGGT GGAACGGCGT CGATCCGCGA GAGCTGCTCG TGAACGCGGC GGTGGCGAAG
CGGGTGCCGG AGGATAAGAG ACAGGTGGCT GATAATGTGA AACGAGAGGC GCGAGGGTGC
GATTCGGTGA TTTTATGGTT GGATTGCGAT CGCGAGGGAG AGAATATCGC GTTTGAGGTG
CTCGCGGCGT GTCGAGAAGC GAATCGGGGC ATCGCTGCGT TTCGAGCGCG GTTTTCGGCG
CTGAGTCGGG GTGATGCCGA CCGAGCGTTG ACGAATCTCG TGGAACCGAA CCAACACGAG
TCCAAGGCGG TAGATATGCG CATGGAGTTG GATTTGAGAC TCGGCGCGGC GTTTACGCGA
TTCAACACGT TGGCGTTGCA GCGCGCAGGC GTGGGGCTGC CGGTAGACGA TAAAGGCAAA
TCGATCGTGT CGTACGGGCC GTGTCAATTT CCCACGCTCG GTTTCATCGT GCAAAGGAAG
TGGGACATTG ATGCGCACGT GAGCGAAGAT TTCTGGGCCA TCAAGTGTTC GCATTCTCGC
GAAGGCACAA CGACGCAGTT TGAATGGAGC CGCGGAAGGT TGTTCGATCG TGCTTTCGCC
TCTGCGTTGC ATGACCTGTG CGTGCGGGCC AACTCTGCCA CTGTCATCGA CGTGGACGGG
CAAGAAAGCA AACGTTGGCC ACCGCATCCC TTGAACACCA TTGAGATGCA AAAGCGCTTG
AATAGAGTGT TGCGCATTTC CCCCGAGCAA ATTATGAAGA TTGCTGAGGA CTTGTACAAC
GATGGTTTCA TCTCTTATCC GCGCACGGAG ACAGACAAAT TTCCAAATGA TTTCGATTAC
GATGGAACGC TGCGCGAGAT GCATCAGCAC CCGCAGTTTG GTTTTTACGT CGAGCGATTG
ACGACAGGCG GGCAGTTTCG ACGACCTCCG GGCGGCACCA AGGATGACAA AGCGCATCCC
CCCATCTACC CAACCAAGCT CGCCACGGAT GCACAGTACG CTCAAATGCG CAACAAAAAC
CACAACGCCC CTAAAGTGTA CGAGTTCGTC TGCAGGCACT TTTTGGCCAC ATGTTCATAT
CCAGCTGTGG CGTTGAAGAC GCACGTGGAT GTCGACATCG CCGGCGAAAC CTTCCGTGCG
ACAGGCGTGA TGATTCGTGA ACGTAACTAT TTGGATATCT ACGGCCCTGG TCCTCCTGAA
GGTCCACGGT TAGCGCCGAC TTACGATAAC TGGGGCAATA GCACGCTGCC TGTGTACACC
CCAGGAGAAC AATTTGTCCC AACTCTGAAT TTACACGAAG GATCGACGAG ACCTCCAGAT
TATCTCAGCG AGGTGGATTT GCTTTCGCTC ATGGAGTCAC ACATGATCGG AACAGACGCT
ACGCAGGCGC AGCACATAGA AAAAGTAGTT GGCGAACGAG GATACGCTCG AAAAGTGGGC
GATAACAGAT TGATGCCGAC AGAGCTAGGG GAAGCGCTGG TTCTCGCCTA CGATCGAATG
GGAGTCGCGG ACATGTGGCT GCCGACGAAA CGAGCAAAGA TGGAAGCGGA TGTAGACGCT
GTCGCACACA ACCGGATGGA TCCCAACGCA GGTTTGCGAC TGCATTTACA AACTATGCTG
CAAGCTTACG ACCGTGTCGC GACTGATGAA AACATGTTGA CAAACACCGT CGGATCGTAC
ATG
 
Protein sequence
MQRQSARGAI GARVLNVAEK PSVAKEISRV LSNGRASARE GTSRYNKVWE FPYEVRGRRV 
TMVFTSVTGH LSNFEFADDR HRRWNGVDPR ELLVNAAVAK RVPEDKRQVA DNVKREARGC
DSVILWLDCD REGENIAFEV LAACREANRG IAAFRARFSA LSRGDADRAL TNLVEPNQHE
SKAVDMRMEL DLRLGAAFTR FNTLALQRAG VGLPVDDKGK SIVSYGPCQF PTLGFIVQRK
WDIDAHVSED FWAIKCSHSR EGTTTQFEWS RGRLFDRAFA SALHDLCVRA NSATVIDVDG
QESKRWPPHP LNTIEMQKRL NRVLRISPEQ IMKIAEDLYN DGFISYPRTE TDKFPNDFDY
DGTLREMHQH PQFGFYVERL TTGGQFRRPP GGTKDDKAHP PIYPTKLATD AQYAQMRNKN
HNAPKVYEFV CRHFLATCSY PAVALKTHVD VDIAGETFRA TGVMIRERNY LDIYGPGPPE
GPRLAPTYDN WGNSTLPVYT PGEQFVPTLN LHEGSTRPPD YLSEVDLLSL MESHMIGTDA
TQAQHIEKVV GERGYARKVG DNRLMPTELG EALVLAYDRM GVADMWLPTK RAKMEADVDA
VAHNRMDPNA GLRLHLQTML QAYDRVATDE NMLTNTVGSY M