Gene OSTLU_33043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_33043 
Symbol 
ID5003083 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp299180 
End bp300753 
Gene Length1574 bp 
Protein Length484 aa 
Translation table 
GC content61% 
IMG OID640418504 
Productpredicted protein 
Protein accessionXP_001419337 
Protein GI145349845 
COG category[L] Replication, recombination and repair 
COG ID[COG0847] DNA polymerase III, epsilon subunit and related 3'-5' exonucleases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.804843 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.549748 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGACGA GCGCCGAGGT GCGGCGCGCG CGCGCGTCGC GTCGTTCGTC GCGCGTGGAA 
GATGGCGCGC GTCGTCGCGA GGCGCGCTGT GGACTGACGC GGTCACGTCT CGAACAGGAA
TTCGCCGCGC TCGTGCGCGT GGTCAAGGAT GCGCAGCGTC GATTCACCGC CGCGCCGAGC
GAACACCCGG GGTGCGACTT CAAGGCGTGG CTCAAGGCGA ACGTGAAGCG ACCGGAGAAC
CAACGAGGCG ATCCGAGCCG GCACACGGCG GAGACGCTGA GAAAGTTTGT GGAGACGCTC
GGGACGGCGC TCGGGACGGC GCGCGTGACG GCGCTGACGA AACACGATCA ATGGCGGAAG
AAACGCGCGG AGTGGGACGA GTGGGGCGAC GACGAGGTCG TCGATGCGAA GGGTGATCGA
GCGTGGGCGC TGGTGCGACG AACGCGAAAG CATGAGGGGT TCGCGGAGGC GTATTATCGG
TTGCCGTCGC ACGAACCGCC GTGGCGACGG ACAAAGTATC GACCCGTGCG TTGGTGCGTG
GAGAAAGAGA TGAAGCCGAG GTTGTTGGGG GTGGATTGTG AGATGTGTGA GACGGATGAC
GACACGCGAG CGCTCGTCGG GGTGTCCGTG GTGGATGATG AGGGAAATAT TTTGTTGAAG
ACGCTCGTGA AGCCGCCGGG GAACATCGTC GACATGAGAA CGGAGATTAC TGGGTTGAAG
GCGGAGAACG TCCTCGCGGC GCCGACGACG TTGAGCGACG TGCAGGATAG ACTCGTGGAG
TTGTGTAAAC CGGGAACTGT GCTCGTGGGT CATTCGTTGA TGCATGATTT GAAGTCTTTA
AAGATTGACC ATCAACCCGT CATCGACACT GGAATGTTGT TTCGTTACAA GAATCTCCCT
CGGTCGACGC CGAGCTTGGC GATTTTGTGC GAAACTTTGC TCAAACGAAA GATGAGACAA
ACTGAGGCGG GCTATCACGA TTCCGTCGAG GACGCCAAAG CGGCATTAGA CTTAGTGTTA
TGGGCGGTTC GCGAGGCGAA ACCCATCTTT GAGGTGGACG CGCCGCCGCA CAAAGTGGAC
GCCGAGGACC TGTGCAAGCT TTTTATCCAT CGCATTCCGC GTGGGACGAG CGCGGAGGCG
TTGAAGATGG TGTTCGAAGA AACCGATCGA GCGCACATCG AGAGCGTGCA GGGAAGCTTT
CTCGATGCGA CGACGACGGA CTCCGCGGCT CTCGGCGGCA AAAAAACCAC CTCCTGCCTC
GTCACGTTTA CCGACACCAA ACGCGCCAAC GACGCGTTCG AACGTCTCGA CGGTGCGGTA
ACGAAAGATG CAATCGGTCG CGCGCAGAAG TCTCGCGCCT TACCGCTCGA TTCCACCGAT
CGATCCGTCA GCGTCGTCGT CCGTCGGATG ACGTCGAGCG GTAGCGTCGT CTCCGCCGGC
GCCGCCGCAG GCGCCAAGCG CCCGGCGAAC AGCGTCCCGG CGACGGTTCC AAAGTCGAAA
AAAGTTCGTC AGCGCAAGCC TAAATCGATC GCTCTTCCCG GCGATAAATC GTAGCCCGGA
TTCGCGCGCG CGCG
 
Protein sequence
MPTSAEEFAA LVRVVKDAQR RFTAAPSEHP GCDFKAWLKA NVKRPENQRG DPSRHTAETL 
RKFVETLGTA LGTARVTALT KHDQWRKKRA EWDEWGDDEV VDAKGDRAWA LVRRTRKHEG
FAEAYYRLPS HEPPWRRTKY RPVRWCVEKE MKPRLLGVDC EMCETDDDTR ALVGVSVVDD
EGNILLKTLV KPPGNIVDMR TEITGLKAEN VLAAPTTLSD VQDRLVELCK PGTVLVGHSL
MHDLKSLKID HQPVIDTGML FRYKNLPRST PSLAILCETL LKRKMRQTEA GYHDSVEDAK
AALDLVLWAV REAKPIFEVD APPHKVDAED LCKLFIHRIP RGTSAEALKM VFEETDRAHI
ESVQGSFLDA TTTDSAALGG KKTTSCLVTF TDTKRANDAF ERLDGAVTKD AIGRAQKSRA
LPLDSTDRSV SVVVRRMTSS GSVVSAGAAA GAKRPANSVP ATVPKSKKVR QRKPKSIALP
GDKS