Gene OSTLU_42373 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_42373 
Symbol 
ID5003467 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp373113 
End bp374282 
Gene Length1170 bp 
Protein Length389 aa 
Translation table 
GC content56% 
IMG OID640418888 
Productpredicted protein 
Protein accessionXP_001419144 
Protein GI145349446 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.159065 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAATCA AGGGCCTGAC GGCGCTGATG CGAGACAACG CCCCCGGGGC GATCAAGGAG 
CAAAAGTTCG AGTCCTACCT CGACCGGCGC GTCGCGATCG ACGCGTCGAT GCACATTTAT
CAATTCATGA TGGTGGTGGG GAGACAGGGC GAACAACAGC TGACGAATGA GGCGGGAGAG
GTGACGTCGC ACTTGCAGGG GATGTTGAAT CGAACGTGCC GAATGCTCGA GGCGGGAATA
AAGCCGATTT ACGTGTTCGA TGGGAAGCCG CCGGTGATGA AGGGAGGAGA GCTGGCGAAG
CGCAAGGACA AGCGAGAAGA GGCGGAGGCG GCGTTGAAGG CGGCGAGAGA GGCGGGAAAT
CAAGAAGAGG TGGAGAAACT GTCCAAAAGA ACGGTGCGAG TGAGCAAGCA ACACAGTCAA
GAGGTGATGA AACTCGCGTC GTTGCTCGGA GTGCCCGTGT TCGAGGCGCC GTGCGAAGCC
GAGGCGTCGT GCGCGGCGAT GTGCAAGGCG GGACTGGTGT GGGCGGTGGC GACGGAGGAT
ATGGATACAC TCACGTTCGC CGCGCCGCGG TTGGCAAGAA ATTTGATGGC ACCCAAGTCT
CAGGACAAGC CGGTGCTGGA GTTTGACTAC GACAAAGTTC TAGCCGGTCT CGGGCTCACG
CCCGAGCAAT TCATCGACAT GTGCATCTTG TGCGGGTGCG ACTATTGCGA CACCATTCGC
GGGATCGGTC CGAAGACGGC GTTGAAGCTT ATCAAAGAAC ACGGTTCCAT CGAAAAGATT
CTCGAAGAGA TCGACACTGA GAAGTATCCT CCGCCTCAGG ATTGGGATTT TGCCGGCGCT
CGTGAGTTGT TCAAAAATCC CGAAGTCATG GACACGACGG GCATCGCATT GAGTTGGAAG
GCGCCAAACG AGGAAGGATT GATTGACTTT TTGGTCAAGG AAAAGCAATT TAACGAGGAA
CGCGTGCGCG CCGTGTGCGC CAAAGTCAAG AAGGCGCGCC AAGGTAAAGC GTCGCAAAAC
CGCCTCGAGA GCTTCTTCGG CCCGCCGACC ATAATCTCCA GTACCATCGG CAAGCGCAAG
GTTGAAGAAA AGAAGGGTAA AAACGGCAAG GCTGGTCTCG CGAACAAAAA GTCTAAAGGC
GTCAGTGGCT TCAGACGATC GAAGAACTGA
 
Protein sequence
MGIKGLTALM RDNAPGAIKE QKFESYLDRR VAIDASMHIY QFMMVVGRQG EQQLTNEAGE 
VTSHLQGMLN RTCRMLEAGI KPIYVFDGKP PVMKGGELAK RKDKREEAEA ALKAAREAGN
QEEVEKLSKR TVRVSKQHSQ EVMKLASLLG VPVFEAPCEA EASCAAMCKA GLVWAVATED
MDTLTFAAPR LARNLMAPKS QDKPVLEFDY DKVLAGLGLT PEQFIDMCIL CGCDYCDTIR
GIGPKTALKL IKEHGSIEKI LEEIDTEKYP PPQDWDFAGA RELFKNPEVM DTTGIALSWK
APNEEGLIDF LVKEKQFNEE RVRAVCAKVK KARQGKASQN RLESFFGPPT IISSTIGKRK
VEEKKGKNGK AGLANKKSKG VSGFRRSKN