Gene OSTLU_35638 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_35638 
Symbol 
ID5002876 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009361 
Strand
Start bp542907 
End bp544595 
Gene Length1689 bp 
Protein Length551 aa 
Translation table 
GC content54% 
IMG OID640418297 
Productpredicted protein 
Protein accessionXP_001418738 
Protein GI145348608 
COG category[L] Replication, recombination and repair 
COG ID[COG1111] ERCC4-like helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.0251063 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.525211 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCCGG AAGCGATTCA AACGTACGTC TATCCGGCGC AGATTTCGCG GCGTGATTAT 
CAGTTTGACA TGGCGCGTAA CGCCCTGCTG ACGAACTCAC TGGTATGCCT GCCCACGGGT
TTGGGCAAGA CGCTCATCGC GGCGGTGGTG ATGTACAACT ATTACCGATG GTTTCCCACG
GGGAAGATTA TTTTTATGGC GCCCACGCGA CCTTTAGTGG ATCAACAGAT GTCAGCGTGT
CACACCGTGG TGGGGATTCC CGCCTCTGAC ACCATCGTAT TGATGGGAAG CACGAAGAAG
GACGACAGCG GATCGAGACG AGACTTTTGG CGAGAGAAAC GTCTCTTCTT TTGCACCCCG
CACACGGTAG CGAACGATTT GGAAAGCGGC GATTTGGACG CCAAACAAAT CGTGTGCGTC
GTCGTCGATG AAGCGCACCA CGCGCGAGGA CAGTACGCTT CCGCGGAAGT GCTTCGTTTA
CTTCACGAAC GAAAAGTGAG ATTTCGTTTA TTGGCGCTCA CGGCGACGCC GGGGCAAGGA
TTGGACGAAG TTCAAAAAGT CGTCGAGACG CTGCGCATCG GACGTATCGA TTTCAGAAGC
GACCAAGACC CCGATGTTTC GCGTTACACG CACAAGCGCG AGATGACTGT GGAAAAGGTC
AAGCCCGATC AGGCGATGTC TCACGTGCAG GACATGCTGT GCGAGCTACT GCGCCCGTGC
TGCGCGCAAC TCGTCAACAT GGGCGCGCTC GGCGAAGCGG GCTTTCGAAT GTTGACGTTT
ATTAAAAATA AAGCCACAAA CGGGGCATCG CGCGTCGAAC CACCGGCGTG GTTCACGCTG
CAATCCGCAC AGCAGGCGAT ATATAGAAAT CAGCACCAAG TGCGCGCCAG AGGGCAAGCG
TTAGGCTTGC TCGAGACCGC GATGGAACTC AGTAAGGCGT ACGAGCTTTT GCTCAAGTAC
GGCGCCAAGA GCGCGTACGA TTACATCGAT AAGCGCGGTC GAGACAAGAG CAACACGCTC
GTTCATCGCA GCGACCCAGT ATCAGTAGAG TTGGTGGATT TGATTCGTTC GATGTCGTCA
AACGGCGCAC ATCACTCGCC AAAACTCGAC AGATTGACGT CGATTCTAAA GCAACACTTC
AGAGACGCTA CGGCGGATAC GCGAGTGATC ATTTTCACAT CGTACCGTGA AAGCGTCAAG
GACATCGTTC AAGCCCTTCG CGAGGTCCCC GCTGGCGAGG ACACCGCGTG CAAGATTAAA
GTTGCAGAGT TCGTCGGTCA GGGCGATACG GGCGCGACGG GTAAGAAACG GGCGCCTGGT
GCTACCTCGC GCGGTACTAA GGGACAGACG CAAAAAGAAC AAAAGCAGAC GCTGGACGAT
TTCAGAGCCG GAACGCTGAA CACACTCGTG GCGACATCCA TCGGAGAAGA GGGATTGGAC
ATTCCGAGCG TAGATTTGAT TTTTTTCTTT GACGTCGTTG ATACCATTCG TGCCATTCAA
CGAATGGGTC GAACTGGTCG AGCGCGCGAT GGCAAAGTCG TCATTCTCGC GACGGAGGGT
AAAGAATACG CCAAGTTTAC GAGCGAGCAG AAAAAGTACG AAACTTTGAT GACGTGCCTA
CGAATGCCAG AAACGCACTT TCGGCTCGAC AAAAAGTGTC CTCGCATCGT TCCAGATGGC
GTGACACCG
 
Protein sequence
MDPEAIQTYV YPAQISRRDY QFDMARNALL TNSLVCLPTG LGKTLIAAVV MYNYYRWFPT 
GKIIFMAPTR PLVDQQMSAC HTVVGIPASD TIVLMGSTKK DDSGSRRDFW REKRLFFCTP
HTVANDLESG DLDAKQIVCV VVDEAHHARG QYASAEVLRL LHERKVRFRL LALTATPGQG
LDEVQKVVET LRIGRIDFRS DQDPDVSRYT HKREMTVEKV KPDQAMSHVQ DMLCELLRPC
CAQLVNMGAL GEAGFRMLTF IKNKATNGAS RVEPPAWFTL QSAQQAIYRN QHQTAMELSK
AYELLLKYGA KSAYDYIDKR GRDKSNTLVH RSDPVSVELV DLIRSMSSNG AHHSPKLDRL
TSILKQHFRD ATADTRVIIF TSYRESVKDI VQALREVPAG EDTACKIKVA EFVGQGDTGA
TGKKRAPGAT SRGTKGQTQK EQKQTLDDFR AGTLNTLVAT SIGEEGLDIP SVDLIFFFDV
VDTIRAIQRM GRTGRARDGK VVILATEGKE YAKFTSEQKK YETLMTCLRM PETHFRLDKK
CPRIVPDGVT P