Gene OSTLU_41525 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_41525 
Symbol 
ID5005146 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009367 
Strand
Start bp227129 
End bp228529 
Gene Length1401 bp 
Protein Length466 aa 
Translation table 
GC content56% 
IMG OID640420567 
Productpredicted protein 
Protein accessionXP_001421091 
Protein GI145353588 
COG category[L] Replication, recombination and repair 
COG ID[COG1112] Superfamily I DNA and RNA helicases and helicase subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.0000217661 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0334974 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGTTGA AGGAAGTGCG AGAGCCGTTG AACCCAAGCC AGCGCGTCGC CGTGAAAAGC 
GCGCTAAGCT CTTCGCTCGC GGTTTGGCAA GGTCCGCCCG GTACGGGTAA GACTCGCACG
CTCATAGCGT ACATCGGTGC TGCCGTACAC CTGGCGTCCA TCCAAAAGAG GCGAGGAAGG
GGTCCGATCG TTCTCGCTTC CGCTGCGTCG AACGTAGCCG TGGATAATAT CCTCGAAGGA
CTGGCAAAAG AATCTTTCAT CGTCGATGGA CGACCGCTGC GAGTCGTGCG CGTGGGAGCG
CCCGCAAAGG TGCAACCTTG GCTTCAGCAA CTCACGCTGG ATGCTCAAAT CGCGTTGCAC
CCTCTCGGGC GTCAAGCGGC GGCCATGCGT GAAGCTATCC GAGGACAATC TGGTCCAGCG
TTTGCTCGCC AGCGCAAGCA AGCGACGCAG TTGGAACTGA CGGCTGCGAA GAGCATATTA
AAGTCTGTGG ATGTCGTGTG CACCACATGC GTCGGCGCAG GCGACGAGTT ACTGGAGGAC
TTCACGTTCC CAGTGGCTGT CGTGGATGAG GCGACACAAT GCACCGAACC AGGAGCGTTA
ATCTCTCTCA CGAAAGCCTT GAGCGCCGTG CTCGTGGGTG ATTCCAAGCA ATTGCCTCCC
ACGGTGGTGT CTCGTGACGC CGTCGACGCT GGCTTACAAG TTTCAATCTT TGAGCGCATG
GAGAGGCTCG GGGTGAAGGT GTCTTTGCTA GACATGCAGT ACCGCATGCA TCCGCAAATC
GCCGAATTTC CGTCTCTGGC GTTTTACAAA GGGAAAGTAG GATCGGTACC GACGCCGCAA
GATCGTCCGT TGGTGCCGGG TATCGCTTGG CCGTCGCCGA ACGTTCCAGT AGCCTTCGTA
GAAATCTCCG CCCCTGAATC GCGAGCACCC GATGGAAACA GTCTGTATAA CGTCGGAGAA
GCGAAGATGG CCATCGGTGT GGTGAGAAAA CTTCTCGCGG CGGGCGATTT AGCGGGACCC
GGGGACATCG GCGTCATCTC GCCGTACGCC GCGCAAGTTC GACGGTTGCA AGAAGAATAC
GGCGTAGGAG GAAGTCCGAA ACGAAATTAC TTGGACTACA CCGAAGAGGA TAAAATAGAG
GAGCTCGAAA TACGTTCCGT TGATGGATTC CAAGGCAGAG AAAAGGAAGT GATCGTTTTG
TGCACCGTGC GAAGCAACCC GTCTGGAGAC ATCGGCTTCG TCGCCGACCC GCGCCGGCTC
AACGTGGGAA TCACGCGAGC GAAACGTGGA TTGATCGTCC TCGGAAATCG CAAAACTTTG
TCAAACAATG AGATGTGGCG AAGTTGGTTT AAGTGGATCG ACGAACAAAA CTGTGCGGTT
TCCGACACTA CAAATTTCTA G
 
Protein sequence
MALKEVREPL NPSQRVAVKS ALSSSLAVWQ GPPGTGKTRT LIAYIGAAVH LASIQKRRGR 
GPIVLASAAS NVAVDNILEG LAKESFIVDG RPLRVVRVGA PAKVQPWLQQ LTLDAQIALH
PLGRQAAAMR EAIRGQSGPA FARQRKQATQ LELTAAKSIL KSVDVVCTTC VGAGDELLED
FTFPVAVVDE ATQCTEPGAL ISLTKALSAV LVGDSKQLPP TVVSRDAVDA GLQVSIFERM
ERLGVKVSLL DMQYRMHPQI AEFPSLAFYK GKVGSVPTPQ DRPLVPGIAW PSPNVPVAFV
EISAPESRAP DGNSLYNVGE AKMAIGVVRK LLAAGDLAGP GDIGVISPYA AQVRRLQEEY
GVGGSPKRNY LDYTEEDKIE ELEIRSVDGF QGREKEVIVL CTVRSNPSGD IGFVADPRRL
NVGITRAKRG LIVLGNRKTL SNNEMWRSWF KWIDEQNCAV SDTTNF