Gene OSTLU_36534 details


Gene Information

Locus tag: OSTLU_36534
Symbol: CHR3501
ID: 5006947
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009375
Strand:
Start bp: 64360
End bp: 67392
Gene Length: 3033 bp
Protein Length: 806 aa
Translation table:
GC content: 55%
IMG OID: 640422368
Product: predicted protein
Protein accession: XP_001422798
Protein GI: 145357178
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG0553] Superfamily II DNA/RNA helicases, SNF2 family
TIGRFAM ID:
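
The length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the 806 aa protein is followed by a single stop codon; the function names are illustrative and do not belong to any database API. Under those assumptions the coding sequence comes out shorter than the genomic span, which is consistent with the gene being marked as spliced.

```python
def genomic_span(start_bp: int, end_bp: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, stop_codons: int = 1) -> int:
    """Nucleotides needed to encode the protein plus its stop codon(s)."""
    return 3 * (protein_aa + stop_codons)


span = genomic_span(64360, 67392)   # values taken from the record
cds = coding_length(806)            # assumes exactly one stop codon
non_coding = span - cds             # bases inside the span that are not coding

print(span, cds, non_coding)        # prints: 3033 2421 612
```

The 612 bp difference is only an estimate of intronic sequence; it holds if the displayed gene sequence is the unspliced genomic span.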


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.00151979
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.0248069
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATCCGG AGTGTTATCG CGCGGAGGGA GTCACGAGGG AGGAGCTGGA GACGCGCGTG 
CACGGGTTGG ACGCGCTGAA GGATGGGGAT CGGGAGGTGT TGCTGGCGAC GGTGCTGTCG
CCGGATCGGC CGGAGATCGA GAACACGGCG TTGGACGTGA CGAGCGAGGA GTTTTTGCGT
CGGCCGGCGG TGGAACCGAT GGAGGCGCCG CGGGCGTTGA CGCGGCCGTT GTTGGGGTTT
CAGCGCGAGG GGTTGCGGTG GATGTGCGAT AACGAGAGCG GCGATGCGAA GGGGGGGATA
TTGGCGGATG AGATGGGGAT GGGGAAGACG ATACAGTGCA TATCGATGTT GCTGGCGAGG
AAGGAGGCGT GGATGCGAGA CCGCGCCGAG GTGGGGGAGA TGGTGACGGA CGACGACAGA
CCGCCGCCGA CGCTCGTGGT GGTGCCGACG TCGGCGCTCG TTCAGTGGGA AGAAGAGATC
AAATCGTGCG TCGAGGAAGG GTCGCTGCGC GTGTTTGTGT ATTACGCTGA TCGCGCAAAC
GTCGTGGAAG GAGACTTTAA AGGATACGAC GTCGTGTTGA CGACGTATCC CGTCGTCGAA
GCCGAGTGGC GGAAAATCAT CAACCGACAC TTGACGGCGT GTCAGTGGTG TGGGAAAAAG
TACTTACCTC GCTCCATGGT GACGCACTTG AAGTACTTCT GTGGACCAGA CGCCGTGCGC
ACGGAAAAAC TCGCGCGGCG CGAGGTGACG CGCGACGTGG CGAACGAAAA AGCCATGCGC
ACGCTGAAAA TCAAGCCGGG CAGCGCCAAG GACGTGAAGA AGGGGATTCC CACGATGGCG
AACGTATACA AGGAACTCAT GGCGATGGCC GGACGGGAGA CGCTGAGCAT GTATGATGGC
GCGCACAAGG CGCGCGCACG CGCGGCTTCA GGTCTCGCCC CGGGCGGCGA CGTCGTCGTC
GTCAAGGAGG AAGTTGAAGA CGGCGTCGCC GAGCCGAGCG AAGTTCTGAA AGCTTTGATT
TCGCAGCTTC CAGTGCCGAC GATTGTAGTT GAGAATATCA AAGAGGAGTC GATTGAGGAG
AAAGAAAAAG AGGTCGAGTC AGTGAACGAG CCCGCCTTGG CTGACGCGTC AACGGCGGCG
ATAGCGAGTA CGGTGAAGAA GGCGCAGAAG CGCAAGTCGA AGGCTTCGGG TAAAGCGACT
TCGACTTCGA GCGCAAAGAA AAAGAAGAAG AGTCTGCGCG AAGCTAGCGA CGGCGAGGCT
GAAAGTGATT ACAAACCGGA CAGTGATAGC GAAGATGATG AGATCATATT AGTCGACGAT
AGCGAGAGCG AAGATAGAAA GCCAAAGAAG AAACAGAAGA AAAAGAAAAC GCCGGCGAAA
ACCGAGGAAG CGGACGACGT GAAGGCGTCC AACATCGACG ACATTCCGCA AACCTCGCAA
GGTGGTTCGC AAGGTGGGAG CCAGTTTGAA GACGAAGACG ACGTAGATTT GTCGGATTCC
CTTCTTCATC GCACGCAGTG GCACCGAATC GTTCTCGACG AAGCGCACAA GATCAAGGCG
CGCACGAGCA ACACCGCCAA GTGTATCTAC GCTTTGAAAT CCACGTATAA GTGGTGTTTG
ACAGGTACAC CGTTGCAGAA TCGAATCGGC GATCTTTACA GCTTGGTGAG ATTTTTGCGT
ATGGATCCGT ACGCGTTTTA CTTTTGTTCG ACGAAGGGTT GCGAGTGCAA AACGCTCACC
TGGAACTTCG GTCCTCAGGC GCGATTTTGT ACCAACTGCG GATGCGGCGC TCCTAGGCAT
TATTCGCATT TCAATCGCAC CGTGCTAAAC CCGATCAACC GTTATGGCTA CATCGGTGAC
GGCAAGAAAG CGATGCTGAC TCTTAGAAAT GACATTTTGT TGCCGATGCA ACTTCGCCGG
ACCAAGGCGG AACGCGCCGA GGACGTGCGA CTGCCGGACT TGAAGATTAT CATTCAAGAA
AACACATTCA ATGAGGTTGA ACAAGACTTT TACGAGTCTC TGTACATGCT GACGCGCTCG
AAGTTCGACG CGTTCGTGAA GAAAGGGAGC GTTTTGCACA ACTACGCACA CGTCTTCGAG
CTCCTCGCAC GACTGCGACA AGCGTGCGAT CATCCGTACT TGGTGATTCA TTCGAAGAGT
GCGAACGTGA AAAAAGACGC CCCTGACGCG CCGAAAGTTG AATCCCCGGC AGACACCGAC
GTTCCGAAGC ATTATTGTGG CATGTGTCAG GACGAAATTG AGGAAGAAGA CGCGGCTCTG
GCGAATTGCA AACACATTTT CCATCGTGAG TGCATCATGC AATACGCGTC TTGTGCGCCT
GCGGATGGCA AAAAAGTGAC TTGTCCCGTC TGTCGCACGG CGTTGACGAT TGACTTCTCT
CCAGAAAGTC TCGAAAACGT CAAGAGTGCC ATTAGTCGTA ATTTCAAGGA TGCGCTACCA
GACAAGTCAA TTCTCAACAA GCTCGATCTC ACGCAGTACA CGTCGAGCAC AAAGGTTGAG
ACGCTCGTTA ACGCTCTGCG AGACATGCGT AATCAAGAAA ATGGGCACTT AAACAAAGCC
ATCGTGTTTT CGCAGTACAC AGCCATGATA GAAATCGTCG AATGGCGTTT GAAAAAGGCC
AAGTTTACCA TCGCCAAGCT TCTCGGTTCC ATGCCGGTCA CGCAACGCGC GGCGAATTTG
CAAGCTTTCC GAGAAGATCC AAACGTCAGC GTGATCTTGA TGAGTCTCAA ATCTGGCGGT
GAAGGACTCA ACTTGCAAGC GGCGAATTAC GTATACGTTC TTGAGCCATG GTGGAACCCA
GCGGTGGAAA TGCAAGCCGT GATGCGCGCA CATCGCATCG GGCAGCTTCG ACCGGTGACC
GCTGTTCGAT TTTCGACCAA AGGCACGATT GAAGAACGCA TGATGGAGCT TCAAGAAAAG
AAGCAGCTTG TGTTCGAAGG GTGTATGGAC GGCAATCAAG CCGCGCTTTC TCAACTGACT
GCCGAAGACT TGCAATTTTT GTTCAAGCGA TGA
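
The gene sequence is laid out in blocks of ten bases, six blocks per line. A minimal sketch for working with it, assuming the blocks are pasted in as plain text: strip the whitespace, then compute length and GC percentage. `clean_sequence` and `gc_percent` are hypothetical helper names, and the sample below is only the first line of the sequence, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence as displayed in the record.
first_line = "ATGCATCCGG AGTGTTATCG CGCGGAGGGA GTCACGAGGG AGGAGCTGGA GACGCGCGTG"
seq = clean_sequence(first_line)
print(len(seq), gc_percent(seq))
```

Running the same helpers over all 3033 bases should give a figure comparable to the reported GC content of 55%, although the rounding convention used by the database is not stated.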
 
Protein sequence
MHPECYRAEG VTREELETRV HGLDALKDGD REVLLATVLS PDRPEIENTA LDVTSEEFLR 
RPAVEPMEAP RALTRPLLGF QREGLRWMCD NESGDAKGGI LADEMGMGKT IQCISMLLAR
KEAWMRDRAE VGEMVTDDDR PPPTLVVVPT SALVQWEEEI KSCVEEGSLR VFVYYADRAN
VVEGDFKGYD VVLTTYPVVE AEWRKIINRH LTACQWCGKK YLPRSMVTHL KYFCGPDAVR
TEKLARREKK KKTPAKTEEA DDVKASNIDD IPQTSQGGSQ GGSQFEDEDD VDLSDSLLHR
TQWHRIVLDE AHKIKARTSN TAKCIYALKS TYKWCLTGTP LQNRIGDLYS LVRFLRMDPY
AFYFCSTKGC ECKTLTWNFG PQARFCTNCG CGAPRHYSHF NRTVLNPINR YGYIGDGKKA
MLTLRNDILL PMQLRRTKAE RAEDVRLPDL KIIIQENTFN EVEQDFYESL YMLTRSKFDA
FVKKGSVLHN YAHVFELLAR LRQACDHPYL VIHSKSANVK KDAPDAPKVE SPADTDVPKH
YCGMCQDEIE EEDAALANCK HIFHRECIMQ YASCAPADGK KVTCPVCRTA LTIDFSPESL
ENVKSAISRN FKDALPDKSI LNKLDLTQYT SSTKVETLVN ALRDMRNQEN GHLNKAIVFS
QYTAMIEIVE WRLKKAKFTI AKLLGSMPVT QRAANLQAFR EDPNVSVILM SLKSGGEGLN
LQAANYVYVL EPWWNPAVEM QAVMRAHRIG QLRPVTAVRF STKGTIEERM MELQEKKQLV
FEGCMDGNQA ALSQLTAEDL QFLFKR
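
As a quick consistency check between the two sequences, the first ten codons of the gene sequence can be translated and compared with the start of the protein sequence. The record leaves the translation table field empty, so this sketch assumes the standard genetic code; `translate` is an illustrative helper, not a database function. Because the gene is marked as spliced, translating the full genomic sequence directly is not expected to reproduce the whole protein.

```python
from itertools import product

# Standard genetic code (an assumption; the record does not state the table),
# listed in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a nucleotide string codon by codon; unknown codons become 'X'."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3))


gene_start = "ATGCATCCGG AGTGTTATCG CGCGGAGGGA"  # first 30 bases of the gene sequence
protein_start = "MHPECYRAEG"                     # first 10 residues of the protein
print(translate(gene_start) == protein_start)    # prints: True
```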