Gene OSTLU_49799 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_49799 
Symbol 
ID5002588 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009360 
Strand
Start bp491901 
End bp494783 
Gene Length2883 bp 
Protein Length933 aa 
Translation table 
GC content54% 
IMG OID640418009 
Productpredicted protein 
Protein accessionXP_001418258 
Protein GI145347614 
COG category[L] Replication, recombination and repair 
COG ID[COG4581] Superfamily II RNA helicase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.238463 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCGAG ACGAGGGAAG GATCGAGGTC CCCGAAGTCC CGGCGAAGTC GTACGCGTTC 
GAGCTGGACA CGTTTCAGCA AAAGGCGGTG GAGTGCTTGG AGCGGGGGGA GTCGGTGCTG
GTGAGCGCGC ACACGTCGGC GGGGAAGACG GTGGTGGCGG AATACGCGAT CGCGATGGCG
ATACGAGACG GACAGCGCGT GGTGTACACG TCACCGTTGA AGGCGCTGAG TAATCAAAAG
TATCGCGAGC TGAAGGAGGA GTTTGAGGAC GTGGGATTGA TGACGGGGGA CGTGGTGATA
AATCCGAGCG CGTCGTGCCT GGTGATGACG ACGGAGGTTT TGCGATCGAT GCTGTATCGG
GGAGGGGAGG TGATGCGCGA GGTTGGGTGG GTGATTTATG ATGAGATTCA TTACATGCGG
GACAGCGAGC GAGGGGTGGT TTGGGAGGAG TCCATCGTGT TGTTGCCGGA CATGGTCAAG
TACGTGTTTC TATCGGCGAC GATTCCGAAC GCTCGAGAGT TCGCGGAGTG GGTGTGCAAG
ACGCATAATC AACCGTGTCA CATCGTGTAC ACTGACTTTC GTCCGACGCC GTTGGAGCAT
TACGTCTTCC CGGCGAACGG AGAGGGGATA TTTTTAGTCA TGGATCGGCA GTCCAAGTTC
AGGGACAGCA ATTTCGAGCA AGCCGTCACC GTCATCGCCG ACGGCGGCGG CGCGGCGGCG
GCACGCGTGG CCAATCGAGC GCGAGGTGAT GATGGAAAAA AGGAGGCAGT TAACCAGGAC
ATCTTTAAAA TCATCCGTAT GGTCGTCGAG CGCAACTACG ATCCTGTGAT TGTGTTCGCG
TTCAACAAAC ACGAGTGCGA AAAGATGGCT AACTCATTAC ACAAAGTTGA TTTGTGCGAC
GAAGACGAGA AAAAGCTCAT CGACACGATT TATTGGAACG CCATGGACTC GTTGTCGGAC
GAAGACAAAC GATTGCCTCA AGTCGCAAAT TTGCCAAACC TTTTACGACG AGGTTTGGGT
GTGCACCACT CGGGACTGCT GCCGATTTTG AAGGAGGTGA TCGAGATTTT ATTCCAGGAA
GGCTTGATCA AAGTGCTCTT CGCCACGGAG ACCATGTCGG TGGGACTGAA CATGCCGGCT
CGCACGGTCG TGTTTTGCTC TCCTCGAAAG TTCGACGGCG CTGGTTTTCG TTGGATCACG
TCTGGCGAGT ACATTCAAAT GTCTGGTCGA GCCGGTCGTC GCGGCAAGGA CGACCGCGGT
TTGGTAATCT TGATGATGGA TGAGCGTATG GATCCGCCGG TGGCGAAGAA CATGTTACAC
GGGCAATCTG ATACTCTTGA TAGCGCATTT CACTTGAACT ACGCAATGAT TTTGAACCTG
ATGCGCGTCG AAGGTGCTGA GCCAGAGTCG CTCATTCAAT CGTCGTTTGC GCAATTCCAA
AACGATCGCG CGTTGCCGGG TCTGGAGGCG AAGATTGTCG AGATACAGAA GGATCGGGAC
GCGGTGAAGA TTCACGACGA AGACAGCGTC GACGAGTATG TCAAGCTCAA AGACGGCCTC
GATGCCATGA TTCGAGAACG TCGTGTCGTC ACCAACACTC CAACGCACGC GGTGCCGTTT
TTGCAGCCTG GTCGATTGGT GCGCGTGTGC ACAAAATCGC CGTCGATTTC GTCTACGTAT
GACGAGGAAG ACGATTCGAT TAGAATTCCC GTACCAGGAA CGGAGCCGGG TGAAGAGGAC
GTCGTTTGGG GCATGATTGT GTCTTTCGAG CGCATCGGTG GCGGTGGAAA ATCTGGGAAA
GCGGCGTACG GGGTCGACGT TCTGGTGCGC ACGCGCGAGA ATAGTGACGG TAAGACTCCG
TTATCATCAA AGAGTAAAAA CGATAGATAC GAGTTTTTGA ATGCGAACGA GGAGGACGAT
TCGTCGGAGC CGCGGGTGAT TCGAGTGCCC CTCGAGCAGC TCGATGTATT GAGCAGCGTT
CGCGTATACT TGCCGAAAGA CTTGCATCCA CGCGAGGCGA GAGATCAGTG CATTAGCAGT
GTGGGGGAGG TCATCAAGCG GTTCCCTGAT GGCGTGCCGG TTTTGGATGC CACGCGAGAT
CTGAAGATCG ATAGTGAAAA CTTCTCCAAG CTCTTAAAGC GAATCGACGG CATCAAGTCG
ATGATGAAGA AGCACCCCGT CGCCTCAAGT GAAAGGCTCG TCGAACAGCT CTCAGCGCAC
AAGAGAAAAC GTGAGCTCTC CATCGCACTG AAGCAGGCGA AGAAAAACGC AAAGGCTGCC
GCTGGACTGA TCATGCGCAA TGAGCTGAAG CAGATGCGTC GCGTGCTCAA ACGACTCGGG
CACACGAGCG CTGAGGGCGT GGTACAAACG AAGGGAAGAG TGGCGTGCGA ACTCGCCTCG
GTCGACGAGC TCGTCACCGC GGAGCTCATC TTCAACGGTA TGTTCAAAGA AGTCGATGTT
GATATGCTCG TCGCTTTGGT TTCGTGCTTG GTGTGGCGCG AGAAGTCGCG CAACACACCC
AAGCTTAGCG AAGAAACCGC GGAAGTGTTT TCGCGCCTGA AGGATGTTGC GCGCAAAGTC
GGGAAACAAA TGATGGAGTG TAGGATGAGC GTGGACGTCG AAGAGTACGT AGAGGGCTTT
AGGAGCGAGC TCATGGAAAT CATGCTCGCG TGGTGCAAAG GGAATAAATT TGCAGAGATT
ATGAAAATGA CAGATTTGTT CGAAGGTTCC ATAGTGCGCG CCATTCGGCG TGTCGAGGAG
GTTTTGCGCC AACTGTCTGA CGCGTGTCGG GTCATAGGCG AGACTGAACT TCAAGAAAAG
TTCACAATCG CGAGCGAAAA AGTGAAACGC GACATAGTGT TCGTCGCGAG CTTGTTCCTT
TAG
 
Protein sequence
MTRDEGRIEV PEVPAKSYAF ELDTFQQKAV ECLERGESVL VSAHTSAGKT VVAEYAIAMA 
IRDGQRVVYT SPLKALSNQK YRELKEEFED VGLMTGDVVI NPSASCLVMT TEVLRSMLYR
GGEVMREVGW VIYDEIHYMR DSERGVVWEE SIVLLPDMVK YVFLSATIPN AREFAEWVCK
THNQPCHIVY TDFRPTPLEH YVFPANGEGI FLVMDRQSKF RDSNFEQAVT VIADGGGAAA
ARVANRARGD DGKKEAVNQD IFKIIRMVVE RNYDPVIVFA FNKHECEKMA NSLHKVDLCD
EDEKKLIDTI YWNAMDSLSD EDKRLPQVAN LPNLLRRGLG VHHSGLLPIL KEVIEILFQE
GLIKVLFATE TMSVGLNMPA RTVVFCSPRK FDGAGFRWIT SGEYIQMSGR AGRRGKDDRG
LVILMMDERM DPPVAKNMLH GQSDTLDSAF HLNYAMILNL MRVEGAEPES LIQSSFAQFQ
NDRALPGLEA KIVEIQKDRD AVKIHDEDSV DEYVKLKDGL DAMIRERRVV TNTPTHAVPF
LQPGRLVRPG EEDVVWGMIV SFERIGGGGK SGKAAYGVDV LVRTRENSDG KTPLSSKSKN
DRYEFLNANE EDDSSEPRVI RVPLEQLDVL SSVRVYLPKD LHPREARDQC ISSVGEVIKR
FPDGVPVLDA TRDLKIDSEN FSKLLKRIDG IKSMMKKHPV ASSERLVEQL SAHKRKRELS
IALKQAKKNA KAAAGLIMRN ELKQMRRVLK RLGHTSAEGV VQTKGRVACE LASVDELVTA
ELIFNGMFKE VDVDMLVALV SCLVWREKSR NTPKLSEETA EVFSRLKDVA RKVGKQMMEC
RMSVDVEEYV EGFRSELMEI MLAWCKGNKF AEIMKMTDLF EGSIVRAIRR VEEVLRQLSD
ACRVIGETEL QEKFTIASEK VKRDIVFVAS LFL