Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_36534 |
Symbol | CHR3501 |
ID | 5006947 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009375 |
Strand | + |
Start bp | 64360 |
End bp | 67392 |
Gene Length | 3033 bp |
Protein Length | 806 aa |
Translation table | |
GC content | 55% |
IMG OID | 640422368 |
Product | predicted protein |
Protein accession | XP_001422798 |
Protein GI | 145357178 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.00151979 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0248069 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATCCGG AGTGTTATCG CGCGGAGGGA GTCACGAGGG AGGAGCTGGA GACGCGCGTG CACGGGTTGG ACGCGCTGAA GGATGGGGAT CGGGAGGTGT TGCTGGCGAC GGTGCTGTCG CCGGATCGGC CGGAGATCGA GAACACGGCG TTGGACGTGA CGAGCGAGGA GTTTTTGCGT CGGCCGGCGG TGGAACCGAT GGAGGCGCCG CGGGCGTTGA CGCGGCCGTT GTTGGGGTTT CAGCGCGAGG GGTTGCGGTG GATGTGCGAT AACGAGAGCG GCGATGCGAA GGGGGGGATA TTGGCGGATG AGATGGGGAT GGGGAAGACG ATACAGTGCA TATCGATGTT GCTGGCGAGG AAGGAGGCGT GGATGCGAGA CCGCGCCGAG GTGGGGGAGA TGGTGACGGA CGACGACAGA CCGCCGCCGA CGCTCGTGGT GGTGCCGACG TCGGCGCTCG TTCAGTGGGA AGAAGAGATC AAATCGTGCG TCGAGGAAGG GTCGCTGCGC GTGTTTGTGT ATTACGCTGA TCGCGCAAAC GTCGTGGAAG GAGACTTTAA AGGATACGAC GTCGTGTTGA CGACGTATCC CGTCGTCGAA GCCGAGTGGC GGAAAATCAT CAACCGACAC TTGACGGCGT GTCAGTGGTG TGGGAAAAAG TACTTACCTC GCTCCATGGT GACGCACTTG AAGTACTTCT GTGGACCAGA CGCCGTGCGC ACGGAAAAAC TCGCGCGGCG CGAGGTGACG CGCGACGTGG CGAACGAAAA AGCCATGCGC ACGCTGAAAA TCAAGCCGGG CAGCGCCAAG GACGTGAAGA AGGGGATTCC CACGATGGCG AACGTATACA AGGAACTCAT GGCGATGGCC GGACGGGAGA CGCTGAGCAT GTATGATGGC GCGCACAAGG CGCGCGCACG CGCGGCTTCA GGTCTCGCCC CGGGCGGCGA CGTCGTCGTC GTCAAGGAGG AAGTTGAAGA CGGCGTCGCC GAGCCGAGCG AAGTTCTGAA AGCTTTGATT TCGCAGCTTC CAGTGCCGAC GATTGTAGTT GAGAATATCA AAGAGGAGTC GATTGAGGAG AAAGAAAAAG AGGTCGAGTC AGTGAACGAG CCCGCCTTGG CTGACGCGTC AACGGCGGCG ATAGCGAGTA CGGTGAAGAA GGCGCAGAAG CGCAAGTCGA AGGCTTCGGG TAAAGCGACT TCGACTTCGA GCGCAAAGAA AAAGAAGAAG AGTCTGCGCG AAGCTAGCGA CGGCGAGGCT GAAAGTGATT ACAAACCGGA CAGTGATAGC GAAGATGATG AGATCATATT AGTCGACGAT AGCGAGAGCG AAGATAGAAA GCCAAAGAAG AAACAGAAGA AAAAGAAAAC GCCGGCGAAA ACCGAGGAAG CGGACGACGT GAAGGCGTCC AACATCGACG ACATTCCGCA AACCTCGCAA GGTGGTTCGC AAGGTGGGAG CCAGTTTGAA GACGAAGACG ACGTAGATTT GTCGGATTCC CTTCTTCATC GCACGCAGTG GCACCGAATC GTTCTCGACG AAGCGCACAA GATCAAGGCG CGCACGAGCA ACACCGCCAA GTGTATCTAC GCTTTGAAAT CCACGTATAA GTGGTGTTTG ACAGGTACAC CGTTGCAGAA TCGAATCGGC GATCTTTACA GCTTGGTGAG ATTTTTGCGT ATGGATCCGT ACGCGTTTTA CTTTTGTTCG ACGAAGGGTT GCGAGTGCAA AACGCTCACC TGGAACTTCG GTCCTCAGGC GCGATTTTGT ACCAACTGCG GATGCGGCGC TCCTAGGCAT TATTCGCATT TCAATCGCAC CGTGCTAAAC CCGATCAACC GTTATGGCTA CATCGGTGAC GGCAAGAAAG CGATGCTGAC TCTTAGAAAT GACATTTTGT TGCCGATGCA ACTTCGCCGG ACCAAGGCGG AACGCGCCGA GGACGTGCGA CTGCCGGACT TGAAGATTAT CATTCAAGAA AACACATTCA ATGAGGTTGA ACAAGACTTT TACGAGTCTC TGTACATGCT GACGCGCTCG AAGTTCGACG CGTTCGTGAA GAAAGGGAGC GTTTTGCACA ACTACGCACA CGTCTTCGAG CTCCTCGCAC GACTGCGACA AGCGTGCGAT CATCCGTACT TGGTGATTCA TTCGAAGAGT GCGAACGTGA AAAAAGACGC CCCTGACGCG CCGAAAGTTG AATCCCCGGC AGACACCGAC GTTCCGAAGC ATTATTGTGG CATGTGTCAG GACGAAATTG AGGAAGAAGA CGCGGCTCTG GCGAATTGCA AACACATTTT CCATCGTGAG TGCATCATGC AATACGCGTC TTGTGCGCCT GCGGATGGCA AAAAAGTGAC TTGTCCCGTC TGTCGCACGG CGTTGACGAT TGACTTCTCT CCAGAAAGTC TCGAAAACGT CAAGAGTGCC ATTAGTCGTA ATTTCAAGGA TGCGCTACCA GACAAGTCAA TTCTCAACAA GCTCGATCTC ACGCAGTACA CGTCGAGCAC AAAGGTTGAG ACGCTCGTTA ACGCTCTGCG AGACATGCGT AATCAAGAAA ATGGGCACTT AAACAAAGCC ATCGTGTTTT CGCAGTACAC AGCCATGATA GAAATCGTCG AATGGCGTTT GAAAAAGGCC AAGTTTACCA TCGCCAAGCT TCTCGGTTCC ATGCCGGTCA CGCAACGCGC GGCGAATTTG CAAGCTTTCC GAGAAGATCC AAACGTCAGC GTGATCTTGA TGAGTCTCAA ATCTGGCGGT GAAGGACTCA ACTTGCAAGC GGCGAATTAC GTATACGTTC TTGAGCCATG GTGGAACCCA GCGGTGGAAA TGCAAGCCGT GATGCGCGCA CATCGCATCG GGCAGCTTCG ACCGGTGACC GCTGTTCGAT TTTCGACCAA AGGCACGATT GAAGAACGCA TGATGGAGCT TCAAGAAAAG AAGCAGCTTG TGTTCGAAGG GTGTATGGAC GGCAATCAAG CCGCGCTTTC TCAACTGACT GCCGAAGACT TGCAATTTTT GTTCAAGCGA TGA
|
Protein sequence | MHPECYRAEG VTREELETRV HGLDALKDGD REVLLATVLS PDRPEIENTA LDVTSEEFLR RPAVEPMEAP RALTRPLLGF QREGLRWMCD NESGDAKGGI LADEMGMGKT IQCISMLLAR KEAWMRDRAE VGEMVTDDDR PPPTLVVVPT SALVQWEEEI KSCVEEGSLR VFVYYADRAN VVEGDFKGYD VVLTTYPVVE AEWRKIINRH LTACQWCGKK YLPRSMVTHL KYFCGPDAVR TEKLARREKK KKTPAKTEEA DDVKASNIDD IPQTSQGGSQ GGSQFEDEDD VDLSDSLLHR TQWHRIVLDE AHKIKARTSN TAKCIYALKS TYKWCLTGTP LQNRIGDLYS LVRFLRMDPY AFYFCSTKGC ECKTLTWNFG PQARFCTNCG CGAPRHYSHF NRTVLNPINR YGYIGDGKKA MLTLRNDILL PMQLRRTKAE RAEDVRLPDL KIIIQENTFN EVEQDFYESL YMLTRSKFDA FVKKGSVLHN YAHVFELLAR LRQACDHPYL VIHSKSANVK KDAPDAPKVE SPADTDVPKH YCGMCQDEIE EEDAALANCK HIFHRECIMQ YASCAPADGK KVTCPVCRTA LTIDFSPESL ENVKSAISRN FKDALPDKSI LNKLDLTQYT SSTKVETLVN ALRDMRNQEN GHLNKAIVFS QYTAMIEIVE WRLKKAKFTI AKLLGSMPVT QRAANLQAFR EDPNVSVILM SLKSGGEGLN LQAANYVYVL EPWWNPAVEM QAVMRAHRIG QLRPVTAVRF STKGTIEERM MELQEKKQLV FEGCMDGNQA ALSQLTAEDL QFLFKR
|
| |