Gene Information |
Locus tag | OSTLU_31512 |
Symbol | |
ID | 5002074 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | + |
Start bp | 72347 |
End bp | 73963 |
Gene Length | 1617 bp |
Protein Length | 538 aa |
Translation table | |
GC content | 58% |
IMG OID | 640417495 |
Product | predicted protein |
Protein accession | XP_001417665 |
Protein GI | 145346376 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
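The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, that the gene is unspliced (as the record states), and that the reported gene length includes the stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS.

    Assumes the CDS length is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record: Start bp 72347, End bp 73963.
length = gene_length(72347, 73963)
print(length, protein_length(length))  # prints "1617 538"
```

With the record's values this reproduces the listed gene length of 1617 bp and protein length of 538 aa.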
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.268234 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGATCTCCA GCCTCGCCTG GGTGCCTCCC GGCGCCGCGA GCGCGTCGCC CAAGTACGCG GACGTCCCCG AAGAGGAGCT CGAGCGGCTC GCGCATCGCG CGCGCGACGT CGCGCGTCGC CGCGACTCGC GACGGCGCGA CTCCGACGCC TCGGACGCGT CGGAAGACGA GGATGGGATG ACGGACGATG ATTCGGACGC GTCGAGCGGC GACGCGTCGG ACGAGAGCGA CTGGGAAGAG GTCGAGGCGG ACGCGGAGGA GGATTTGGAC GAAATGGCGG CGGACGAAGA AGACGCGAAA GAAACGACGA CCAAGGCGGT GGCGAAGGCG AAGGCGGTGG CGAAAGCGGC GAAAGGAATG GTTGATGATC TCGCGGAACT GAACATGGAC GCGTACGACG ATGAAGAGGA GGATGAACGC GCGGCGGCGG GACGGTTGTT CGGAAGCGGA CGCATGACGC ACTACGACGG AAACGAGGAC GATCCGTACA TGACGATTAA GGATAGCGAT GACGACGAGG ATGAGATGCC GGACGATATG ACGATGGCGG AGACGGATTT GGTGATATTG GCGGCGCGAA CGGACGAGGA TGTGTCGCAT TTGGAGGTGT GGGTGTACGA GGAGGCGGGA GTGACGGGGA ACGCGGAGAC GAATTTGTAC GTGCATCACG ACGTGCTTTT ACCGGCGTTT CCGTTGAGTG TGGCGTGGAT GAATTGCGCA CCCAAGAGCG GGACGAATGA AGTCAACTGT GCGGCGATCG GGACGATGTA TCCAGGGATC GAGATTTGGG ATTTGGATTG CGTGGACGCC GTCGAGCCGG TGACGACGCT GGGGGGATAT TCAGACGAGG CGATCAAGGC TGCGAGTAAA AAGGGTAAGA AGGGCGGCAA GAAAGAGTCG AAAGCGTTGA AAGGCGGCTC ACACGAAGAC GCCGTCATGG GATTGTCGTG GAATCGCGAG TTTAGAAACG TCCTGGCGTC GGCGAGCGCC GACACGACGG TTAAGATTTG GGACATCGCG ACGGAAACCG CCTCGCAAAC GCTGAATCAT CACAAAGGGA AAGTGCAGGC GTGCGAATGG AACCCAGCTG AACCTACTGT GCTTCTCACA GGATCTTACG ATAAAACGGC TCAAGTTGTA GACGTCCGCG CGCCCGATAA TGCATCACTT ACGTGGAAAG TCGGCGCCGA CGTCGAGAGC GCAATTTGGC ACGTCGGATC GCCGACGCAG TTTTTAGTAT CGAACGAAGA TGGGCTCGTG ATGTGCTTCG ATACACGCAT GGGATCAAAG TCGGACTGTG TTTTCAAGCT CCAGGCGCAC GACAAGGCCA CAACAGGGCT GAGCATGGCG TCTGGTGCGC CCAACCTATT GACGACGTGC TCCACGGACA AGTCGATCAA ATTGTGGGAT TTGAACGATG GTAAACCGTC CTTACTGTGT CAGCACTCTC CTCAAGTGGG AGCTATTTTT GCGTGTGGAT TTTCGCCTTC GGTGCCGTAT TTGATAGCCG CCGCTGGCTC CAAGGGCACC GTGGCGGTTT GGGACATCCT GTCGGAAGCC GCAGTCAAGC AAACTCACGG AAAAACTCTC GAACAATACT ATCGCGTGTC AAAGTAA
Protein sequence | MISSLAWVPP GAASASPKYA DVPEEELERL AHRARDVARR RDSRRRDSDA SDASEDEDGM TDDDSDASSG DASDESDWEE VEADAEEDLD EMAADEEDAK ETTTKAVAKA KAVAKAAKGM VDDLAELNMD AYDDEEEDER AAAGRLFGSG RMTHYDGNED DPYMTIKDSD DDEDEMPDDM TMAETDLVIL AARTDEDVSH LEVWVYEEAG VTGNAETNLY VHHDVLLPAF PLSVAWMNCA PKSGTNEVNC AAIGTMYPGI EIWDLDCVDA VEPVTTLGGY SDEAIKAASK KGKKGGKKES KALKGGSHED AVMGLSWNRE FRNVLASASA DTTVKIWDIA TETASQTLNH HKGKVQACEW NPAEPTVLLT GSYDKTAQVV DVRAPDNASL TWKVGADVES AIWHVGSPTQ FLVSNEDGLV MCFDTRMGSK SDCVFKLQAH DKATTGLSMA SGAPNLLTTC STDKSIKLWD LNDGKPSLLC QHSPQVGAIF ACGFSPSVPY LIAAAGSKGT VAVWDILSEA AVKQTHGKTL EQYYRVSK
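The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned up and sanity-checked against the record's GC content field. The helper names are hypothetical, and the stop-codon check assumes the standard genetic code, since the record leaves the translation table field empty.

```python
def normalize_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C, as a value between 0 and 1."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if seq is a whole number of codons, starts with ATG, and ends
    with a stop codon of the standard genetic code (assumption)."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in {"TAA", "TAG", "TGA"}
    )


# Only the first two blocks of the gene sequence are used here, for brevity;
# paste the full "Gene sequence" field to check the whole record.
fragment = normalize_sequence("ATGATCTCCA GCCTCGCCTG")
print(len(fragment), round(gc_content(fragment) * 100))
```

Running `gc_content` on the full normalized gene sequence and rounding to a percentage gives a value that can be compared with the GC content field, and `looks_like_complete_cds` offers a quick check that the sequence was copied without truncation.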