Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_31583 |
Symbol | |
ID | 5001901 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | + |
Start bp | 175457 |
End bp | 178092 |
Gene Length | 2636 bp |
Protein Length | 851 aa |
Translation table | |
GC content | 64% |
IMG OID | 640417322 |
Product | predicted protein |
Protein accession | XP_001417700 |
Protein GI | 145346451 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0143437 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.733987 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGAAC AGACGATGAA GAGGACGAAG AGCGCGAGAG GGGCGCCGCG AGGCGCGCGC GCGGCGGCGA AGCGCTCGGA AGACAGCGAA GAGGTCGAGG GCGGCGCGCT GAAGCGCATC GTCGCGGCTG GTACGGTTGC GTTTTCCGTG CTCGCGGGCG TGGAGGGGGC GATCGAGCCG GCGCGCGCGG CGGGCTCGGC GACGGAGATC GTGCAACTCG CGCTCGACGC CGTCGACCCC GTCGAGGACC CTGACGCCAA GGCTGAAGCG CCCGAGCGAG TCAAGGCGGA TACTTCCTCG CTCGAGGGTG CGCTTAAGGC ACAAGTGCAG TCGCGAAAGT CGACGGTGAA GGAAGCGGGT AAGAAAGCGG CCAAGGCTGC CGCCGCGCCG GCGAAGAGCG GCGCGCCCGA AGGCGCCATG TCTCCGAACG CGAAGGATTA CAAGTCTGAA ATCGGCGAAA GCCTCGCTAC TCTGGATTAT GACGCCATTA TCAAGAAGAC GGACGACTAT TTTGTCTACC GCTACGATCG CGGCATCGAT GAATCTCAAA TCATCGATCT CGACGACGAA GACGACTCTG CGACGCGCGG TCCGAAGGGC ACGAAGAAAC GAGTCGCGGT GGCCAAGTCC ACCACCGCCC CTTCGTTCAC CTTGCCGAGC TTCACCGCGC CGAGCTTCGA TGCGCCGAGC TTCTCGATTC CGACGTTTGA AGTGCCGAGC ATCGAAATTC CGGCTATTCC GGGTGTAGCC GAACCGGCGA CCAAGAAGGA CACCTCCGCC GAAGACGCCG CGAAGGCTGA CGCCGCGGCC GCGAAGAAGG CCGAGGCCGA GGCCGAGGCT GCGAGGAAGG CCGAGGCCGA CGCCGCGACC GCCAAGAAGG CCGAGGCCGA CGCCGCGGCC GCCAAGAAAG CTGACGCTGA AGCCGCCAAG AAAGCTGACG CCGAAGCCGC CAAGAAGGCC AAGGCTGACG CTGAAGCCGC CAAGAAGGCA GCGGCGGAAG CCGCCAAGGC TGAAGCTGCC GCGAAGAAGG CTGAAAGCGC GAAGAAGCCG ATGGCTGCCG CCCCGGCCGC CGGTAGCAGC GATCTTGGTT TCGATTTCGG CTCTCTCTCT CAATACATGG AATCCGCTCC GGCGGCGCCG AAGGTTGACA AGAAGGCTGA AGCCGCCGCT AAGAAGGCTG CCAAAAAGGC CGCTGCTGAG GCCGCGAAGA AGGCTGCCGA GGAAGAAAAG AAGGCCGCCG CTGCCGCGAA GGCCGCTGCG AAGGCTGCTC CGAAGCCGAT GGCTGCCGCC CCGGCCGCCG GTAGCAGCGA TCTTGGTTTC GATTTCGGCT CTCTCTCTCA ATACATGGAA TCCGCTCCGG CGACGCCGAA GGCACCGAAG GCTGACAACG CGTCCGCCGC CGCTGGTCAA AAGGCTGCCG AAAAGATCGC CAAGCAACAG AAAGAAGCCG CCAAGAAGGC TGAGGCCGCC GCGAAGAAGG CTGAGGCCGC CGCGAAGAAG GCTCAAGCGC AAGAGGAAGC CGCCGCCGCG CGCGCCGCCG CCAAGGCTGA GATGGCCGCG AAGAAGGCGG CCGGGAAATC CGCTGAGAAA CCGACTTACA GCAAGCGCAC TGTTGAAAAG AAGGCGAAGC CGACGTTCAC GAAGTCTGCC AGAGATGGTA AGTTCGCTCC CTTCGCTGGG ACTTACAAGA CGACCGTCGT CGAGAAGGAG GCCCTCCCGG GCGTGCCCGT CGACTTCGAT GCCATCGTCG ACGCCCAAGA ACCCAAGGCG GAAGCGATTC TCGCCAAGGC CAACGATAAA TCTGGCGATT TCTTGAACAT TTCCGGTGAA GCTGGCTTCG CCATCGCGGG TACGATTGCG TTGGTTTACG AGACGGAGGA CAAGAAGTTC CGCGAGCAAG CGAAGAATGC CAAGATGCCG GCGCCGACGA AGACGAATGC TCCCTCCGGT GAAAGCACCA CCGAAGGTTG GTTCGATGCA GCGCTCAAGA AATACATGAA CAAAGATGGT TCCGCCCCGA AGCCAAAGCC CGTCGCCGCC GCGCCGAAGC CGGTCGCCGC CGCGCCGAAG CCGGCCGTGC CGAAGTCTGA TCCGGTGAAG AACGCCAAGG AGGCGCAATC GTGGATGGAC AAGTGGTCCG CGAGCAAGCC TAAGCCGGCT GCCGCCGCGG CGCCGGCGCC GGCGCCTGCC GCGCCGAAGC CCGCGGCGCC GAAGTCTGAT CCGGTGAAGA ACGCCAAGGA GGCACAGTCG TGGATGGACA ACTGGGAGCG CAAGGTCAAG CCGACCGCCG CCGCCGCGCC GACGCCGGTC GCCGCGCCGG CGCCGACGCC GGTCGCCGCG CCGGCGCCGA CGCCGGTCGC CGCGGCGCCG AAGCCCGTCC CGACGCCGAC GGTCTCGACC ACCACCACTC GCACGGTCAC CTCGGACAAC CTGACGGCCG AGCAACGCGC CGCCGCCGAA GCCTGGCTCA AGAAGTGGCG CGAGGACGGT CGCCCGACGG ACGAGACCAA ATTCGACGAG GCCAAGACGT GGTTGAAGCA GCACAACTTC GACTGAGCGA TTACAAACAA CGAGTGAATC GTTCACATTG ACTGTTCATG AAGACATTAA TAAACAAACG CACACGCACT AAGCTA
|
Protein sequence | MDEQTMKRTK SARGAPRGAR AAAKRSEDSE EVEGGALKRI VAAGTVAFSV LAGVEGAIEP ARAAGSATEI VQLALDAVDP VEDPDAKAEA PERVKADTSS LEGALKAQVQ SRKSTVKEAG KKAAKAAAAP AKSGAPEGAM SPNAKDYKSE IGESLATLDY DAIIKKTDDY FVYRYDRGID ESQIIDLDDE DDSATRGPKG TKKRVAVAKS TTAPSFTLPS FTAPSFDAPS FSIPTFEVPS IEIPAIPGVA EPATKKDTSA EDAAKADAAA AKKAEAEAEA ARKAEADAAT AKKAEADAAA AKKADAEAAK KADAEAAKKA KADAEAAKKA AAEAAKAEAA AKKAESAKKP MAAAPAAGSS DLGFDFGSLS QYMESAPAAP KVDKKAEAAA KKAAKKAAAE AAKKAAEEEK KAAAAAKAAA KAAPKPMAAA PAAGSSDLGF DFGSLSQYME SAPATPKAPK ADNASAAAGQ KAAEKIAKQQ KEAAKKAEAA AKKAEAAAKK AQAQEEAAAA RAAAKAEMAA KKAAGKSAEK PTYSKRTVEK KAKPTFTKSA RDGKFAPFAG TYKTTVVEKE ALPGVPVDFD AIVDAQEPKA EAILAKANDK SGDFLNISGE AGFAIAGTIA LVYETEDKKF REQAKNAKMP APTKTNAPSG ESTTEGWFDA ALKKYMNKDG SAPKPKPVAA APKPVAAAPK PAVPKSDPVK NAKEAQSWMD KWSASKPKPA AAAAPAPAPA APKPAAPKSD PVKNAKEAQS WMDNWERKVK PTAAAAPTPV AAPAPTPVAA PAPTPVAAAP KPVPTPTVST TTTRTVTSDN LTAEQRAAAE AWLKKWREDG RPTDETKFDE AKTWLKQHNF D
|
| |