Gene Information |
Locus tag | OSTLU_31150 |
Symbol | |
ID | 5001506 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | - |
Start bp | 392750 |
End bp | 396115 |
Gene Length | 3366 bp |
Protein Length | 1121 aa |
Translation table | |
GC content | 54% |
IMG OID | 640416927 |
Product | predicted protein |
Protein accession | XP_001417492 |
Protein GI | 145346014 |
COG category | [S] Function unknown |
COG ID | [COG5594] Uncharacterized integral membrane protein |
TIGRFAM ID | |
|
|
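The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The record lists the gene as not spliced, so the gene length is taken as the coding length. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length = gene_length_bp(392750, 396115)
print(length)                           # 3366
print(expected_protein_length(length))  # 1121
```

Both results agree with the Gene Length (3366 bp) and Protein Length (1121 aa) fields in the record.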
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.034455 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGACGC CGACGCAAGA AAACTACGGC GCGACGCCGA CGAGCGAGCT CACGAAGGTG CAACCGCCGC GAACGCGAAC GCCGTGGGAC TCGGCGGCGT GGATCACGGC GACGCTGTTG GACGAGCAAG GCCAACGGTG TCCGTGCCCG CCGGGATGCG CCGGGGATTG CGCGGTGCGC TCGCATTGGT TGTTTAATGA GAAGGATAAC GACCTGCGAA CGATTCACAC GCTGCAAAGC ACGTCGAATC CGCCGATGGT GAGCTTGAAC GCGGGTAACG TGGCGCCGTG GGGTAAGGTG GCGCAGGTGA ACGCGTCGAG TTATCGCGTC ATGGGTGGGT GTCCGCCGTC GGATCAGATG GACTGCATCG CGTGCGATAC TGGTGCTGGA TACTATCCGG TGGTGTACGC TGGCGACTTT TCTATCGAAG GTATCAAGCA AACGGGCGCG GGTATTTCGT GCTATTATCT GGAGATTTGT GACAACGACG ACTCCTGCGA TACGCAGATC GCAATGTCCG AAGTGGGGAC GCTGGTAGGG TCTTACGCGG GTCTGTTTTC ATTCCTTCTC ATCCTCTTTG TTATGTTGCG TAAAATCGTA TGGCTCCGGA TGGCGATCGA CTCGGCGGCG TGGGTTGAAA AGGGCAACCC CAAGTACCCG CCGCGAGAAA TTCAGATTCC CAAACCGAGA GACGACAATG CATGGTCGTG GCTTTATGAC GCTTATCACC GCGATAATAA TTGGATGAAA GAGTTCACCA CCCCAGACGA GTACATGCTC GTCCGCTGGT TCAAGTTATC CAGTCGCTTC TTCTTCACCG CCGGCGCCGT ATGTTGTCCA ATTCTCATGA GTTTGTACGC CGCCGATACC GTACCAAGCG CAGATGCGGG CTCGAAGATA CTTACAACTC TCGAAAAGAG CGGTATCGCG AAATACACAC TGCTCAATGC TCGAACCGAA TCTTCTTTCG CGGCCGCCAT GGCTTTCACC TGGATCACAA GTCTCTTCCT GATTTCTTTG ATTCGCGTCG AGTCGCGTAA GTACGTCCAC ATGATGTGGA CGGTGGATCC CGATAAGACT GGTATTAAAG CGAACGCAAT CCTCGTCAAG GATATGCCCT TGTTAACCAC GGCTCCGGCA CCGAAAAAGT TTGAACAGCT TAACACCGGC AGCGTCAAAG ATATTCTTAA AGTCAAAAAG AGTGTGCGAG GTTCTGTGAA GAAACTCGAC AAAATCTTCG ACGACGAAGA AGTCGGCTGC CTGGGGAGAT TTAAGCTTTT ACTCAACGAT GGCACCGTGC AATCTTCCGG TGATTCCGCC AAGCTTCGGC TTTTATACAA GGAGGAGTCG ATGAACCTTG TCATCAGCAA GTTTGAGAGC GTCCTCGGCA AAGACTGCAT CGCTTTCAAA ATGCTCGCCT CGGATACTCG AAAGCTTGAT AGCGCCGCGA AAGCCTGGGC GAATGCCCGC GAGCACGTCA CACAGAACAT GCAAGCGATC GCCGACTTAC AAGAGACCGA AAAGACTGGA GGACTCAGCT GGGGGGAATC GACTCAACTT GCGAAAGCGT TGAAAGATAT GGATTCGTTG AAGAGAGCCG AGGCGCAACG CTTCGATGTT TTCATCTCCA CTCGCGATGA ATACATCAAC AACCACCGAC CCGCGTGTAG TGCGGTCGTG GTATTTGCGA GACAAATGGA CGCCGTCATA GCCTCTCAAA TTCAAATTGA CGACGTTCCC GGGCAGTGGG TCACCGAACC GGCGCCAGGC AACTCTGATG TCGTGTGGCA CAATCTCTCC 
TTGACATCCG TCGAACGCGC AAAAAAGACG ACTCAGGCTT TTTTTATCGC CGTGGCCATT TCTCTATTCT TCATGTACCC TGTCAATATC GCTGTCGCGG CTGTCGCCGA CGTCAAGGAC TCTCTCGTGA GCGTGTTTGG CGAGTCTATT TACAACATCA TCCTGTCAAT TGTGTTGACG GTGTTCCTCG TCGTCGGTCA CATTTTAAGC TTGGTCGTGA GTCGGCAAAC TGGCTACGTC TCGGTCAGCG CTATGGACTC ATTCGGTGCG TCTATGTACT TTTGGCTTCT CATTCTCAAC TTGGTCTTTT CCAACCTGAA CACGACGCCC CTGTGGAAGG ACGTCCTCGT GTGGATGCAA AAGCCGCACT TGTTCACGTA CCAGTTCATC TTGAGGTTGA TGAATACCAG TACTTTCTTC CTCCAGTTCG TCATGCTTCG TACCGCGACT TCACCGGTTC TCGAGTTGAT CCATCCTCCG GTACTCCTCG GCTTCGTCAC AAAGTGCTTG CTATACCGCA GTCGAGCGCG CACGTGGCCG GCTTTCGCAA AGAGACTCAT CTGGGCTCAA CCGACGCCCA CGCCGAGCCA TCGGGTTCCC GCGCAAACGA TGTTGGTGTT CTTCATCGGC ATAATTTACA CCGTCGTCGC ACCAGTTTTA CTTCCGGTGT GCGGCGTCTT CTTCGGTTTT TTCTACATTT TCTGGAAGCA CAACATGGTC TATCACTACA TCCAACAGTA CTCCGCGGGG ACGTCCATGT GGGCGTGGCT CGTCGGAAAG ATGTATTTCA GCCTCGTGTT CAGCCAAATC ATGGTCGCTT TCGGTCTTCC GACGCTCGGC TTCAACACGA TGAAGTATCG TGTCTTCATC ATACCTCTCG TTTTATTCAC CCTTCTCGAA TGGTCGCGCG TGAATACCAT CCTCAACGAT GCGTTCAGGG TGCCGGTACA CGCGGCTGGC GCAGCGTTGA AGCGCAGAAG CGGCAAGCAT GAAGAAGACT CGGACGACGA ATACTTCGCT TCGAGATCGG CTTCGAGATC GGATGTTCCA GCGCTCGAGG AAGACGCTCG GCAAGAAATC TTCGTGAGCA CGACGCGAAT CGGTGATTCA AGGAAGAGGA AGGTTATGCG CGGCATCGTG CCGGTGGAAG AAATGAAAAA GAGCACACGC CGGCAAAAGA ATATTCTCGA AAACACTCGA GAGCAAGGAA AAGCTGAGAT CGAGTACAAA GTGAAGAAGG GAATCTGGCA AACGTACGCG CCGAGCGTGT TGTGGCCACT CGCCGCCGAG AAGTCTGCCG GTTCCATTTT CTTGCGTCGC TGGAAACAAA TCAAGGCGCG AAAGCAAGTC GAAGCAGACA TGTTGGCCGC CGTGGCGCAC TTACCCGACG ACGACCAGAG AAAGAAGGCA GTCATCAAAG ATTTGGCGCT CAGGAAACAA GTGCGCGACT CCGCCGCGGA AGCCATTCTT CGAAAATCCA CCTCGCCGTA CGTGAAGCAA CTTCGACAGA AGCGCGCCGC CGAGGAAGCC GAGAAAGCCG ACACTCGAGC TGGCTCGACC AAGTAG
|
Protein sequence | MATPTQENYG ATPTSELTKV QPPRTRTPWD SAAWITATLL DEQGQRCPCP PGCAGDCAVR SHWLFNEKDN DLRTIHTLQS TSNPPMVSLN AGNVAPWGKV AQVNASSYRV MGGCPPSDQM DCIACDTGAG YYPVVYAGDF SIEGIKQTGA GISCYYLEIC DNDDSCDTQI AMSEVGTLVG SYAGLFSFLL ILFVMLRKIV WLRMAIDSAA WVEKGNPKYP PREIQIPKPR DDNAWSWLYD AYHRDNNWMK EFTTPDEYML VRWFKLSSRF FFTAGAVCCP ILMSLYAADT VPSADAGSKI LTTLEKSGIA KYTLLNARTE SSFAAAMAFT WITSLFLISL IRVESRKYVH MMWTVDPDKT GIKANAILVK DMPLLTTAPA PKKFEQLNTG SVKDILKVKK SVRGSVKKLD KIFDDEEVGC LGRFKLLLND GTVQSSGDSA KLRLLYKEES MNLVISKFES VLGKDCIAFK MLASDTRKLD SAAKAWANAR EHVTQNMQAI ADLQETEKTG GLSWGESTQL AKALKDMDSL KRAEAQRFDV FISTRDEYIN NHRPACSAVV VFARQMDAVI ASQIQIDDVP GQWVTEPAPG NSDVVWHNLS LTSVERAKKT TQAFFIAVAI SLFFMYPVNI AVAAVADVKD SLVSVFGESI YNIILSIVLT VFLVVGHILS LVVSRQTGYV SVSAMDSFGA SMYFWLLILN LVFSNLNTTP LWKDVLVWMQ KPHLFTYQFI LRLMNTSTFF LQFVMLRTAT SPVLELIHPP VLLGFVTKCL LYRSRARTWP AFAKRLIWAQ PTPTPSHRVP AQTMLVFFIG IIYTVVAPVL LPVCGVFFGF FYIFWKHNMV YHYIQQYSAG TSMWAWLVGK MYFSLVFSQI MVAFGLPTLG FNTMKYRVFI IPLVLFTLLE WSRVNTILND AFRVPVHAAG AALKRRSGKH EEDSDDEYFA SRSASRSDVP ALEEDARQEI FVSTTRIGDS RKRKVMRGIV PVEEMKKSTR RQKNILENTR EQGKAEIEYK VKKGIWQTYA PSVLWPLAAE KSAGSIFLRR WKQIKARKQV EADMLAAVAH LPDDDQRKKA VIKDLALRKQ VRDSAAEAIL RKSTSPYVKQ LRQKRAAEEA EKADTRAGST K
|
| |
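The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. The Translation table field is empty in this record, so the standard genetic code (NCBI table 1) is assumed here. The helper names are hypothetical, and only the first 30 bases of the gene sequence are used to keep the example short. The gene is on the minus strand of the replicon, but the sequence shown in the record already begins with ATG, so it is treated as the coding strand and no reverse complement is taken.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base groups of the gene sequence above.
prefix = "ATGGCGACGC CGACGCAAGA AAACTACGGC"
print(translate(prefix))              # MATPTQENYG
print(round(gc_fraction(prefix), 2))  # 0.6 (prefix only, not the whole gene)
```

The translated prefix matches the first ten residues of the protein sequence in the record. Running the same functions on the full gene sequence would be the way to check the Protein Length and GC content fields.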