Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_29234 |
Symbol | |
ID | 4999667 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009355 |
Strand | - |
Start bp | 1044831 |
End bp | 1048958 |
Gene Length | 4128 bp |
Protein Length | 1307 aa |
Translation table | |
GC content | 55% |
IMG OID | 640415088 |
Product | predicted protein |
Protein accession | XP_001416012 |
Protein GI | 145341842 |
COG category | [S] Function unknown |
COG ID | [COG5594] Uncharacterized integral membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGGCG CGTCCAGCGG CAGTGGCTGC AGCGGAACTT CATGCGTTCG TTTCGCGACG AGCGACCGCG ATATATACGT TGGGTGCGTG ATGACGCGTA TAATGCGGCA ACCTCACGTC GTGACCGCGC GCCGCGCGCG CGCGACGCGA CGCGCGCGTC GAGAGAGCTT TGGGACACGG ACGAGCGTTT AAACGAACTT TTACCGACGT TGACTCGACA CGCGCTCGAA AAAACGACTG ACGACGAGGC GCGACGCGCG CGCGAAACGT AGTTTGGGCA TTTACGCCAC AATCGCGTTC TTGGGGTTGC TTCTCTTCGG ACGCATGCGA CACGTGATGC CTATTTATTT CGGACGGTTG CGACTGCGAA ATTTGACGAA GCCGCCGCCG CCGTTTCACT CGCGGAAGCG CGATGGTACG ACAAAAGACG GCGTGTTTAA GCGTGTGATG CGATACTACT TTGGTTGGAT TCCGCACATT TTGCGCGTGG ACGATAAGAC GCTGATCCAG ACGGCGGGGC TGGATGCGTT TGCGTTTTTA CGCGTGTGTC AGTTCGGGTT ACAATTATTC GTGCCGTTGT CGATTTTTTC AATGATGATT TTGTTGCCTA TTCACGTGAA CGGGGACGAT ATGGTGCGCC AACACGCACA GTATATTGTC GCGAAAGTGA ACACGACTGC GGAAGTGCCA GGCGGGTTGA TCTTAACAAC GGTGGCAAAC ATACCAGGTA AGCAAGGCGT GCTTTGGTTG CACACTGTGG GGATGTGGTT GATGGTCTTA TACACGACTT GGCTTCTGAA GCAGCACAGT GCGACGTTTG TGGTACTTAG AACACTTTAT TTAACCACTC GCGGCGACAC CAATCTGTGG CGAGTGGTGC ACCAACCGTC GAACTTCATA GAGCAGTTGC TCGTGCAAGG AACGCAGGCG GGAGCTGAAA TCGACGACGA TGAATTGCGC AAGATGAAAA GTGCCAATTC AGACGACATT AACGCGGCAT GCGATGCGCT GGCGGCGATG GACGAGTATG AAGATGAGCA AAGAGGCTTG TTGAAGCGCG CTGTGGACGT AACTTCACAC AAGGCAAAGT CGTTGCTCAC TCGCGCGGCG TCGACACGGC TCGTAGACAA GAGGCTTGAA GATAAAGAGC GGCGTCAGAG TTTGCACGAG CTCGACACTG AGCCATCGCC CACGCATGCA TATGGCGATA CGCTAGCGTC ATCGACGGTG AAGATATCAC CTTTCGCGAG TACCCCTCAT CTATCGCCTT TTGGTGGGCC GACAGCGCGG CGACTCACGC CTCCGCAGGT TGGCAACGTC GCGCAAGGCT TGAAGGGAGC TGGAACTTTG TTCAAGTCGC CCGCATCTGC CGACGCAGAC GATCGCCAAA TGGCCCGGAG ACCTTCGGTA CCTGTAACAC CGACGCTCGG CCGGGCGTCG ACGAGTCAAG CTTCTTCACA ACCTGAAAAC AAAACAGAGC CGGGACCGGT GACGGAATCA CGGCACTTTA CTTCGTTTAC TTCGGGTGAA AACTCGGACG ACCTGGATCT GAATGTCAAT CCGCCTGTTT CAGAGCCGCC AGCAGTTGCT AAACGAAGAG AACGACGAAA GCTCCGGATG GGTTCGTTTG GATCGTTGGG AGCGATGCCG GCTTCCGTTA GTCAAACGCT TCGAGAGCAC GAGGCGGCGC AGGCGTCTGG TATGCACCGC GTGACTTCTC GCGAAGATAT CGGCACGAAG CAGCTCATGG AAGGTGTCAT CAATCAACGA GGGAGTGCTG ACAATTTGGC CTGGCTCGTA AGCCCTACGA AATCGCCGCG ACGTGGTAAA CACAATCGAA TGCCTACGAT GGAGGACTTG CCTGTGGAGA ATATTGCGCT CACTCCAGCT GTGAAGAAAG CGCTCGCGAG AGTAAAGTCT CTCGAAGACG CGGGCGAGGC CGTTCGCTGC GGCGGCCGTG GCAGTGAAGG TGTCCAAATT CGCGCCGAAA CATCGATCGC GCACGATTGG TGGGTCGGCC TGGACGTTAC GCACCAGTTT AAAGGAACGG GATCCAAACC TAATCCTCGC ACTGATGGAC GACCACTAGA AGTTCCAGAG CGACAAACGA TGGAGGGCGA AGAGCCGAAG AGAAATCTGA CATCTTTATT CGACGATCCA GAGTCTCCTC TCAGCGCGAC GTCTCCGGTG ACGCGGGCGT CCTTGTGCGT GGGCGATTAT CTACAAGAAG CTCCGTCGGC GGTGCCCGAT GTTGACAGTA TACGCACCGT CAACGCGTTT GATACGAACA CGAATCAAAT CGTGAGTGTC TGGGCGTCCA GCTACACGGT TTTAATCACC GATATCCCAT TCGTTCGAGC CATGGGCGAA AATGGCGAAG AGGTGAGAGT ACGCGGCTTG CGAGAAGTTG AGGCGACGCT CGAGTATATT TACGGCGATG AGTTCAGAGG CTTAATACCC ATCTTTGATC ACCGTCCAGC GGATGCGCTT TTGGACAGCC GTGATGAATG TAAGAATATG ATAACGAGAA TTAGGATGTT AATGGCGCGC GAGGGCATGA TACCGGAAAC GCAGTCGCCG AAGGCGTTGA CGTACAAATT TGGGACGAGA CTTTGCGAGG TTGAAAAAGG AACATTGCAA TTGGGCCACA ACTGGAAAGA ATTCAAAAAA GCTTTCGCCG ATGTGCTTAG GCCTCCGAAA AATTTGCGTA AGCTCGCGTT GTCAGAGCAG GTTGCGGTTT TGGAAGCGCA GCTCGAAGCA ATTGATGAGG CAATCATCTG CGTGCGCAAG ACGACTTGGG AAGGGTCCCC GGGACCTTGC GCCTTTGCAG TCTTTGAAAA CCAGGTCTCC GCGAGTACGG CAGCGCAGTG TGTTATTTCG CGCGCGTCGC ACAGAGTATA CCGCGCTCTA CCCGCACCCG GTCCGGATGA CGTCAATTGG CCGACGCTTC TACACAATTC GACAGATAAC CGTGCCCGAG CGCTCGCTAT CTGGCCCTTC ATCATAGCTT TGATGATCTT CCCGACCGGT ATGTTCGCGA CAGCGGTGAC CAGTGTGTGT CAGGTGCGCG AAGGAGATGA TGTTATCAAT AGCGGGGCAG CTTTGGACTG GTATTGCTCG GATGATGCCA AGGTTTACGC CGCGATAATC TCGGGCGTCC TCCCACCGAT TATCCTTACG CTCTGGGAAG TTTTTGTCAT CTCCTTCTAC ATGATGTATT TGGTGCAGAG ACAAAACGTT CATGTATCGT TAGCCGCTAC AGATCGTCGA TTTCTACGTT TCTATTGGGC GTGGGGGGCG TTGAACGTAT TGCTTGGAGG TATCTTTGGC GGCGCGTTAA GTCTCTTCAC GACGACGCTT AGCTCGAGCA ACGTTTCGTT GAACGAGGTA CAGTTACAAT TTGGTCGCGT CTTGCCGCTG AGCAGCAATT TCTTCTTGCT TTTCATCGTT TTCCGCGCAA TCTACCTCCC CGTGCAGCGC CTGTTGCTCC CCCACCCGGG ATCATTCTGT CTGGCGGCTG ATATTTTCTG GTGCGAAAAA CGCGGCTCGT GCGCGCGAAC GTCGCGAGAC AAGACTCGTC TGTACTCTCC ACGAGCCGTT CGCATGGGTC GCGAAACCGG CGTCTTTATG CTCATCATGG TCATTGGTTT GACATTCGTT TGCATCGCAC CGCTCATTCC ACTCGCCGCC GCCTTGTTTT ACATCACCAA CTTCGTCATC TGGCGTTACC ACGTCTTGTA CGTGTACGAG CGAGGATACG AATCAAACGG TTCGGTATGG TTCACGTTTA CTCAGCTCGT CATCCTGTCT CTCGTCGTCG CCCAGACGTT CTTGTCCTGC GTGCTATTCA GCAAACAGGC GTACATTCAA GGCGCCGTTT TGTACGCCAC CGTCCCCTAT TACCTCTTCA AGGTCTACAG AAAGTTCCGC GCCGAATTCG GCAGCGCGAG CTCTTGGGCC GTCCCTCTCA GCGAAGCCAC TGCGGCGCCG CCGACAGATT TCGGTGGCGA AATCTACACC CATCCCGCGC TTCGCCCCGC CGCCTCGGGA TGGTTCCCGG ATATCGGCAA GGTCTGGCGC GGTTATCCGG GCGTGACGTC CAAGAACAAC TGATTGTAAT CAATACAC
|
Protein sequence | MSGASSGSGC SGTSCVRFAT SDRDIYVGLG IYATIAFLGL LLFGRMRHVM PIYFGRLRLR NLTKPPPPFH SRKRDGTTKD GVFKRVMRYY FGWIPHILRV DDKTLIQTAG LDAFAFLRVC QFGLQLFVPL SIFSMMILLP IHVNGDDMVR QHAQYIVAKV NTTAEVPGGL ILTTVANIPG KQGVLWLHTV GMWLMVLYTT WLLKQHSATF VVLRTLYLTT RGDTNLWRVV HQPSNFIEQL LVQGTQAGAE IDDDELRKMK SANSDDINAA CDALAAMDEY EDEQRGLLKR AVDVTSHKAK SLLTRAASTR LVDKRLEDKE RRQSLHELDT EPSPTHAYGD TLASSTVKIS PFASTPHLSP FGGPTARRLT PPQVGNVAQG LKGAGTLFKS PASADADDRQ MARRPSVPVT PTLGRASTSQ ASSQPENKTE PGPVTESRHF TSFTSGENSD DLDLNVNPPV SEPPAVAKRR ERRKLRMGSF GSLGAMPASV SQTLREHEAA QASGMHRVTS REDIGTKQLM EGVINQRGSA DNLAWLVSPT KSPRRGKHNR MPTMEDLPVE NIALTPAVKK ALARVKSLED AGEAVRCGGR GSEGVQIRAE TSIAHDWWVG LDVTHQFKGT GSKPNPRTDG RPLEVPERQT MEGEEPKRNL TSLFDDPESP LSATSPVTRA SLCVGDYLQE APSAVPDVDS IRTVNAFDTN TNQIVSVWAS SYTVLITDIP FVRAMGENGE EVRVRGLREV EATLEYIYGD EFRGLIPIFD HRPADALLDS RDECKNMITR IRMLMAREGM IPETQSPKAL TYKFGTRLCE VEKGTLQLGH NWKEFKKAFA DVLRPPKNLR KLALSEQVAV LEAQLEAIDE AIICVRKTTW EGSPGPCAFA VFENQVSAST AAQCVISRAS HRVYRALPAP GPDDVNWPTL LHNSTDNRAR ALAIWPFIIA LMIFPTGMFA TAVTSVCQVR EGDDVINSGA ALDWYCSDDA KVYAAIISGV LPPIILTLWE VFVISFYMMY LVQRQNVHVS LAATDRRFLR FYWAWGALNV LLGGIFGGAL SLFTTTLSSS NVSLNEVQLQ FGRVLPLSSN FFLLFIVFRA IYLPVQRLLL PHPGSFCLAA DIFWCEKRGS CARTSRDKTR LYSPRAVRMG RETGVFMLIM VIGLTFVCIA PLIPLAAALF YITNFVIWRY HVLYVYERGY ESNGSVWFTF TQLVILSLVV AQTFLSCVLF SKQAYIQGAV LYATVPYYLF KVYRKFRAEF GSASSWAVPL SEATAAPPTD FGGEIYTHPA LRPAASGWFP DIGKVWRGYP GVTSKNN
|
| |