Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_88822 |
Symbol | |
ID | 5004877 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009366 |
Strand | + |
Start bp | 391947 |
End bp | 394271 |
Gene Length | 2325 bp |
Protein Length | 774 aa |
Translation table | |
GC content | 60% |
IMG OID | 640420298 |
Product | predicted protein |
Protein accession | XP_001420678 |
Protein GI | 145352705 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5028] Vesicle coat complex COPII, subunit SEC24/subunit SFB2/subunit SFB3 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.355252 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00889443 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAACTTTG TGGAGGATTT CGCGTCTCTG AACCTCGGTC CGGGCGGTGG TCCCGGTGGG GAGGCTGGAA TGGATCCCGC GACGTTTCCT CGACCGGGCG ACGACGAAGC GCGTCCGAAC TTGGAGCTTT CGTGCGATCC CAAGTACATG CGTCTGACGT GCGGGGCGCT GCCGTCGAGC CCGAGCTTGA AGACGAGGTT CGCCATGCCG CTGGGATGCA TCGTCCAGCC GCTCAGACCT GGAGATGAGA CGACGGTGAA GACGGCGCAT TTCGGGAGCT CGGGTATCGT GCGATGCCGC AGATGTCGAA CTTACATAAA CCCTTTCGTG CAATTCACAG ACGGCGGGCG GCGATTTAGG TGCAACGTGT GCGCGCTGCC CAACGAAGTG CCGGTGGATT ACTTTTGCAC GCTCGACGCG AACGGGGTGC GTCGAGACAT CGCCGAGCGC CCTGAGTTGA ACAGTGGAAC GGTTGAATTT TTAGCGAGCC AAGAGTACAT GGTGCGACCG CCGATGCCGC CGTCCTACTT CTTCGCCTTG GACGTGTCGC ACACGGCGGT GAATAGCGGC TTTTTGAAAC AAACGGTGGA GGTGATTCGG GACTCCCTCG ACGTCATGTC GAAGAAGAGC GAGCGCACGC GGGTCGGATT CTTGACGTAC GATTCGACGT TACACTTTTA TAGCCTGAAG GCGAATCAGT CTCAGCCGCA AATGATGGTA GTCGCCGAGC TCGACGACCC GTTTTGTCCG ATGCCAGACG ACTTGCTGGT GAATCTCGCC GAGTCGCGCG CGGTGATTGA TGCATTTTTA GACATGGTGT GCGATACGTA CGCGCAGACG CAAAACATGG AAAGTGCCAT GGGCCCGGCG ATTCAAGCCG CGTTCTTAGC TATGTCTCAC ATCGGTGGTA AGCTTCTCGT GTTCCAATCC TGCCTGCCCA CGCTCGGCGC GGGACGCATG ATCAACCGCG ACGACACGCG AGCGAGCACG GATAGCACGA AGGAACACCT GCTTCGCGGC CCGGTCGATG GTTTCTTCAA GAAGACCTCG GCAGAGTGCT CACGACATCA GATTTGTATC GATTTGTACA CCATCGCGGC GCCGTTCTCT GATTTGGCCT CCATGGCGGT GTTGTGCAAG TTCACCGGAG GCGAGTTGCG ACATTACCCC GGTTTTACGC CGGACAAGGA TGGGGTAAAG TACGCAAAGG AGCTGAAAAA TAATCTCACG CGCTTCACCG CGTGGGAAGC CGTGTGTCGG GTGCGATGCA GTCGAGGATT TAGAATCTGC GCCTTCAACG GGCACTTCTT CATTCGATCG ATGGACTTGC TCGCGCTCCC GGCGACGGAT GGCGACAAGG CGTACGGCGT GCACATCGCG CACGACGAAG TGGTTCCGAG CACGAACATT TCGTACTTGC AGTGTGCGCT ACTGTACACC TCCGCAGAAG GAGAACGCAG AATCCGAGTG CACACGATGG CGGTTCCGGT GGTGACAGAC ATAGCAGAGA TGTACCGCGC CGTGGACTGC GGCGCCATGG GCGCGTTCAT GGCGCGTTTG GGCGCCGAGC GCACGCTCAC GGTGCGATTG CAAGATGCGC GCGAGGCGGT GATGACCAAG GTTGTCGCCA CGTTGCGTGA GTTCAAGTTG CTCAACACGC AAGCGTCGAG AGCGTTCAAT AGGCTCATCT TCCCCGAGAG CATGAAGTTA CTTCCGTTGT GGATCTTCGC CGCGAGCAAG AGCACGGCGA TGCGAGGCGG CCCGCGAGAC GTCCCCGTCG ACGCGCGGAT CGCCGCTGTG TACGACTTCA TGTCTGCCTC GACGGAAGAA ATCTTAAAGC TGCTGTATCC CACGATGCAC GCCTTGCACA CGATGCCCGA GGAAGCGGGT ACGAAGGACG AATACGGCAG AGTGATTTTG CCGCCGCGCA CCGTCCTCGC GGGCGAGCGC ATCGACGCTC GCGGCGCCTA CCTCGTCGAC GACGGTCGTC GCCTGCTTTT ATGGCTCGGA AAGATGCTCG ACCCACAGTT CGTCGCCGCT TTGTTCGGCC CTAGCGGTCC TCCGAGCGCG GATGTGGACT GCAACCTCCC TCATCTGGAC ACCGACGTCT CGCGTCGCGC GCGCGCCGTC GTCGACGACA TACGCGCCGA GGCGTCCCGC GCGCGTCATC TCGCCCTCAC CGTCGTCATC CAAGGCCATC CGAGTGAGAC GCAATTATTC CCTTACCTCA TCGAAGATCG AGGCGCGGCC AACGTGCCCG GCGCGTCGTC CTACGGCGAG TTCCTAGTGC AGCTTCACAG GCAAGTCTCC GCCGCGCAGC GGTGA
|
Protein sequence | MNFVEDFASL NLGPGGGPGG EAGMDPATFP RPGDDEARPN LELSCDPKYM RLTCGALPSS PSLKTRFAMP LGCIVQPLRP GDETTVKTAH FGSSGIVRCR RCRTYINPFV QFTDGGRRFR CNVCALPNEV PVDYFCTLDA NGVRRDIAER PELNSGTVEF LASQEYMVRP PMPPSYFFAL DVSHTAVNSG FLKQTVEVIR DSLDVMSKKS ERTRVGFLTY DSTLHFYSLK ANQSQPQMMV VAELDDPFCP MPDDLLVNLA ESRAVIDAFL DMVCDTYAQT QNMESAMGPA IQAAFLAMSH IGGKLLVFQS CLPTLGAGRM INRDDTRAST DSTKEHLLRG PVDGFFKKTS AECSRHQICI DLYTIAAPFS DLASMAVLCK FTGGELRHYP GFTPDKDGVK YAKELKNNLT RFTAWEAVCR VRCSRGFRIC AFNGHFFIRS MDLLALPATD GDKAYGVHIA HDEVVPSTNI SYLQCALLYT SAEGERRIRV HTMAVPVVTD IAEMYRAVDC GAMGAFMARL GAERTLTVRL QDAREAVMTK VVATLREFKL LNTQASRAFN RLIFPESMKL LPLWIFAASK STAMRGGPRD VPVDARIAAV YDFMSASTEE ILKLLYPTMH ALHTMPEEAG TKDEYGRVIL PPRTVLAGER IDARGAYLVD DGRRLLLWLG KMLDPQFVAA LFGPSGPPSA DVDCNLPHLD TDVSRRARAV VDDIRAEASR ARHLALTVVI QGHPSETQLF PYLIEDRGAA NVPGASSYGE FLVQLHRQVS AAQR
|
| |