Gene Information |
Locus tag | OSTLU_15108 |
Symbol | |
ID | 5001513 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | + |
Start bp | 611534 |
End bp | 614497 |
Gene Length | 2964 bp |
Protein Length | 987 aa |
Translation table | |
GC content | 52% |
IMG OID | 640416934 |
Product | predicted protein |
Protein accession | XP_001417297 |
Protein GI | 145345608 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
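The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only a consistency check on the values listed in this record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single untranslated stop codon; neither assumption is stated in the record itself, and the variable names are illustrative.

```python
# Values copied from the Gene Information table.
start_bp = 611534
end_bp = 614497
gene_length_bp = 2964
protein_length_aa = 987

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene is an unspliced CDS (Is gene spliced = No) whose final
# codon is a stop codon that does not contribute an amino acid.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under those assumptions all three checks agree with the record: 614497 - 611534 + 1 = 2964 bp, and 2964 / 3 - 1 = 987 aa.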
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.000868125 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTACGA GTAAGACGCG AGCGGGAATT CGACGCGAAC GGAGGGCGCG AAAGCGACGC CAAGATGAAG AGGCGGCAGA GAATTGCGCG TTGGAGACCG ATTCCACCAT GCTTGAGTCG AAGTGCATGT CCGAAAAGCC GCTCACCGCG AACCCATTCA TTTCGCACAT CGACGTGAAG GAAGTCATTC GATTGAGCGA CGTCTATCGT TGTTCACAAG ATGTCTCAGT CGACGGGGAT GGCGAGACTA GTGGACTGAC GAGCGAGGAG GTGACTTTGT CGCCTGCCTC GACGGAATAT CTGCAAACGT CGAGAAAGCG CTCGAAGTCG TACAGAAAGC CGAGTTGGCT TCGTAAGCGT GAAGCGCGGC TGAACTCAAC GCATGCTCAG AGTTTGCGGA CGCGAGAAGT GAAGCAAATG ACCGCCGGCA ACATCGTGAA TCGACAAAAG AATGCGAGTT TGGGTAACGA GGTGAAACTT GCTGACGAGG TCTTCGATCG AGGGGTGTTC ATGTTTCGTT CAAGCTTCAC GAAGAAACCA GGCTTGCCGG CGACGCATAC TCTGCGATCG ATCGGCTTGG GCGTCCGCGC ATCAAGGCGC CTTTACCTGA CGATATTTAA GGCAAAAAAG CTTACGAAGG CTACATCGAA GCACAAGACA GATCGCGCTT TGCGGCAGGC AAAGAGTACG TATAGAATTC CGCAAAAGCA CCGCGACACA CTTTTACCGA CGTTGCGGGC GATGTTGGAC AAGGTTCGAT GTTGCCCATT CACGAACATC CTCAACAAGC ACGCTCCCAT GCCGCGTGAG CTAATGGTCT CCACGAAACG AGTCGGTGAA TTGAGTCAAG CGTCGCTGTT ATCTGCGTAT ACGGCACCAC GAAGAGTAGC CGCCTTTGTT TGGGAAGTCA TTGAGCGCCT CGTGCCCGTC GAACTCTTGG GGTCGGTCCA AACACGAAAA AGCCTGTACG GTTGCATTCG CCGAATTGTG TCGTTGCGAA GATACGAGCG GTTTACTTTG CATGAAGTCA TGCAAGGCAT CCGAACGGCA GACTTCAAGT GTTTCAAGGC GAAAACGACC AATGCGAGTA ACGCAGGACA GGTGGAGGCG AGTCAGCGGA GAATGGTCAC CAAATGGATC AGCTGGCTCG TGCAAGAGAT GGTCTTCCCA ATCATACGAT GTCATTTCTA CGCCACGGAT ACGCAAACAC ACAAAAATAG GATATTCTTT TACCGGAAAG GGATTTGGTC TCGGCTCGTC ACCGCCACAC TACAAAGCTT GGAAGAAACG TCGTTCAAGC GCCTCACGGC GAAGGAAGTG ACAGCGATGA TGGATAGATC GAATGCGTCC AACTTGGGGT TTTCGAACAT GCGATTTTTA CCGAAAGGCT CGGGGTTACG ACCTCTAGCG ACGCTGAACA AACCGACGAC CTTTGTCGTC AAGGGCGGCG GGAAGAATTC AATGCGGTGG TTGAAGAAAT TCGATGCGAT AAACAGACGG TTGAAAGACG TGCAGGATAT TCTCCACCAC GAGACAGTTC GAGATCCGAA CGTCTTAGGC GCAGCCGTCG GAGATTATAA GGCGGCTCTT CTTCGTCTTG GACCGTGTCT ACGCGCGATT CGTCGCCAAA AGTTACTTGG ACGGCGTCCT CAGGCGTACA TTCTCGCGAC TGACATTAAA GGCGCTTTCG ATAATCTTCC GCTCGAATCT CTCGAGCGCG TGGCCTTGGA CCTAATCTCA AGCTCATCAT ATCAAACGCT TCATTATGCC GTCGTGAAAC AAAACGGTGA GGCAAAATAT 
AAAAAGTTGG TGTCTTCCCT TCCGACCGCG GCGAAAGGCG AAAGAGTGCC GGGACCTCTC GTGACTGAAG CGCAGCGCGT TTCGAACACC GGCGGTGCGG CGATTCTTGG TAGAGAGAGC AGAAACTCGA TCGTCATTGA CGCAGGATTC GCTAAGGAAA TTTCCGATGA AAACATAGTG CCGTTATTGC GCGCACATCT ACGAGAAAAC ATCATCTCTG CGCGCGGCAA GTTCTTGTTA CAAACCGTCG GAATCCCGCA GGGTTCGATT GTATCTCCGC TTTTGTGCAG CCTATTTTAC GGCCACCTCG AACACGAGTA TGGATTATTG AGCGGTTTCT GTGGCGAGCA ATCAACCGTG TGTCGATGGA TGGACGACAT GCTTCTCGTC ACCACCGATC TTGATCGGGC CAAAGCGTTC GTCGACATGT GTAAGAAAGG TTTCGAGGCG CACGGCTGCA CTCTAAACAC CACAAAAACG CTTTCAAACT TCGACCACGG AGAAGACGTA CGACAAAGGA CATTTGTGAA CGCGGATGGG CGAAGTTACA TCCCGTGGTG CGGTATATTA ATCGATTGCC TAACGCTAGA AATAATGGTG GACTACTCGA GGTACTCGGG AGAATTCGTT CGAGAGTCGA TGAATCTACC GCTCGGGCGC AAGGCGTGGA CGCGTCTACC CGAACGCATT TGTGGCTTTT TGAAGCCAAA ATGTGCGGCT ATATTTTTCG ACGAGAGTAT CAACAGCAAA GTCACCGTGC GCGTCAACGT CTACCAGCTC TTCCTCATGG CGGCGATGAA AACACACAGC TACGTCGCTG CGACGAGCGC GATTCCGGGA TGCGAGCAAA TCTCATCGCT TGCACTTTAT CGCGCGATCA AGCACGCCGT TCGTTTCGGA CAAATATTGA TACAGCGTCA GAGTAGCAGC GCGCGCGCGC ACTGCAGTAG CGTCGGGAGA CTGCCGGACT CGCACGTTGA ATTTCTCGCT ATCCAAGCCT TCTTGAAAGT CCTTCAACAA AAGCAGACGC GATACAAGCG AGTAATTCAG CTATTGAACT CGCGACTCAC CTCACAAGGC ATGAAACGAA CGTGCAGAAA CGGTCTGCTC ACACAAGCGA TGGAACCATC GCGAAACACG ATTTTCACAC ATATTAGATT TTAG
|
Protein sequence | MATSKTRAGI RRERRARKRR QDEEAAENCA LETDSTMLES KCMSEKPLTA NPFISHIDVK EVIRLSDVYR CSQDVSVDGD GETSGLTSEE VTLSPASTEY LQTSRKRSKS YRKPSWLRKR EARLNSTHAQ SLRTREVKQM TAGNIVNRQK NASLGNEVKL ADEVFDRGVF MFRSSFTKKP GLPATHTLRS IGLGVRASRR LYLTIFKAKK LTKATSKHKT DRALRQAKST YRIPQKHRDT LLPTLRAMLD KVRCCPFTNI LNKHAPMPRE LMVSTKRVGE LSQASLLSAY TAPRRVAAFV WEVIERLVPV ELLGSVQTRK SLYGCIRRIV SLRRYERFTL HEVMQGIRTA DFKCFKAKTT NASNAGQVEA SQRRMVTKWI SWLVQEMVFP IIRCHFYATD TQTHKNRIFF YRKGIWSRLV TATLQSLEET SFKRLTAKEV TAMMDRSNAS NLGFSNMRFL PKGSGLRPLA TLNKPTTFVV KGGGKNSMRW LKKFDAINRR LKDVQDILHH ETVRDPNVLG AAVGDYKAAL LRLGPCLRAI RRQKLLGRRP QAYILATDIK GAFDNLPLES LERVALDLIS SSSYQTLHYA VVKQNGEAKY KKLVSSLPTA AKGERVPGPL VTEAQRVSNT GGAAILGRES RNSIVIDAGF AKEISDENIV PLLRAHLREN IISARGKFLL QTVGIPQGSI VSPLLCSLFY GHLEHEYGLL SGFCGEQSTV CRWMDDMLLV TTDLDRAKAF VDMCKKGFEA HGCTLNTTKT LSNFDHGEDV RQRTFVNADG RSYIPWCGIL IDCLTLEIMV DYSRYSGEFV RESMNLPLGR KAWTRLPERI CGFLKPKCAA IFFDESINSK VTVRVNVYQL FLMAAMKTHS YVAATSAIPG CEQISSLALY RAIKHAVRFG QILIQRQSSS ARAHCSSVGR LPDSHVEFLA IQAFLKVLQQ KQTRYKRVIQ LLNSRLTSQG MKRTCRNGLL TQAMEPSRNT IFTHIRF
|