Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_15365 |
Symbol | |
ID | 5002123 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | + |
Start bp | 251699 |
End bp | 253669 |
Gene Length | 1971 bp |
Protein Length | 656 aa |
Translation table | |
GC content | 59% |
IMG OID | 640417544 |
Product | predicted protein |
Protein accession | XP_001417717 |
Protein GI | 145346485 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.138212 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.727181 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACGA CGGAGGAGGG GTTCGACGCG AGGGCGTATC TCGTCGCGGC GCACGGCGAG CGAACGCGCG AAGAGCTGGC GAGAGGGGCG ACGAGGCTGG AGGCGGAGAT CGACGCGGTG CGGGCGTCGA CGCGAATGTC GGCGGCGGAG GAGTTGCCGA CGGTGCTGGC GTGCTTGGAC GCGATGGAGG ACGCGAGAGG GGTGCTGCGA AGAGGGCGAG AGGAGGCGGG CGAATTCGGC GCGACGGCGG AGTTGGAGGC GCGACTGTCG CGAGCGTGGA AGAGCGCGAG GGAGAGCTTG AGGGAGGTCT TTGCGATCGA GGAGAGACGG GAGAAGATTG CGCGCGCGCT CGAGGCGATG GAGCGGCACG AGGACGTGTT CGGGATTCCG GGAGCGGTGC GAGAAGCGCT GTCGCGAGGA GAGTACGCGC GCGCGGCGGA GACGTATCGT CGCGCGCGCG CGGCTTTCAG CGGCAAACGC TCGCGCGTCT TAGATGCGGT CTTGGATGAA GTCGAAGAAA ACGTGAAATC CGCGGAGGAG CGCATGTACG AACGCTTGTA CGTGGGAGAC CTCGACGACG CTCACGCGGA AAGAATCGTC ACGGCGCTGC AGACGTTGAA ACTTTGCAGG CCCGCGTTGA CGTCGAGTCA GGGTGAAGTC ACGGCTGCCG GTAATGCCGT GCATATTTAT TTAGATAGAT TAGTGGAGTA CGCGTGCGAG GAACTGACGA ACACGGCCTC GAGCGATGAC TTTGACGTCG AGACGCTCAG TCGAGGATAT CGCGCGCTTT TCGTTCGCGT CTGGCGCTTC GTGACTCTCA TGGACACGTG TGCGTCGTCA TACGCGCGTG ATGCGTCGAC TAAGATCCAA TCGGTATACG TTGGTTTCAT GAAATCCAGG TTTGACAACA GTCTCAACAA GCGGACAATT GAAACTGACG CCGAGCAAGC GAATCGTCGC TTTGACGTCT TAATTGACAA GTGCGCGAAA ATGTCTTGCA TCGGCTTTTC GTTATCGTAT TCATACGATA TTCTTGGCAC GCGATTAAGC TTACAACCGG ACTTGCTCGA GGCACTGCAG CAACAATACA CGCGTTTTAG CGTCAGTTTG CGCGTGCACT TGGAACAAGC GCTCAAGTTG GCGGCGCAAC CGCTCGCGCA AGACCAACGC CTCGAGACGA CCACACAGTC GTTCTTCCGT GATGCACGCG TCGTATTTCA AATCACCGCG GAATACTGGC TCGATGAGCG GTTCACGCCT TGGATGATCA ACGCCGGTTC ACGAGACGTG GGAAGTTTGA TTGACACCTT TTACGACGCT GCTCGCTCGC TCGTGGCGCT GGCACGCGAA CTTCGGCGTG GTCCGTTGGC GTCGTTGGCC GCTCTGAAAC AGATCGAAAA TTGGTGCGCG GTATTTTTCG ATGAGTTCAA TTTTACTGGG ACCGGAATAG GTAACGAGGC GAATCGCGTG GCGTTCAGAG ACGACATCTC ACGAACCACG CAAATGTTTC TGGACGAATT TGTCGGTGGC GAAATGAGCG CAATTATCGT CGCAGTGCGC CGTTGGTTCG CAGCACCGGT GGAGAAAACG TTAGAATCAC GCCCGGAATG CGTCGATGTC TTGCATCGGG TGCGTTCAAC GTACGAGTCT GCGACGTCCA CGGTGCCCGA ACTCGCCACG GCCATCTCGC AAGACATCGC GGCGCGACTC GTCGACGCGT TACGCTCCGA GTTCACCTCG AATCTCAGTC AGCTTAGACC GACGGCGAGT ACACTTCGCG TAGAGTTCGA GCTTTTGAAG CTCGCGCTGG ACGCGCGCTC GACGAAACAG GCGCGAGACG GCGCGTCTCG TCTCGTCGAT TTGGCGGCGC GCGTCGCCCC CGACGACGAC GCCATCGCGC GCGCGCGGGC AATCGCCAAC GACGCTCAAA AACACAAACA CCTCCTTGTC GCGCTCGGGA GATTAGCCTA G
|
Protein sequence | MTTTEEGFDA RAYLVAAHGE RTREELARGA TRLEAEIDAV RASTRMSAAE ELPTVLACLD AMEDARGVLR RGREEAGEFG ATAELEARLS RAWKSARESL REVFAIEERR EKIARALEAM ERHEDVFGIP GAVREALSRG EYARAAETYR RARAAFSGKR SRVLDAVLDE VEENVKSAEE RMYERLYVGD LDDAHAERIV TALQTLKLCR PALTSSQGEV TAAGNAVHIY LDRLVEYACE ELTNTASSDD FDVETLSRGY RALFVRVWRF VTLMDTCASS YARDASTKIQ SVYVGFMKSR FDNSLNKRTI ETDAEQANRR FDVLIDKCAK MSCIGFSLSY SYDILGTRLS LQPDLLEALQ QQYTRFSVSL RVHLEQALKL AAQPLAQDQR LETTTQSFFR DARVVFQITA EYWLDERFTP WMINAGSRDV GSLIDTFYDA ARSLVALARE LRRGPLASLA ALKQIENWCA VFFDEFNFTG TGIGNEANRV AFRDDISRTT QMFLDEFVGG EMSAIIVAVR RWFAAPVEKT LESRPECVDV LHRVRSTYES ATSTVPELAT AISQDIAARL VDALRSEFTS NLSQLRPTAS TLRVEFELLK LALDARSTKQ ARDGASRLVD LAARVAPDDD AIARARAIAN DAQKHKHLLV ALGRLA
|
| |