Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_27700 |
Symbol | |
ID | 5005656 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009369 |
Strand | + |
Start bp | 27049 |
End bp | 29976 |
Gene Length | 2928 bp |
Protein Length | 975 aa |
Translation table | |
GC content | 62% |
IMG OID | 640421077 |
Product | predicted protein |
Protein accession | XP_001421552 |
Protein GI | 145354565 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.584028 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCCGC CGTCGGCGCC GAACGACGAG CTCCTGTCGA CGATACTGTT TTTGATCAAC AATCACTACT GGTGCGCGCT GTACGAGCTC TACGTCGACG CGCGAAGGCT GAATGTGTTG GATGCGGATT TGCCTTCGCT GAACACTTTA AAGTCGTTCT TTAACGACTC AAAGCGGTTC CCCGACGGCG TGTTTCGCGC GGTCAACGAC GCGTTCGACA GTGGGGTGTC GGCCGCGCGC GTGGTGGACG CGGAGTCGCG CGCGGCGGTG GCGGAGTACG AGTTGAGGTG CGCGCGGGAA GATTTGCAAA ACTCGCGGCG AGGCGCCGGC GGAGGCGGAG GCGAGGACGG TGGCGAAGAC GCGAGCGGGG CGAGCGTGCG GCGAGGCGTG GACGCGGATG ATGCGTTTGC GCCCACGAGC GGTGATTCCG CGCGCTCGAG CGCGGATGAC GTGGGTTTGG ATGCGAAACT AGACGCGGCG ACGTACGAGT ACCTGTCTCG TCGCGGATAC AAAGCCACGG CGCTTTCGAT GCGCGACGAA TCGTCCACAG CGGCGAAGTT GATGGAAAAA GATTTCGCGA GCGATGGAGA AAGAAGTTTC GGGGCGCTGC GAAGGATGTA CGAGCGCGCG AGAATGACCG AGACGACGGC GAGCGCCCTG GAGAGCGAAC GCGCGCGCGT CGACGACGTG GAGAGCGAGC TGATTCGCGC CCGCGCGCGA ATCGATGAAC TCGAACGCGA AAATACGACG ACGACGACGT TGCGCGACGC GTTATCGTCG AAATTAGCGC TAGCAGAAAG TGAGCTGAGC GATTTGAAGG TTTCGGCGGC GCAGTGGGAG AAGAAGGCGA CGGAATCCGC GAGCGAGGTG AATAGGCTGT TGTCTGAACT CGGCGCTGGG AACGAGGAAT GGTCGAGTCG CGACGGCAAC GGAGTGCCGC GAACGACGAT TGAAGAGAAC GATACAATCG ACGCGGTCTT GTCGTTCATT TGCGCGATCG CGCCGAAGGT TTCGCCGGCG GCGCGGAAGG AGTTGCTGCC GATGATTTCC CGCGCGTGCG TGCGAGCCGC CGGCGATGAA AAACGCGCCG CACGGTCTTC GAATCTGTTT TTTGAGCTCT TCAAAGCCCC GAATGCAGAA CAGAGAGATG CAATCGTCGA CGCGATCGCG AACGTTGGCG AAATCGTGGG CATGAACGCC TTCGAGGGGA CATTCATTCG AGGCTGCCTC GGATCCGAAG CCGTGGCGAC GATGAACGAA GAGCGACGAG TGCTCGTGCT CGACGCCATC GCCAAGCTCG GCGCATCATC GTGGTTTACT TTGAATAATT TTGTCATGGA TGGTTTCAAG CGTGCCGCGG TAGATCCAAG CGATGGTGTT CGAGCGGAGT GCTCACGAGC GGTGGAGCGT TACATCGCCG CAAACGCGCC GAGCGACGAC AACATGGAAA CGATTGAAGA CGTTTTGATG ACGCTCGCGT GCGACGGCTC CGACGAAGTT GCCGACGCCG CGCGAGCGAC GCTCGCCCCC GCCGTCGCAT CTTGGTACCT AGGTGCGAAT CCACGACGGT TCACGGACGT TTTCGCGCGA AAGGTGCTCG ACAAAGCCGC CGAGGCATTG CGGAGCGGAT GGACCGGCGA AGGCGCCGAG CGGGAGTTCA AGGGTTGGGT GTCTCCAGAA GATGGTGATC GTCATCGATG GCACGCGACG AGCCTAGTAA AGACGTTTGA AGCGTTTGCG CGGCCGATTC GCGAGGCGTT AAGCGCGACG AAACCCGCGT CTATCGCAGA CGGCGTCGAC GACGCGCTGA GCAAAGACGC TCCAGATTCC TGGCCTTTCG CGCAGTGGTG CGTAAAAGAA GCCTCAGACT TGATTGTCCA AGTCATTAGC TCCACCGCAC CCGATGTCGT CGGTCAAGAG TCCGTGAGAG AGAGCATATG CGCCGCCGTC GCGTCTTGGT GCGGCGTTCT CGGCGCGCTC GCCACGCGAG CGGTTTTGAT TGGAAAAGTG AACGATGCTT GCTTGGTGAG TTACGATCAG CGTCGCGCGG TGATGCCAAT ATTGCTCGCG GGCGTGGTTC CTTACACCCC GGACGGTGGC GCCGTGCTCG GAGATTACAT TAAGCGCCTG ATTCAACAAG CCAGCGCGGA GTCGTGCGAT GAAATCATCG ACGCCGCGCG GTACCTCGCC GCGTTCGAGC AGCACACCGC AAACTTGCTC GATGCGTTGA AGCGGTGCGC GCATCCGATC GCTGAGAACC CACCGACCGT GCGCCTTATC ACCGCACGTC TTCTCGCGGC GACGTCGGAG ATTTTGCCGC TCAAGCACGT GCTCGAACAC GTCTATCCCG CGCTCAACGT TTTGCGAGCC GATCCCGAGA CGAGTGTCCG CCGCGAAACC GCTCTAGCGC TCGCGGTGTG CGCGTGCACG CACTACGAGA CTATCGAACC GACGACGCAA ACGATGCGTC AGCTCGAGTC GTTGATCGGC GACTCCGACG TCGGCGTCCG CGTCGCCGTC GTCGAGGCGA TGTCCCTCGG CGCGTCGGTG CCCGGAACGT CGTTCGCCGT CTCCGCCGCC AGCGCGCTCT CAGCCATCGC GCAGCTGCCG AGTGAAGAAC ACGAGATCGC GCACGCGCTT TTCGCGGCCA TTCGCGACAT GCTCGGCGCG GATGGCGATT TGTTCCCTCA AATCTCCCCC GCCCTCGTCG CCTTGCTCGC GAGCGGAGGC CTCGACCAGG GGCGTCGCGC CCAAGTCGAA ACCATGCTTC GCGACGGCGG CTGGTCCCCC GACACCGCGC CATCGAGCAT CACCAACGTC ACCGTCGACG CGCCCCACGC ACCGGGACGC GGGAGGTCCT CCGCGTTCGA TCGTATGAAG AGCGTGGGCC GTTCCGCCTT CGGCGGCCGC GGTCGCGACG CTCGATGA
|
Protein sequence | MAPPSAPNDE LLSTILFLIN NHYWCALYEL YVDARRLNVL DADLPSLNTL KSFFNDSKRF PDGVFRAVND AFDSGVSAAR VVDAESRAAV AEYELRCARE DLQNSRRGAG GGGGEDGGED ASGASVRRGV DADDAFAPTS GDSARSSADD VGLDAKLDAA TYEYLSRRGY KATALSMRDE SSTAAKLMEK DFASDGERSF GALRRMYERA RMTETTASAL ESERARVDDV ESELIRARAR IDELERENTT TTTLRDALSS KLALAESELS DLKVSAAQWE KKATESASEV NRLLSELGAG NEEWSSRDGN GVPRTTIEEN DTIDAVLSFI CAIAPKVSPA ARKELLPMIS RACVRAAGDE KRAARSSNLF FELFKAPNAE QRDAIVDAIA NVGEIVGMNA FEGTFIRGCL GSEAVATMNE ERRVLVLDAI AKLGASSWFT LNNFVMDGFK RAAVDPSDGV RAECSRAVER YIAANAPSDD NMETIEDVLM TLACDGSDEV ADAARATLAP AVASWYLGAN PRRFTDVFAR KVLDKAAEAL RSGWTGEGAE REFKGWVSPE DGDRHRWHAT SLVKTFEAFA RPIREALSAT KPASIADGVD DALSKDAPDS WPFAQWCVKE ASDLIVQVIS STAPDVVGQE SVRESICAAV ASWCGVLGAL ATRAVLIGKV NDACLVSYDQ RRAVMPILLA GVVPYTPDGG AVLGDYIKRL IQQASAESCD EIIDAARYLA AFEQHTANLL DALKRCAHPI AENPPTVRLI TARLLAATSE ILPLKHVLEH VYPALNVLRA DPETSVRRET ALALAVCACT HYETIEPTTQ TMRQLESLIG DSDVGVRVAV VEAMSLGASV PGTSFAVSAA SALSAIAQLP SEEHEIAHAL FAAIRDMLGA DGDLFPQISP ALVALLASGG LDQGRRAQVE TMLRDGGWSP DTAPSSITNV TVDAPHAPGR GRSSAFDRMK SVGRSAFGGR GRDAR
|
| |