Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_42874 |
Symbol | |
ID | 5003420 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | + |
Start bp | 276116 |
End bp | 277921 |
Gene Length | 1806 bp |
Protein Length | 572 aa |
Translation table | |
GC content | 63% |
IMG OID | 640418841 |
Product | predicted protein |
Protein accession | XP_001419116 |
Protein GI | 145349386 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0187431 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGCGC TCGTGCGAAG GCAACGCGCG CTCGAGGACG ACGCGGATCG AGCGCGCGCG GCGTTTCCGA CGGCGTTCGG GCGAGGCGGG GGGGCGGGCG CGACGGCGAG GGACGTCGTG GACGACGGCG CGCGCGCGAC GGCGGCGCTG GGCGCGAGGG ACGACGACGA CGACGACGGC GGCGGCGGCG GCGGCGGCGG CGGCGGCGAG GCGGACCGGG GAAGCGGCGA GGAGGAATAC GAGGAAGAGG ACGCGATTCC GCGCGCGGCG GAGGCGGTGA TGGAGGGATT TAAGAAACCG GCGACGTGCT GCGCGGTGGA TCGATCGGGG GCGCGAATGG CGGCGGGATC TTCGGACGGC GTCGTGCGGT TGTACGATTT CAACGGGATG AAGCGGGATT TACAACCGTT TCGGTCGATC GCGCCGCGCG AGGGGTACCC GATACACGCC GTCGATTGGT CGCCGACGGG GGACATGTTC GTCGCGGCGA GCGGGAGCTG GCAGCCGACG GTGCACGACA GAGATGGGGT CGAATTGGGG GAATTCGACA AGGGCGACAT GTACATCAGA GATTTGAGGA ATACCAAAGG ACACGTCGCG GCGACGACGG ACGTGAAGTG GAACCCGCTG GATAAGGAGA CGATTTGCAC CGCGGGAGAG GACGGTGCGC TGCGTTTGTG GGACGTCACG TACCTGGGCG ACGCGCGCGG GTCGCAAAAG GCGGTGCTCA AGCCGCAGCA AGTCAAGCCG GGGCGCGTGC AAGTGACGTC GTGCGCGTAC TCGCACGACG GAGATTTGAT CGCGGGTGGA ATCACGGACG GTAGCGTGCA AATCTTCTCG TCCAAGGGCT CGCAGTATAA ATCCGCAACC ATCGGCTTGG TCCTACCGCC TTCGCAGCAG TGCAAACTCG ACAACCACTG GACGTTCAAC GGACGACCGA GTCATCTCTT CAAAGGCGCG CATCCGGCGG GCGAGGAGGT GACGTCGCTT TCGTTCGGTA GGGACGGTCG CACGCTGCTT TCGCGTTGCG AAGACGGTAC GCTAAACGTG TGGGATCTGC GCAACGTCAA GGCGCCCTTG AAGCGGTTCG AAGATTTACC GACGCGCCAC AGCGAAACCA CGGTGGGTTG GTCGCCAAAC GACGTGTACT TCTTCACCGG CGTCGACGCC GAGCGCGACT CGCGCGGCGG TAACACGCAG GGCGGTCTCT GTTTCTTCGA CAGAGAAAAG CTCGAGATGG TGCATCGCGT TTCGACGCCG ACGAATTGCA TCGCGGCGAC TTGGCATCCG CGTCTGAACC AAATCTTCGT CGGCTGCGGC GACGCCAAAG GCGGGGAGTT GCGCGTGCTC TACGACCCCA AAAAGTCCAT GGGCGGCATC ACGCAGGCTG TGGGTAAAGC GGTGCGTAAA AAGGCGGATG ATTTCGTCCG CATCGATGTG CAAGAAATTT CTTACACCCC GAACGCGCTA CCAGCGTTCA AGGAACAAAT GCCGGGCAAA CGCAAGCTCG ACTCGACGGA CATCGCGCGC CAGGCTCTGC GAAAGAACCC GAGCAAGGCG GTGACGAATA TGGACAAGAG CGGTGTGTTG ACCGGGGGCA CGGGAGCATC GCTTCTCACG CAGCACATAA TGCACAACAA CGAAGAGCTC GGCGAAAAGA ACTGGATGAA GACGGACGCG AGAGAGTCCA TCCTGCGGCA CGCCGAGAAA GCCGCCGCGA ATCCGATGTT TACGAAAAAA GCTTACGAGC ACACGCAGCC GAAAGAAATC TGGCGCGAGG AAGAAAAAGA AGGAGATTCC GATTGA
|
Protein sequence | MAALVRRQRA LEDDADRARA AFPTAFGRGG GAGATARDVV DDGARATAAL GARDDDDDDG GGGGGGGGGE ADRGSGEEEY EEEDAIPRAA EAVMEGFKKP ATCCAVDRSG ARMAAGSSDG VVRLYDFNGM KRDLQPFRSI APREGYPIHA VDWSPTGDMF VAASGSWQPT VHDRDGVELG EFDKGDMYIR DLRNTKGHVA ATTDVKWNPL DKETICTAGE DGALRLWDVT YLGDARGSQK AVLKPQQVKP GRVQVTSCAY SHDGDLIAGG ITDGSVQIFS SKGSHHLFKG AHPAGEEVTS LSFGRDGRTL LSRCEDGTLN VWDLRNVKAP LKRFEDLPTR HSETTVGWSP NDVYFFTGVD AERDSRGGNT QGGLCFFDRE KLEMVHRVST PTNCIAATWH PRLNQIFVGC GDAKGGELRV LYDPKKSMGG ITQAVGKAVR KKADDFVRID VQEISYTPNA LPAFKEQMPG KRKLDSTDIA RQALRKNPSK AVTNMDKSGV LTGGTGASLL TQHIMHNNEE LGEKNWMKTD ARESILRHAE KAAANPMFTK KAYEHTQPKE IWREEEKEGD SD
|
| |