Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_14476 |
Symbol | |
ID | 5001108 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | + |
Start bp | 166758 |
End bp | 168434 |
Gene Length | 1677 bp |
Protein Length | 558 aa |
Translation table | |
GC content | 61% |
IMG OID | 640416529 |
Product | predicted protein |
Protein accession | XP_001416573 |
Protein GI | 145344094 |
COG category | [C] Energy production and conversion |
COG ID | [COG0277] FAD/FMN-containing dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGACGC GCGCGGTCGC GGTGGTGCTC GCGGCGACGC TGGCGGCGCG AGGCGCGCGC GGGGAACACC CTCGGGACGC GAGCAACTGG CAAGGAAAGC ACGCGTGCGA CGCGGCGGGC AGAGCGGCGA AGCCGCGGTC GGTGCGCGAC GTGTCGGAGG CGGTGTACGC GTCGACGGAT GCGCGCGCGA ACGGTGCGGG GCACTCGTGG CATTCGGGAT TGTTTTGCGC CGCGAACGGC GGGACGCGCG TGGACGTGAG CGAGACGCGG GCGGTGCGCG CGGCGGAGCG TTTCGCGCTG GACGAAGGGG CGATGGCGGC GCGCGCGGAC GCGGGGATGC TGACGAGAGA TTTGCTCGAT GGGCTCGCGA GACGTGGGTA CACGCTGCCG GCGTTTCCGT GGTTCATAGA CCAGACGATC GGGGGGGCGA TCGCGACGGC GAGTCACGGG AGCTCGCTCA GAGCGGGATC GCTCTCGTCG CAGATGGTGG CGTGCACGCT TGTCAAGGCA GATGGTTCCG TGGAGCATTT CTCCGAGGGC ACGACGCCCG CGCCGTTGTT CGACGCGCTG CGCGCGAACA TCGGGCGGTT GGGTGTCGTC GTTGACGTCA CGCTGCGCGT GGTGAAGAAC ACGCGAATCA CTCGTAGAAA CGAGGACGTG AGTCCCGAGG CGTTCGTCAA CGAGATGCGG CGCGTGCAGG ACGCGGTGCG AGCGTGCGAG CGCGATTACG CTGGCAACTT TGACGCGCAG TGGTCGTGCG CGATGAACAA GCCAGAGGTG CGTGCGCTCG ATGAAACCCA GTTTTTCTGG TACATTCCCC TCGGTGAAAT GAGTCGCGTG CACTTCGAGC GCGAGGAGCC GATGCCTTCG TTTCCTCGAT ACGACGAGCG AGGGTCGAAG GAGCTTTCCT CGATTTGGTA TGGTTCGTTG AGCGGCAATG ATTTGATTCG TGACTCGCCT CGCCGTGTTC GCGATATCAC ATCTCCTGTG ACGCTGATGA GCGCCGACTC GATGGCGGAA TCCTGGGCGA GACAATGGAA GCGCGCGACT CTCGCGAACA TCGCCAATAA CACAGAAGAA CAACGAGATA ATTTTTTATC GATGACTGAG CGCCAGTACG AGTTACATCA GCGATACGGG TACGAGCAAC TTGAAGTCGC GGTGCCGTTA ACGAAGGCTG GCGATTGCAT GCGCGCGTTC AAGGAGGCCT TGTACGACGA CCGGCAGTTG AATTTAGGAT TTCGCTCGCA GGCGTTACTT CGGTTCATCA AACCCGAAAG CGCGTGGCTG TCGCCCGCAC ATGGGCGACT TGGGTCCTTG TACATCAACA TTGAAGATTT CATCAAGTAT TCCCGCTTGA TCGATCGATA TGGAAATCCA CGCTTTGACG CCGCAGTTAA AATTTTACGC GGGGACTCGT GCGAAGGGAG ATTGCACTGG GGGAAGTTTG GCTTCCCGGA ACGCCGAGGG TGCTTTGACG GTGCCAAGGA GTATGGTGTT GCGTTTTGTC ACTTCGGCTG TCACGTGCAT CGTTTGGATC CCACGGGAAA GTTTGCTGGA GATTCAGACG TCTTGCGCTT CGACGGCGTT GATTTCGTAC ACTGCTGCGG CGATGACGGA TTGTTCAAAG AAAGCTCGAC GTGTCGGTGC GCGCTAACGG ATCGTCAATC ATGCTAG
|
Protein sequence | MPTRAVAVVL AATLAARGAR GEHPRDASNW QGKHACDAAG RAAKPRSVRD VSEAVYASTD ARANGAGHSW HSGLFCAANG GTRVDVSETR AVRAAERFAL DEGAMAARAD AGMLTRDLLD GLARRGYTLP AFPWFIDQTI GGAIATASHG SSLRAGSLSS QMVACTLVKA DGSVEHFSEG TTPAPLFDAL RANIGRLGVV VDVTLRVVKN TRITRRNEDV SPEAFVNEMR RVQDAVRACE RDYAGNFDAQ WSCAMNKPEV RALDETQFFW YIPLGEMSRV HFEREEPMPS FPRYDERGSK ELSSIWYGSL SGNDLIRDSP RRVRDITSPV TLMSADSMAE SWARQWKRAT LANIANNTEE QRDNFLSMTE RQYELHQRYG YEQLEVAVPL TKAGDCMRAF KEALYDDRQL NLGFRSQALL RFIKPESAWL SPAHGRLGSL YINIEDFIKY SRLIDRYGNP RFDAAVKILR GDSCEGRLHW GKFGFPERRG CFDGAKEYGV AFCHFGCHVH RLDPTGKFAG DSDVLRFDGV DFVHCCGDDG LFKESSTCRC ALTDRQSC
|
| |