Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_25777 |
Symbol | |
ID | 5006390 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009372 |
Strand | - |
Start bp | 74986 |
End bp | 78287 |
Gene Length | 3302 bp |
Protein Length | 841 aa |
Translation table | |
GC content | 55% |
IMG OID | 640421811 |
Product | predicted protein |
Protein accession | XP_001422333 |
Protein GI | 145356221 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.203689 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00367561 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTTCAACGCG CGACGATGGT GAGACGTGGC GTTGCGATCG CGATCGTTGG TGTCGCCGCC GTGGTGGTCG CCCTTGGCAC CGGCCTCGGC ATCGGGCTCA GGGATGACGA CGCGACCCCG GCCCCGGCCC CGACCCCGAC CACCGTCAGC GGGCTGTCGG GCGCGGCGAA GACGAACGCG CTCAAGGCGG CGTTGACGCC GCGTAAAGTC ACCATGATCC CGTCAGAGGG TGTTTCGACG ACGTCTTCTG GTTCGTCTCG AATCGATGGT CGTTCGCTCG TCGAGGTGAC GCCTCTCGAA GTGTCTTCGT TCTCTTCGAC GTCCGACTAC GCCACGGCGC CGCCGGCGGA GACGTTCGTC GACGCTGTAA GTACCGAAAT TTTCCAGATA CCAAACATGG TTCTTTGCTA CGTCGCCTCG GTGAACTGGA CGGCGAACCT GAACACTGGC CCATACATCG CTGAGATCGA TCCTTTCCAC TGTGACAACA GCGACGGCGA CCGAATCAAG GGTAAGGTTA CGTACCGTTG GGTCGTCAAC GCCACGGGCC CAGACATCGA CGACGCGAGC GATACCCGTG ATTTCAAGAC GAACGTCTGG GTAGCCCTCA GCGATACCCC AGAAATGCCG AACATCGACA CGGAAATGAT CGTCAAGCGC GATGACGTCG CCATCAAGTC TGACAAAATC TTGGTGAAAA ACTTTACCTT TACGTACCAA TCGCCGACTG GCTCGACCGC GTCCCAACGC ATCAAAGGTG TGGTTCGTCG CGAGTGCGAA ACCACCGACG CGGATTCGGA TTGCACGGCG ACCGGGGTCA CGTGGTGGGA AGAGGTGACA CGAGGCAGCA ATTCATTCAC CGCGGGTGCT AAATCGCGCG CAGAAAACGA CATCACGAAG GCCTCTTTCC AGACCATCGA CTTTGATAAT GGTTCGACAC AAGCGGGTCA ATTGGTGACT ACCGCTACGC TCGTCAAGAC GAAGGCGTAC GGGCAGTCAT TCTGCGATGA ACTCGAGAAC AGATCGTTCT TCGGTGAGCG CTACGGCGCG TACGACAGCA ACGGGGCGAA GGTGAACCTT CAAAGCTACG TGCATCTTCA GGCTACAGGC AGCGACGGCA AGACGTATAA TGCTGATCTG CACTATCCGG GCAATCTCTA CATTTCCGAT TACGACTACA TTTCCGACAC CGCGCTCAGT ACCGCCGAGA AGACTATCGC TCAAACCGCG TTCACCGACG GGAACGTAGT AGAGGAAATT GTCGATTGGG ATGATCCGAT ACTGAACGTG AACAAGAGGC TCAAAGTCTC ACGCGGGGTG TTGTACAAGT TCACGGCGAC GGTGCGAAGC GCAAGCGACT ACCAAGGGGC GACGATCACA GGCTGGTTCT GGGATGACAC TAACGCTGCA GAAGTTGAGG TCAAGTTCAG GTATGACTCC GACGCGAACG CACTCGTGTT GTCGTCGAAG CGAACTTGGA ACGGTGGATC GCTGAACACG AACTCAATCA CGACTCCGCG TGCGCTCACG ATGGCTGACA TTGACAACCA AATCATACGC TGCTGGGGAA GCATCTTCGG CCGAGGCTCT GCGAGTTTGC TTAACCTGAC GCACTTCAAA GTGTCTGACG CGCAAACCGT CCCCCCTGGG TCGCTCAGCA GCCACCTCGC ACTCACGTGC GGCGAGCGGT GTATCGACCA GACGAAGCTA TCGAGTGCGA GCAGTCACGA CAGCCAGTTC TACGGCTCGC CGAACGGATA CAACGACGCT CGAAGCACTT ATAAAAAGTA CGTCTTCGAC AAAGATACCG GCTCGTTGAA GGAAGACGTT TCCGGCACCC TCGGCGCTGA AGTCGTAATG GACCCAACCA ACTCATTCTT TGACTCGGGA GAAGTTCATA TGGTTCTCTT CGAAGCTACA CCTGCGAACC TCGACACTTT GAGTTGCACC GCCACTACCG ACGTTTGCGA GGAACCCAAC AAATTGGATG TGCACTACGA GTGGAGCTCA AGCAATTGGG AGGGCATGGC GTGGCTCGAA GATCCTTCGG ACTCCGCGTC AAAGAAGTTC ATGGATGCGC CTCTCCAGCT TCAAGGCAAG ATACCCACGG ACGCCGTGTT GCTGCGCTCT CCATCCGGTA CCGACTACTC CGGGGTGAAT CTCAACGTGC GTTACGAAGG CGGTTGGTTG GGCGACGCGC CGTTCATTTG CTTCAATCCG ATGACTGGTT CGCGCGCGGC GCCCGAGGTC GACCAGTATG GCAACGAGGA GTGCGACGAC AACAATGGTT ACCATCGCCG TCCGGATGTC CTGATCCCCG ATGGCACCAC GTTCAAACAA CCATCGACTG GCGACCGGTA CGTCTTGAAA CTCGAAAGTG GTGTCGAAAT GCTTGCTCCC GCCGATGCGT CGGCGTGCTC CGGGATGTCG TACGACACGA GCATCACGGT GCCCACGGCG GCGGACTACA CCGCGTTCAC CATGCCGACG AAGCCGTCCA TGACCGGGCT CAGCGTAAAA GGCACGGATA AGGTTTCCTA ATCAATCAAG TCAACCAGTG CTTGTTTTCA AGATCGTGTC CGAAACGCCG GCGTTATATC GGCGGGCGCG ACCCATCGAT GCGACTCAAA CGACGGCCAA ACTTGAGTCT CTCGGCAGCA ATAAAATCAT TTAGCGCCAA CACGCGCGAG TCTTTAATTA ATAAATATAA ACTTCTCCGT ACCGGTTATC ATGTGGAGCG CACGAGACCT ACGACCGACG CGTATTCGGA AATGAAGCGT GGCTGCATGC CCAAGTTCAC TCGATGGTTT GAGCATTGCG TCACTGTTGA ATTCCCTGAG AAGTGGGTTG GAAATAAAAT CCGCAACAGT GACATCTTCA TCGAATACCA GACGTGGTTG CCAGCGGCTG CGCGTGGACA AGATAGCGCG ACGAAGGTCG GCAATAAGCT CAAGGACTTC TTCAAGAAGG AAAAGGGCCA CAGGATCCCA ATGGAAGAGG ACCACCTGAG GCAAGGTAGG GACGAGAAGG GCGTCTATTG GGAAATTGAC CGCGACGGGT GCTTCGAGTG GCTGAAGAAC AATGGGTACA CGGGGGAGAC GGAGCTCGCG CCGGCGGTCG TCTGGTGTTC ATACTAATTT TGTGATTCGT ACATTTGCAA TATGAAGACT CAACAAGACA CCGAAAAGAC CATGGTTTGG ATCCTGACGA AATGACGAAT GACGAACTGT TTTCATAAAG GTCTACACTG TTGCTTGTCA TAGATCTAAA ATAAAAATAA ACATTAAAAT ATTTTAACAT TT
|
Protein sequence | MVRRGVAIAI VGVAAVVVAL GTGLGIGLRD DDATPAPAPT PTTVSGLSGA AKTNALKAAL TPRKVTMIPS EGVSTTSSGS SRIDGRSLVE VTPLEVSSFS STSDYATAPP AETFVDAVST EIFQIPNMVL CYVASVNWTA NLNTGPYIAE IDPFHCDNSD GDRIKGKVTY RWVVNATGPD IDDASDTRDF KTNVWVALSD TPEMPNIDTE MIVKRDDVAI KSDKILVKNF TFTYQSPTGS TASQRIKGVV RRECETTDAD SDCTATGVTW WEEVTRGSNS FTAGAKSRAE NDITKASFQT IDFDNGSTQA GQLVTTATLV KTKAYGQSFC DELENRSFFG ERYGAYDSNG AKVNLQSYVH LQATGSDGKT YNADLHYPGN LYISDYDYIS DTALSTAEKT IAQTAFTDGN VVEEIVDWDD PILNVNKRLK VSRGVLYKFT ATVRSASDYQ GATITGWFWD DTNAAEVEVK FRYDSDANAL VLSSKRTWNG GSLNTNSITT PRALTMADID NQIIRCWGSI FGRGSASLLN LTHFKVSDAQ TVPPGSLSSH LALTCGERCI DQTKLSSASS HDSQFYGSPN GYNDARSTYK KYVFDKDTGS LKEDVSGTLG AEVVMDPTNS FFDSGEVHMV LFEATPANLD TLSCTATTDV CEEPNKLDVH YEWSSSNWEG MAWLEDPSDS ASKKFMDAPL QLQGKIPTDA VLLRSPSGTD YSGVNLNVRY EGGWLGDAPF ICFNPMTGSR AAPEVDQYGN EECDDNNGYH RRPDVLIPDG TTFKQPSTGD RYVLKLESGV EMLAPADASA CSGMSYDTSI TVPTAADYTA FTMPTKPSMT GLSVKGTDKV S
|
| |