Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_25220 |
Symbol | |
ID | 5004574 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009365 |
Strand | + |
Start bp | 216644 |
End bp | 218368 |
Gene Length | 1725 bp |
Protein Length | 546 aa |
Translation table | |
GC content | 57% |
IMG OID | 640419995 |
Product | predicted protein |
Protein accession | XP_001420276 |
Protein GI | 145351853 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.185767 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.155521 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATTCCGCGAC GCGACGCGGT TCGCCGAGAT GGCGCGACGA CGCGGACCGC GACGCCGCGG GGACGGCGAC GACGACGAGT CGACCCCGCT CGTCGCTCGC GGGCGCGGGT CGACGACGCC GGCGACGGTG ACGCGGGTGA CGCTTTTAGT GTTGGTCGCG ATGGGCGTCG TCGGCGCGTC GACCGCGCCG CGTCGACGAT TCTCGGCGCG AGTCGGCGTT TTCAAGAAGG GATTGAAAAC GGTGACGAGC GCGGCGAGCG GGGTGGCGAG CGCGTGGTCG AAGACGGGCG ACGCCGCGTG GACCAGGGCG GAAAATCTCT CTAAAAACGC GCGTAAAAAG GTTGGCGCCG AGTTGACGGC GGCCATGCAA TCCGGGCACC AAGAGATTTT TAAGCAATAC GACGAGGCGT CGGGGTGGTC CCAGCAAGCG GCGAATCAAG TTGCGGGAGA GGCGACAGAA CTCGGGGACG ACGTCGAGAA GACGGCGAAG ATGATTGCCG AGTTCCTCGA CGGTTATAAA TGCGACATCG GCGTCAAAGA CTTGAAGAAG ACCGTTGACG ATATTAAAAA CGCGGTCGGC TCAAACTCTT TGAAATCCAT GTACGACAAG GTGACGGGAT CTCCGAATAG GTATTTCAAG AACATGGACG ACTTGGCTTG CCAGATGATG TGGGATACGT CTGGATTCGC GCAAGCTGCA TCCGTGATGG AGGCTTTCAT CGAGTTGGCC AAGGATAAGT GTCCGCTCGT CGTCAAGGGT TCGAGTAAGC CCGCTTTCAC CTATGGTTTC TCCGTCTCGG GTGACATTGT CGGCGTCGTA CTCAGCGCCG GTCGCTCTGG CGAAGTCGGT CTCGGCATCG ACTTGTCTGG CCAAAAGTTT TGCTACGCTG GCCATTGCGT CTACTCGGGC TTCACCTTGG ATGTACCTCA AGTGGGTGTG GACTTCAACG TCGTTGTCTC CGGTTGGAAG TCGATGTCAG ACGTGTCTGG TCGGTCGAAC ATGATGGCTT TCGGCTTGAG TGGCGAGTTA CCAGACTCCG CCATCCCAGA TTTTGACGTT GATCTCACGC TCGTCACCGG CGGCGCCAAG ATGAGCAACA TATTGGGCGT CTCCAAGGCG TTAGGCGTCG GTCAAGACGC GTCTGAGCTG CCGGTGACGG GCAGCTTCGC CAAGGGTACG TGTATAACTC CTTGGTGCGT CACGTATGAA GGAGGTTCGT GCGCCGGTGA TCCGGGAGAA GGGCGCGATT TCAAGTGCTA TATCTCCAGA CATTCGTCGT TCCGGCTTCC TGCTCCCTTT GCGGGTCAAA CCGTTCAAGA CCGATGGGAT TACTACAATC GCAAATCGGA ATCGGGCAAA AAGATGCGCA AAGAAATCGA ATCGAATAAT GACAAGTATC CGATTTACGA AGGGTGCGAG TTCGGCGACG CCGAGATGGA GTGTTACTAC GACAAGTTCT ACGCCAAGGC TCATCCAAAG GATTACGACG ACATCGAGAA AAAAGTGAAC AAAGAATCTT GCAACGGCTT CTCCTCTGAC TCTGAATGCT CCAAGAATCG CCGCGAGAGG ATCATCGAGA AGAAGGTGAG CCGCGCCAAG GCTGATTTCT TCAAAGATAA GGGTGGCTGG AAAAACGAAC CGACTTCTTA CGCGAAGTGC GGCATTTAAG ACGCGCGTTC CACCGCGGGC GCGCGCGCCG CACGATTTTG CCGACGACGC TGTAA
|
Protein sequence | MARRRGPRRR GDGDDDESTP LVARGRGSTT PATVTRVTLL VLVAMGVVGA STAPRRRFSA RVGVFKKGLK TVTSAASGVA SAWSKTGDAA WTRAENLSKN ARKKVGAELT AAMQSGHQEI FKQYDEASGW SQQAANQVAG EATELGDDVE KTAKMIAEFL DGYKCDIGVK DLKKTVDDIK NAVGSNSLKS MYDKVTGSPN RYFKNMDDLA CQMMWDTSGF AQAASVMEAF IELAKDKCPL VVKGSSKPAF TYGFSVSGDI VGVVLSAGRS GEVGLGIDLS GQKFCYAGHC VYSGFTLDVP QVGVDFNVVV SGWKSMSDVS GRSNMMAFGL SGELPDSAIP DFDVDLTLVT GGAKMSNILG VSKALGVGQD ASELPVTGSF AKGTCITPWC VTYEGGSCAG DPGEGRDFKC YISRHSSFRL PAPFAGQTVQ DRWDYYNRKS ESGKKMRKEI ESNNDKYPIY EGCEFGDAEM ECYYDKFYAK AHPKDYDDIE KKVNKESCNG FSSDSECSKN RRERIIEKKV SRAKADFFKD KGGWKNEPTS YAKCGI
|
| |