Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_26042 |
Symbol | |
ID | 5004226 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009364 |
Strand | - |
Start bp | 132627 |
End bp | 133907 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | |
GC content | 63% |
IMG OID | 640419647 |
Product | predicted protein |
Protein accession | XP_001420089 |
Protein GI | 145351448 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3145] Alkylated DNA repair protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.0723901 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGCCG CGCGCGGCCA CCATCCCGCG CGATGTACCG CGCGCGCGCA CCGCATCGCG CGACGACGAC GCGTCGTCGC GCGTCGCGCC GACGCGAGCG ATCGCGAAAC CGCCGCCGCG CCGAGCGCCG CGCAGCTCGT CAACGACGCG AAAAGTGCGC TGGATATTTT AACCGCCGCG GATTCGTTGC CGCTGCCGAC GGACGCGTCG CTCGCGCCGC ACGAGTCCCA GCTGCACCAC CGAAGGAAAC GCAAAAAGAC GTGCTCGCGG GCGCTCCAAA GGCTGGCGAA AATGCTGGTC GGGACGCGAC GGGAGGACGC GCGGCGGGAG GCGACGAGCG CGACGCAGTT CGCGAGGTTG GTCGCGGGCG CGCTGTGGAT CGACGACGAC GACGAGACGA GGAACGACCC GGAGGCTGGT GTGTTGTTTA CCGAAACCGC GCGCGCGCTC GGGAGCTTGG CGCCGTTTGA GATGGAGGAA GCGGCGAGAG TGAATTTTTA CGCGACGGCG GCGGCGGCGA CGCTGCCGCC GCGCTGCGCG ACGGTGGTCG CGTGGGCGCT CGCGCGGTGC GGGAGCGCGG TGCCGAACGA AGTCGACATC GCTATGCGTG GGGTTCCGTT TCGTTTTCAA CCCTATCTAA CGGCGGGATT GATTGATTTG GAGACTTTGA AACGCGAAGT GCCGTTCAAG CGCGAACAGT TGACGACTCG GGACGGCAGG CGCGTGGACG AGCGCCGCGA GACGTGTTGG ATGGGCGAAG AACACGTTGG TTCGTACGCG TACAGTGGGA AAATTATGCA ACCCGTGCCG ATGTGCCCGG CGGTGGCGAG AGTGCGCGAT GCATTGGAAG AGAAGACGGG CGAGAGGTTC GATTGCTGCT TGATTAATTT GTACCCGAGC GAAACGGCGG CGTGCGCGTA TCATACCGAT CCGTTCATGG GCATCGGGTA CGCCACGGAT AGCATCATCG TCTCCGTAGG TGAGACCAGG CGATTTAGTT TTAGGCCTCT AGGTTCGACC GACGCGGAGT CGCATTGGAT CCGAACGCTC GATGGCGACG CGATTTGGAT GTTCGCGAAT TGTCAAGACG ACTTCGAGCA TTGCGTGATG ACAGCAGAGG GCGACGGTAA CGACGCGCCT CGCGCGAGCA TAGTTTTCAA GCGAAGTCTG AAAAGAAAAT CAGCGGCGGA GGCGAGAGCG AAGAAGAAGA AGAAGAAACC TCCACCGTCG TCGAGCGGAG GAGCCGGAGG AGGTAGGAAA CAGCCTGCGA AGAGACGTTA G
|
Protein sequence | MRAARGHHPA RCTARAHRIA RRRRVVARRA DASDRETAAA PSAAQLVNDA KSALDILTAA DSLPLPTDAS LAPHESQLHH RRKRKKTCSR ALQRLAKMLV GTRREDARRE ATSATQFARL VAGALWIDDD DETRNDPEAG VLFTETARAL GSLAPFEMEE AARVNFYATA AAATLPPRCA TVVAWALARC GSAVPNEVDI AMRGVPFRFQ PYLTAGLIDL ETLKREVPFK REQLTTRDGR RVDERRETCW MGEEHVGSYA YSGKIMQPVP MCPAVARVRD ALEEKTGERF DCCLINLYPS ETAACAYHTD PFMGIGYATD SIIVSVGETR RFSFRPLGST DAESHWIRTL DGDAIWMFAN CQDDFEHCVM TAEGDGNDAP RASIVFKRSL KRKSAAEARA KKKKKKPPPS SSGGAGGGRK QPAKRR
|
| |