Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_16603 |
Symbol | |
ID | 5003317 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | + |
Start bp | 678729 |
End bp | 680738 |
Gene Length | 2010 bp |
Protein Length | 669 aa |
Translation table | |
GC content | 55% |
IMG OID | 640418738 |
Product | predicted protein |
Protein accession | XP_001419241 |
Protein GI | 145349650 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGTGGT CGTACCTCAC CGGCGGTGCC GCGGCCGCGG CGCAGGAGCA ACGATCGACG ATCGACGCAT CCGCGAGCGA CATCGACCGC ATGAAAGCCG AACTCGCGCT CAAGGATGAA CAAATCAAGC ACGAACAAGC GCGCGTGTTG GCGCTCGAGT TGCAGATTAA ACTTCGCGAG CGCGACGAGC GCATCGAACG GCTCGAACGC GAGCTGAAAG AGAGCGCGGA GAAAGCGCCG AGGTCGTCGT CGCACGGGAC GCCGCAGTCG TACCCGCGCG CCTTCGCGTC GACCGCGCGC CCGCAAAACC GCGAGTCGGG AGGGACGAAA CGCACGAGAG ACTCGCGTGA TTACGCTTCA ACCTCCTCGG ACGACAAAGG TGACGACGAC GGCGACGACG CGACGCGCTC GAGTCCAAAC CATGCCGTAA AAACGAAGAC GTGTACCCCG TGGACGGTCG CGGAGGAGAA TTTCGTCATG AACTACTTCG CTGGCGTCTG GGGCGGTAGT GCTGCGTCGC TCTTCCGGAA CCGGGACTTT CTCGAGCGAT TGCGAGCCGT GAGTGGGAAC AAAAGGACGA AGGGAGCGCT TGCGGCGAAG TGGTTTAAGG GTGGCCTCAG AAACGAACTC AAACGCAAAG CTGAAAAGTG CTTGCTGTAC ACGGAGGAGG AGGGATTGAG GATACCTTGC TGGAGTAAGG AGGAGGAAGA TTTCTTCATG AAGCATTTGA AGAAAAGTGG TTACCAAGTG AGCGAAGGAA GGTTCATCCA TAATTACCTG ACTCAAGATT TCCTCGAGCG TCTGGCTAAA ATGAACGGGG GCATCAAACG AACTCAAAAG GACGCTTACA ACAGGTTTTA CAACACAACT TTTGAAAAGT ACAAGAAGAA ATTACAAGCG ACGGAGAAGG CGAGACAGGA AAACGGGGCG CTAACAAAGA AGCGAGAGAA GAAACTCATC GAGAGTCGAC CGACTGTGAC CAATGCCGAC GGTCAAATCA CGGCAAATGG CCGTATCAAG CTTCAGAAGA CGACCACGGC GAAGACCGGA CCAAACGGTG AGATCATCCT GGTGCCGATT CCCGGCGTCA AGTTTTGCCA TCGTTGCAAG CGCACGAACA GGGAGGGCAT GGATTTCCAT GAGCATAACT TCTCGACATG TATCAAGTGT CATGACCGCG TAAAAGAAAT CGAAGAAGCG CGAATCGCGG CCGGACTACC GCGCCGGGGA AAAGGTTCGA GCCCGCGCAA CAAGCCCAAG GAAACCAAGT TGCCTCCGAT CGGAGCAGAC GGCAAGCGGG ACTTGTTGGC GTACGAAAAT AAAGAGAAAT GGTCCGAAAT GATCAAGGCA GTCGTGGACG GTGACGTGCA AGTGTGTGCC GATGTCAGAG AAAAGTTGAT CAAGGATAAG CAGCTCTATC GCAATGTCGT CGCGTGGGAT AAATCGATCT TCTCCACCGC GGCGTTTTCG AGCAAGAATC CGATTCGTGT CATCAGCTGG GCGCTAGCGA ATGGGGCGTC GCGCGATGAA ATCAACGAAG ATGCGCTGAA ATGCGCTGTT GAGCGTAAAC CGCACGATAC GAACGAGCCC ACCGACGGGT GTGCGTACGT CCCGGCGGTC AAGGTTTTGA AACACTTGCA CAAGAGTGGG TTTCCGGCGA CGGAGGATGT TATACACACG GCGTGCGCGT TCGGCGACGT CGATTGCGTG AGGTACTTGA AGGAGAACTA TGAATGCTGC GATTTTGAGA ATATCTGGCG TGATTTCAAG AGATCTGACG GCGAAAACAA TGACATCATG ATCGTCGCCG CGCAGGAAGG TCACGTAGAC GTTTTGAAGT ACTTGTACGA AAACGACTGC GATTTTTCAG TGCAAGACGC AGAACACGCC ATGCGAGTGG CCACGAGCCG CAAGCCTCGA CGCGCTGATG GGTTCGAAAA AGTCAAGGAG TGGATGGAAT CTACGGCCGA ATGGCGAGAA TCGCAGGCTG AGAAAGAAAT AGAGGAATAG
|
Protein sequence | MAWSYLTGGA AAAAQEQRST IDASASDIDR MKAELALKDE QIKHEQARVL ALELQIKLRE RDERIERLER ELKESAEKAP RSSSHGTPQS YPRAFASTAR PQNRESGGTK RTRDSRDYAS TSSDDKGDDD GDDATRSSPN HAVKTKTCTP WTVAEENFVM NYFAGVWGGS AASLFRNRDF LERLRAVSGN KRTKGALAAK WFKGGLRNEL KRKAEKCLLY TEEEGLRIPC WSKEEEDFFM KHLKKSGYQV SEGRFIHNYL TQDFLERLAK MNGGIKRTQK DAYNRFYNTT FEKYKKKLQA TEKARQENGA LTKKREKKLI ESRPTVTNAD GQITANGRIK LQKTTTAKTG PNGEIILVPI PGVKFCHRCK RTNREGMDFH EHNFSTCIKC HDRVKEIEEA RIAAGLPRRG KGSSPRNKPK ETKLPPIGAD GKRDLLAYEN KEKWSEMIKA VVDGDVQVCA DVREKLIKDK QLYRNVVAWD KSIFSTAAFS SKNPIRVISW ALANGASRDE INEDALKCAV ERKPHDTNEP TDGCAYVPAV KVLKHLHKSG FPATEDVIHT ACAFGDVDCV RYLKENYECC DFENIWRDFK RSDGENNDIM IVAAQEGHVD VLKYLYENDC DFSVQDAEHA MRVATSRKPR RADGFEKVKE WMESTAEWRE SQAEKEIEE
|
| |