Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_25463 |
Symbol | |
ID | 5005378 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009368 |
Strand | + |
Start bp | 42063 |
End bp | 44097 |
Gene Length | 2035 bp |
Protein Length | 570 aa |
Translation table | |
GC content | 60% |
IMG OID | 640420799 |
Product | predicted protein |
Protein accession | XP_001421192 |
Protein GI | 145353806 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02345] T-complex protein 1, eta subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.261555 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.000278196 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | CGCGAACGCG CGAACGCGAT GGCGGCGCCG ATGGTGCGCA TGCCCGATGG GCGATTAATA GTGCGTGTCG CGACGGCGAC GGCGCGGACG ACGCGACGCG CGACGCGCGA AGAGACGCGA TCGGAACGCG CGCGCGCGAT GGCTCGCGAC GGCGGATGAA GAGGATGATT TCGAGCGATG CGACGTCGCG CGAACGACGA GGGGGGGGGC GCGCGACGGA GGAGGGGTCG AAGAATTTGT GTGCTTCGAG CGCGGACGTC GCGCGGGACG TCGGCGCGAC GCGGGAGACG CGAGGCGGCG ATGATGGTGA TGATGAAGAC TGAAGACTGA AGACTGACGC TCGCGAACGA ACGCGCGAAC GCAGCAACCG CAAATCGTGC TGCTCAGGGA AGGCACGGAC ACGTCGCAAG GACGCGGACA GTGCGTGTCG AACATAAACG CGTGCTGTGC GGTGGCGGAT ACGGTGCGAA CGACGCTTGG ACCGCGAGGA TTAGATAAAC TCGTGCGAGA CGCGCGAGGA AACACGACGA TTTCCAACGA CGGCGCGACG ATTATGAAAC TGTTGGAGAT TGTGCATCCG GCGGCGAAAA CGCTCGTGGA CATCGCGAGA GCGCAGGATA GCGAAGTGGG GGACGGGACG ACGACGGTGG TGATACTGGC GGGGGAGCTG TTGAAGGAGG CGAAGACGTT CATCGAGGAC GGCGTGCACC CGATGAATGT CATCAAGTCG TTTCGAGAGG CGTGCGATTT GGCGACGGCG CGCGTGCGCG AATTGGCGAC GTCCATCGAA GGGAACAGCG CGGAGGAGAA GGATGAACTT TTGAAGAAAT GCGCGATGAC GACGCTGAGC TCCAAACTTG TAGGAGGGGA GAAGGACTTC TTTGCCGACA TGTGCGTCAA GGCGGTGCGC TCGCTCGATC AAGACTTGTT GGACCCGAAG ATGATTGGCG TGAAAAAGGT CATGGGCGGA GGGATGACGG ATTCGTTCTT GGTGGATGGC GTGGCGTTTA AGAAGACGTT CGCCTACGCC GGCTTCGAGC AAATGACGAA GAGCTTCAAA AAGCCGAAGA TTTTGGCGCT CAACATGGAG CTCGAACTGA AGAGCGAAAA AGACAACGCC GAGGTGCGCC TGAGCGATCC GACCAAGTAT CAAGAAATCG TTGACGCTGA ATGGAACATC ATTTACGAGA AGCTCGACAA ATGCGTGGCG TCGGGAGCGA ACATCATTCT GAGTAGACTC GCGATCGGGG ACTTGGCGAC GCAATATTTC GCCGACCGCG GATTGTTTTG CGCCGGTCGC GTAGACGCGC AGGATTTGGA ACGCGTGACG CGCGCCACCG GCGCGCCGGT GCAGACCACG GTGAACAACA TCACGGACGC CCAGCTCGGA TCGTGCGAGT TGTTCGAAGA GATTCAAGTC GGTAACGAAC GGTACAACAT TTTTAGAGGT TGTCCGCAGG CGAAGACATG CACGCTCATC TTGCGCGGCG GCGCCGAGCA ATTCATCGAG GAGGCGGCGC GCTCGCTCAA CGACGCCATC GAAATTGTTC GCCGCGCGGT GAAGAACGCG GCGATCGTCC CTGGTGGCGG TGCGATCGAT ATGGAGTTGA GCAAGTACTT GCGCAACCAC GCGCGCACGG TGGCGGGCAA ATCTCAGCTC TTCATCAACG CCTTTGCCAA AGCTTTAGAA ATCATCCCGC GACAGCTCTG CGACAATTCC GGGCACGACG CGACCGACGT GTTGAACAAG CTCAGACAAA AGCACGCGGG CGATGACGGG GCGAATTTCG GCGTCGATGT CAACGGAGGT GGCATCTGCG ACACGCACGA ACGCTTCATC TGGGAACCGA GCCTGGTGAA AATCAACGCC CTGAACGCCG CCACCGAAGC GACGTGCTTG ATTCTCTCCG TGGACGAAAC CGTGAGAAAC CCGCGAAGCG AAGGCGCGGA TGAAGGCATG GGTGGCGGCG GTGGTCGGGG CATGCCGATG GGAGGTAGAG GAGGTGGTCG CGGCCGGGGT CGCGGCCGAC GATAG
|
Protein sequence | MAAPMVRMPD GRLIQPQIVL LREGTDTSQG RGQCVSNINA CCAVADTVRT TLGPRGLDKL VRDARGNTTI SNDGATIMKL LEIVHPAAKT LVDIARAQDS EVGDGTTTVV ILAGELLKEA KTFIEDGVHP MNVIKSFREA CDLATARVRE LATSIEGNSA EEKDELLKKC AMTTLSSKLV GGEKDFFADM CVKAVRSLDQ DLLDPKMIGV KKVMGGGMTD SFLVDGVAFK KTFAYAGFEQ MTKSFKKPKI LALNMELELK SEKDNAEVRL SDPTKYQEIV DAEWNIIYEK LDKCVASGAN IILSRLAIGD LATQYFADRG LFCAGRVDAQ DLERVTRATG APVQTTVNNI TDAQLGSCEL FEEIQVGNER YNIFRGCPQA KTCTLILRGG AEQFIEEAAR SLNDAIEIVR RAVKNAAIVP GGGAIDMELS KYLRNHARTV AGKSQLFINA FAKALEIIPR QLCDNSGHDA TDVLNKLRQK HAGDDGANFG VDVNGGGICD THERFIWEPS LVKINALNAA TEATCLILSV DETVRNPRSE GADEGMGGGG GRGMPMGGRG GGRGRGRGRR
|
| |