Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_88041 |
Symbol | TPR1 |
ID | 5003455 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | - |
Start bp | 450778 |
End bp | 452667 |
Gene Length | 1890 bp |
Protein Length | 629 aa |
Translation table | |
GC content | 62% |
IMG OID | 640418876 |
Product | TPR-repeat containing protein |
Protein accession | XP_001419384 |
Protein GI | 145349939 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.220076 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0637913 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGCGA GAGAGATACA CGCGCTGACG CGCGCGGGAC GACGCGACGC GGCGGCGCGG GAAGGGACGG CGAGCGCGAG GGCGAAGCCG GCGGACGCGG CGGTCGCGAG CGCGGCGACG ACGGCGTGCT TTAAAGCCGG TGACGTCGAT GAGGGACGGG CGATCATGCG AAGACATCGC GAGAGCGTGG GTGACGGTAA GGTTGGGTGC GTCGTGCTGT GCGCGTGGGC GACGGCGGAA TCGGGAACGG GGACGCGGGA CGGGGACGCG AACGCGAGGG CGTTGCTGCG AGAGGCGACG ACGACGGCGA ACGGAGACGC GAGGGCGGTT TGCGACGCGT GGGTGGCGCT CGGGAATCAC GAACTGAAGC ATGGTCGCGT TGATAAGGCG AGAAAGTGCT ATAAGAGCGC GTTGAGCGCG GGAGAGGGAG CGCCGGCGAG CGCCGCCGTG GCGGCGCACT CGTGGGCTCG GTTGGAGGCA AAGGAAAGGA ATCCCAAGCT CGCGCGAGAG CTCTTCGCCA AGTCGGTGGA CTTGTGCGAA ACGCACGTTG CGAATTATAC CGCGTGGGCG TCGTTTGAAA TGAGTCGCGG GCAGAGCGAT GCGGCGAGGA AATTGCTCGA GCGCGGGGCG GCGTTTTGCA AGTCGGCGGA AGAGTTGCGC CAGACGAACG ACTCGAAGAC AAGGCGAGCT CGCTCGATGG CTTCGGCGTT GTTCACGTCC TGGGGGGACA TAGAGGGGCA AATCGCGTTG CGTGAGAGCG AAGACTCCGC CGATGCGCTC GATATCGCCG TATCCAAGTC GAGAGGAATG TTTGAGCGCG CGTGCGCGTA CGACAAGAAG AACGTCGCGG CGTGGTTGAA GTGGAGCGAA CTCGAGAAGG ATATCGCGCG CGGTAATTCG CATCGCGGTG GGGTGGTGAT GAGCGCAAGA CGGAACAGCG TGTCGAACCG TCGGCAACTC GATGTCTTGA CTGCGGGTTT AAAGGCGAAT CCGGGCGAAA TGCGACTCGA GCACGCCTAC GCCATGGCGC TGAAACTCAA CGGCGACGTC GAGGAGGCAA CGAGACGACT CCATCGTTTA AGCGAGCGGT TTCAAAACAA CGCGCACGTG TGGCACGCAC TCGGAACTAC ACTTCAGGAA TCTGGTGATT TTCAAGGCGC CATAGCGGCT TTTGAACGTG GATCTTTTGC TTCAGGTCGC GCCAACTTGC CGTGCATCAC CGCGGCCGCG GCGGCGGAAT TGCACGGCGG CAAGCACGGT CGTGCTCGCC AGCTGTTTGT CCAAGGCGAT TCCGTTCCGC GGCATCTGAG TACTCGTCGT GAGCGCGCCG CACACCTCCG ACTGTGGGCT TTGCTAGAAA AGCGTGCCGG GGGCGAGGAG GCGACGCGTA AACTGTTCAT CGCCGCCACC GCTGAAGATC GCACCGACGC CGCGACTTGG TTGCAGTGGG GGCAATGGGA GAAGCGTGTG AACAGTGTCG GCGCCGCGCG CAAGGTATTC AAAGATGGAA TTCGCTACGG CGTGAATAAT GGACAATATT TCATCTATCA GGCTCTAGCC ACTTTGGAGG CGGAGACAAA CAATCACGAA TCGGCGCGAG AGTTATTCAA GCAAGGATGT TCCGCGCATC CACGCAGCGC CTCTCTGTGG CTGCAGTGGG CTCTGTTCGA GCTTTCGTGC GGCGAAGACG ACAAAGCTGC GTCGCGGAAT TCGATTGCAG TCATCGAAAA AGGGGCATCG CGCGCGCCGC CGCACATCCC GCTCCTCGAA CTGTGGCTCA ATCTCGAACG AAAGGCTGGC GACGAGCACA AGGCGCGCGC GGTGGAGGAC AGGTTGAAAA AGCTTCTCTC CGAGCAGCGA TACGCTCCGG TCGGTCACGA AGTGAATTAG
|
Protein sequence | MRAREIHALT RAGRRDAAAR EGTASARAKP ADAAVASAAT TACFKAGDVD EGRAIMRRHR ESVGDGKVGC VVLCAWATAE SGTGTRDGDA NARALLREAT TTANGDARAV CDAWVALGNH ELKHGRVDKA RKCYKSALSA GEGAPASAAV AAHSWARLEA KERNPKLARE LFAKSVDLCE THVANYTAWA SFEMSRGQSD AARKLLERGA AFCKSAEELR QTNDSKTRRA RSMASALFTS WGDIEGQIAL RESEDSADAL DIAVSKSRGM FERACAYDKK NVAAWLKWSE LEKDIARGNS HRGGVVMSAR RNSVSNRRQL DVLTAGLKAN PGEMRLEHAY AMALKLNGDV EEATRRLHRL SERFQNNAHV WHALGTTLQE SGDFQGAIAA FERGSFASGR ANLPCITAAA AAELHGGKHG RARQLFVQGD SVPRHLSTRR ERAAHLRLWA LLEKRAGGEE ATRKLFIAAT AEDRTDAATW LQWGQWEKRV NSVGAARKVF KDGIRYGVNN GQYFIYQALA TLEAETNNHE SARELFKQGC SAHPRSASLW LQWALFELSC GEDDKAASRN SIAVIEKGAS RAPPHIPLLE LWLNLERKAG DEHKARAVED RLKKLLSEQR YAPVGHEVN
|
| |