Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_26716 |
Symbol | GTF3501 |
ID | 5004820 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009366 |
Strand | + |
Start bp | 91567 |
End bp | 92999 |
Gene Length | 1433 bp |
Protein Length | 226 aa |
Translation table | |
GC content | 63% |
IMG OID | 640420241 |
Product | predicted protein |
Protein accession | XP_001420584 |
Protein GI | 145352509 |
COG category | [K] Transcription |
COG ID | [COG2101] TATA-box binding protein (TBP), component of TFIID and TFIIIB |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.258569 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.155129 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CGGCGTCGCG CGGCGTCCGT CGCGTCGTCT CGCGCGCGTC GTCGACGGCG TCTCGACGCG CGCGATCGCG GCGCGCCGAT CCGACGCGTC GACGCGCGAT CGCGAGCGCG CGATGAGCGA CGCGGAGGAC GAAGATCTCA CCGCGATCGT CGAGGCGGGC GAGGACGCGG TCGACCGGAG CGTGCACCCG AGCGGGATCG TGCCGGTGCT GCAGTGCGTG TCGAGCGAGG CGAAAACCCG CGCGCACTCG CGCGCGGCGC TCGACGCGCG GAAGAGCGCG GCGTCGGGGA GGCGCGGCGA GGGAATTCAT TCGGCGCGGA ATTCAATTGG CGCGACGCAT TCGGCGCGCG CGGGAAGCGG CGCGGACGAA GGCGAAGCGA GCGTCGCGAG GGCGATGGAC TGACGACGCG ATGCGCGCGC GTAGGAATAT CGTGGCGACG GTGAACCTGG ACTGTAAGCT GGATCTGAAG ACGATCGCGT TTCACGCGAG AAACGTCGAG TATAATCCAA AGGTGCGGCA ACCCTAAGGC GACGCGGCGA GAGACGCGCG AAACGCGCGC GATGGGTGCG ACGATGACTG ACGACGAACG ACGGTGGTTT GAATGCGCGT AGCGATTCGC GGCGGCGATC ATGCGGATTC GTAATCCGAA GACGACGGCG TTGATTTTCA GCTCGGGCAA GATGGTGTGC ACGGGCGCGA AGACGGAGGC GTTGGCGCGC GAGGCGGCGC GAAAATACGC CAAGGTCATC ATCAAGCTCG GATTTCCCGC GCAATTCAAA GACTTTAAGA TTCAAAACAT GGTCGGTTCG TGCGACGTGC AGTTTCCGAT TCGATTGGAA GGCTTGGCTT GGCAGCACGG CCACTTCGCG CAGTACGAGC CCGAACTCTT TCCGGGTTTG ATTTACCGCA TGCAAATGCC CAAGATTGTG CTCTTAATCT TCGTGTCGGG GAAGATTGTG TTGACCGGCG GCAAGCGTCG CGAGGATATA TACCAGGCGT TCGAGAATAT ATACCCCGTG CTCACAGAGT TTAAAAAGCT CGCGCAGCCG GACGAAGCCG CCGCGGCGCC CAAGGCGCTC CCGGCGGCCA AGGGGAAAAA ATAGAGCGAT CAGCACCAAA TAGAAACACA CACACAGAGG GGAGAGAGAA AGAGGGAAAC GCAGCGCGCG CGCGCGGAGA CGGAGACAGA GGAGAGGAGA AGAGAGAGAA GAGAAGAGAA GAGACGGAGG AGAATTTCAA ACCAAAAGTA GTAGAGCTAG CCGACGCGCG TCGCATCTCG AGACCACCGC GCGCGCGTGT CGCGGTCCGG GCGTTCGTCC CACCGCGCGC CCGCGCGTTT CTCGAGTCGC GCACGCCGAT ACAACATTCG CCTCATCGCG ACGACGAGAG CTCGTCCGCG CGTGTAATCG CGAACCAACG TCAAAACCAT CAA
|
Protein sequence | MSDAEDEDLT AIVEAGEDAV DRSVHPSGIV PVLQNIVATV NLDCKLDLKT IAFHARNVEY NPKRFAAAIM RIRNPKTTAL IFSSGKMVCT GAKTEALARE AARKYAKVII KLGFPAQFKD FKIQNMVGSC DVQFPIRLEG LAWQHGHFAQ YEPELFPGLI YRMQMPKIVL LIFVSGKIVL TGGKRREDIY QAFENIYPVL TEFKKLAQPD EAAAAPKALP AAKGKK
|
| |