Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_36784 |
Symbol | |
ID | 5006999 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009375 |
Strand | - |
Start bp | 256270 |
End bp | 258546 |
Gene Length | 2277 bp |
Protein Length | 576 aa |
Translation table | |
GC content | 57% |
IMG OID | 640422420 |
Product | predicted protein |
Protein accession | XP_001422941 |
Protein GI | 145357469 |
COG category | [K] Transcription |
COG ID | [COG1405] Transcription initiation factor TFIIIB, Brf1 subunit/Transcription initiation factor TFIIB |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.00059248 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0129799 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGCACC GGTGGTGCGA GACGTGCGGG AAACGGGTCG CGGCGGAGAC GAACGAGGCG AACGGGTTCA CGTGCTGCAC GACGTGCGGG AAGATTTTAG ACGAGCGCGC GGCGTTCAGC GCGGACGCGA CGTTCGTGAA GAACGCGCAG GGAGCGTCCG TGCCGGATGG ACACTACGTG CCGGAGAGCG GGGTGGCGCA CGGGGTGATA CGGGCGACGC GAGGCGGAAG ACTGTACGGC GTGCAGTTGG ACTCGCACGA GCGCACGCTG TATCGAGGGA AGCTGGAGAT TAAACAGTTG GCGGATCGGT TGGGGATACG ACCGAGGGAG GACGTCGTGG ACGCGGCGCA TCGGCTGTAT AAGCTCGCGG TGCAGAGAAA TTTTACGCGA GGGCGAAGGA TTTCGCAAGT GGCGGGGGCG TGCATGTACA TCATCTGTCG ACAGGAATCG AGACCGTACA TGTTGATTGA TTTCGCGGAT ATTTTACAGA CGAACGTGTA CGTTTTGGGG GGCGTGTTCT TACAGTTGTG TCGGTTGTTG CGACTGGAGC AGCATCCGCT CATGCAAAAG CCGATCGATC CGAGTCTGTT CATTCATCGA TTCGCGGACA AGTTGAACTT GGGACGACGG ATGCACACCG TGGCGAACAC GGCGCTGCGG CTCGTGGCGT CCATGAAGCG GGATTGGATG CAGACTGGTC GTCGTCCGAA TGGAATTTGT GGTGCCGCGT TGTGGGTCGC CGCTCAAATT CATGGATTCA GTCCGAGCAA GCGCGATGTC GTGGCTGTGG TGCACGTCGG CGAATCGACG CTGAAGAAGC GTCTGAGCGA ATTCGAAAAC ACGCCGAGCG CGGCGCTGTC GATCGAGGAG TTTGACACGC AAGCTCGCAC GTTTGAGGCT GAAGAAGAAG CGAATAAAAA CACAAAATCG CTAGCGTCGA GCCCAATGTC GGTGCTGAGC TGTGTGCACA AAGACAACGA AAACATTCCG CACTTTGCGC ACGGAATGTG TCGCGCGTGT TACGTGGATT ACGTTAGAAT TTCGGGGGGT TCGGTGGGAG GCGCCGATCC GCCCGCGTTC ATGCGCGCAG AAGCGAAGCG GAAAATCGAT GCAAAACAAA AGCTTTTGTT GCCCGCGCTG TCGTCGGGCG AATTGGGAGA CGAAGACGCG TTGACGCAAG AATTTAACTC GGCGCTCGAG CAAGACTTGA GCGCGCTGCT CGCGTCGCCT ACGCCGTTGA ATTCAGTTCA GCCCTTGGCC TTACCTTGCT CGTCGAAACG CGCGACTTCG GCGAGGAATA TGACGAAAAA GGGGCAAAAG CAGCAACAGC AACATCACGT TCAAACTAGT CGCCGCGAGC CGGTCGACGC TGATTTCTTG AGACGCGCCG AGGACGCCCT GCGCCTGCTC GTCGGATCTC GATGGGCCGA ACTCGTTTGC TTACCGTTTA CAAGCGATCT CTCCAAGGCG CGACTGTCGA AGATGCATAT GTGTGAGCTC CATCCAAACT ACGAAACCTT TGTGAACAAC GAAGGTGAAC GCATCGCGCA AGTGGACGCG CTGGTCTTAC ACTTTTTAAT AGCCGCGAAG TGTTTCGACG ATTCAGCTCT GGCGCAGTTG GCAAAGCATT CGCCGCACGA CGTCGAGGCT TTCCAAACGA CGCCGTTCGA TCCATCGCAA ACGTCATCGA AGGCCGCGCT TGCGGTCGTT GAAAGCGACG GTCTCGTCGC CAAGGAGGAT AACGAAGTCA TCGATACGCT CTCGGACGTT GACGACGATG AAATCGATTC GTACATTCAC AACGAAAACG AAGTCAACCT TCGTCGTTTG GTTTGGTCTG AGATGAACAA GGAGTACTTG GAATTCCAAG CCTTGAAAGA GCAAGCCGCC AGCCGCACGA GCGCGCCGAC GAAAAAGAAG CATAGAAAAG CCCCCGACAC GCTGCCCGCG GAGACTCCCG CGGAAGCTGC GCGTCAAGTC TTAGCTAAAA AGAAGGGCAG CTCGAAAATC AACTACGAAG CGTTGGAAAA TCTCTTCAAA GTTTCTGATG GTTCGCAGCC GCCTCCGAAC TCAAAAGCGA CGTCTGACGT CGAAAATGAC GCTTCTCCGA CAAAGTCTCC TCGCACGAGA CGCGCGCGTC CCGCAGGCTT ACCCTCGAGC GCTCCGATGT CGACGAAATC AACCGCCAAG CGTCGCGGCT CGAGCGTCTC GACGCACGCG CCGTCGTCCG CGCGTCCGAG CGGTCTCGCG AAGAAGCCAT CGGCGAAGAA AAAGTGA
|
Protein sequence | MVHRWCETCG KRVAAETNEA NGFTCCTTCG KILDERAAFS ADATFVKNAQ GASVPDGHYV PESGVAHGVI RATRGGRLYG VQLDSHERTL YRGKLEIKQL ADRLGIRPRE DVVDAAHRLY KLAVQRNFTR GRRISQVAGA CMYIICRQES RPYMLIDFAD ILQTNVYVLG GVFLQLCRLL RLEQHPLMQK PIDPSLFIHR FADKLNLGRR MHTVANTALR LVASMKRDWM QTGRRPNGIC GAALWVAAQI HGFSPSKRDV VAVVHVGEST LKKRLSEFEN TPSAALSIEE FDTQARTFEA EEEANKNTKS LASSPMSVLS CVHKDNENIP HFAHGMCRAC YVDYVRISGG SVGGADPPAF MRAEAKRKID AKQKLLLPAL SSGELGDEDA DGLVAKEDNE VIDTLSDVDD DEIDSYIHNE NEVNLRRLVW SEMNKEYLEF QALKEQAASR TSAPTKKKHR KAPDTLPAET PAEAARQVLA KKKGSSKINY EALENLFKVS DGSQPPPNSK ATSDVENDAS PTKSPRTRRA RPAGLPSSAP MSTKSTAKRR GSSVSTHAPS SARPSGLAKK PSAKKK
|
| |