Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_87576 |
Symbol | GTA3501 |
ID | 5002484 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009360 |
Strand | + |
Start bp | 557529 |
End bp | 560345 |
Gene Length | 2817 bp |
Protein Length | 938 aa |
Translation table | |
GC content | 58% |
IMG OID | 640417905 |
Product | predicted protein |
Protein accession | XP_001418275 |
Protein GI | 145347649 |
COG category | [K] Transcription |
COG ID | [COG5164] Transcription elongation factor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.571615 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACACGC GTCTGCACGC GCGGATGGAG AATCAAGGAC AGTTTGAACG CATGGAGAAC GAGGAGGAGT TGGAGCGCAT GATCAAGGAG CGGTACGCGG CGCCGAGGTA CGAAGCGGGG ACGGGGCAGT TGGACGCCAA CGTCGATCAG CAGGCGCTGC ATCCGACGGT GCGTGATCCC AAGCTTTGGC TTGTCACGGT GAAGCTCGGG AAAGAGCGCG AGACCGTCGT GTGCTTGATG CAAAAGACGA TCAACCTGGC CAAGCAAGGG AAACCCCCCA TGCAAATCTT GTCGTGCATC GCGCAGGATC ACCTCAAGGG ATACATCTAC GTCGAAGCCG AGCGTGAAGA TCATGTGCGA AAAGCGTTAC AAGGAATGAG ACACGTGTAC CACGGTAAAC CTGTTCGATT GGTGCCGATT AACGAAATGG TCGACTCGAT CAGCGTCACA ACAAAGGAAG TCTCCGTGGT CAAGGTGGAT TCGTGGGTTC GCATGCGCAC CGGCGTGTAC AAGGGAGACT TGGCGCAGGT CGTCGACGTG AACTACGCCG ATAACCAGTG TACGGTCAAA TTGGTCCCGC GCATAGACTA CCAGCACCTC GCCGATAAGG AATCAGGCAA GGCGAAGGGC AAGACAAAAT CTCAAGTCCG CCCACCAGCG CGTCTTTTCG CGGAAGCCGA AGCCAAGCGA TTGAATTTGT CTTTCGAAAG AGGCCGATAC GATCGCAGCT TGGGTGCGAG CGTGGACGTA TTGTGCAGCA CGACGAAGCT CTTGGATGGG TATCACATGA AGACGTGCTC TTTAGCCACA GTCAAGCTCG CCGATGCGCC GACGTTGGAT GAACTTCAAA AGTTCGCCAT CGGTGACGAC GAGGACGGTG AGAAGGGCGG TAACTCCAGC GGCGCGTTGG CCGCGCTGTC CAAAGCCGTC GGTACTCGCA AGACTGATTT GAAGTTCATG CCGGGCGACC AAGTAATCAT CGTCGAGGGC GATTTGAGAA ACCTTGAGGG CGTCGTCGAG CGAATCGACC CCGACGGACG CGTCGTCGTC AATCCGTCGC ATAAAGAGCT GAACGAGTTA CTGACGTTCA AGTCCGAACA ACTGCGAAAG CACTTCAAGA CTGGTTCCAC TGTGCGCGTG TTGCACGGAA AGCACGAGGG CGTGGTCGGT ATGGTGGTGA AGGTGGAACG CGACGTAGCG CACATCTTCT CGACGGTGAG TAACGAAGAG TTCCAAGTCT TCATGCACGA TCTTGCCGAT AGCGAAGAAA CCACGCAGCG TATCGACACG ATTGGTGAGT ACGCTTTGCA TGATTTGGCG ATGCTCGAAG GCTCTGAAGT CGGTTGCATC ATTCGTGTCG AGAAGGATGT CGCCTTCGTC ATGACCAATG CCGGCACCCC AGACCGACCC GAGATCAGGC CCGTAAAGCT TCATGAGTTG AAAAAGAAGC TGCTGAGTAG GAACATTTCC GCACAAGACG CTCACATGGA CACCATCGAT CAAGGGTCGA TGGTTCGCAT CATAGACGGC AAGTACAAGG ACACCACGGG TACGGTAGAG CACATCTTCA AGGGTACGCT CTGGATTCGA GCTCGGCACG TACAAGAACA CGGCGGTATC GTTTGCATTC GCGCTCGAAA TTGCGTGGCA CACGGCGGTA ACAAGGGTTC CCAAATTGGT GGTGGTCTGG CTGCCCAAAT GATGGGCGCG CATGGCATGC CGCCGAAGTC TCCAGGTCAT GCTCTCTTGC AGAGTTCGTA CACTTCTGGT TTGCGTGGTG ATTTGATGTC TCAAAGTCTT CAAGCTCCGA GAGCTGCACC ACCACGTGCT TTTGGTGGAC CTGGTCGCCG CGGCCAGGAT CCGCTCATCG GTCAAACGAA GAAGGTTCGC GCTGGTGTGT ACAAGGGATA CATCGGACGC ATCGTCGACG TCACCGACAC GTCCGTCCGT CTCGAGCTTC AAGCGCAGGC ACGCACTGTT ACGGTCAATC GCGAGCACTT AGACGTCCCA CAAGTTGCCC CGAGTCGTGA CTCTTTCTTG GCTCCTCGCG CGACGAGCAT GTACGATGCT CCTGGCTCGA GAACTCCGGC GCACTACCCG ATGACACCGG CTCACGGCGG CGGTATGACG CCAATGCACG GCGGTATGAC GCCCGCGCGC GAGGCCGCTT GGAACCCCAC CGCGACACCG GCGCACATCC AGGACAACTG GGAGCCGACG TCTACTGCAG GCACCGGGTG GGGTGCTGGA AATGTCGGTT ACACGCCAGG CGGGTACGGC GACTCCAGCG CCCTGGCGGG TCCGACACCG AACGCATACG GCGTCGGCGC CACGCCAGGT GCCGCCTACG GTCAAACGCC CGGTGGATAC GGTCAAACGC CTGGTGGGTA CGAAGCAGAT GAATACGTCG CCCCACCGGC GGCAGCGTCG TGGCCGGAAG ATTACAAGGG TCGTTTCCTA CCGGGTGTCG TCGTGCGTTT GACGAGTGGC GCGCAAGGTT ACATCACCTC TGTCGCGCCG GCGGGTTCTA GTTTCAAGGT GAAAATCGGT ACCTCGCGCC CGCGTGACGG TGTGGAGGTC TTAGAGACCG TGCCGAAGTC AGCACCCGAA GAGACCGTCA CAGAACAAGA GCTCGAGATC GTGCGACCGG GAAAGAAGAC GTCAGTCATC ATCGTCACCG ATTCTGGAGA CGCGTCCCGC GGCGACACCG GCGAGCTCAT CACCATCGAT GGTGTCGACG GTGTCGTGCG TTTAAGTAGC ACGAATGACG TCGTACTCTT AGACATGTCG TGCCTGGCTC GTCGCTGGGT CGAATAA
|
Protein sequence | MDTRLHARME NQGQFERMEN EEELERMIKE RYAAPRYEAG TGQLDANVDQ QALHPTVRDP KLWLVTVKLG KERETVVCLM QKTINLAKQG KPPMQILSCI AQDHLKGYIY VEAEREDHVR KALQGMRHVY HGKPVRLVPI NEMVDSISVT TKEVSVVKVD SWVRMRTGVY KGDLAQVVDV NYADNQCTVK LVPRIDYQHL ADKESGKAKG KTKSQVRPPA RLFAEAEAKR LNLSFERGRY DRSLGASVDV LCSTTKLLDG YHMKTCSLAT VKLADAPTLD ELQKFAIGDD EDGEKGGNSS GALAALSKAV GTRKTDLKFM PGDQVIIVEG DLRNLEGVVE RIDPDGRVVV NPSHKELNEL LTFKSEQLRK HFKTGSTVRV LHGKHEGVVG MVVKVERDVA HIFSTVSNEE FQVFMHDLAD SEETTQRIDT IGEYALHDLA MLEGSEVGCI IRVEKDVAFV MTNAGTPDRP EIRPVKLHEL KKKLLSRNIS AQDAHMDTID QGSMVRIIDG KYKDTTGTVE HIFKGTLWIR ARHVQEHGGI VCIRARNCVA HGGNKGSQIG GGLAAQMMGA HGMPPKSPGH ALLQSSYTSG LRGDLMSQSL QAPRAAPPRA FGGPGRRGQD PLIGQTKKVR AGVYKGYIGR IVDVTDTSVR LELQAQARTV TVNREHLDVP QVAPSRDSFL APRATSMYDA PGSRTPAHYP MTPAHGGGMT PMHGGMTPAR EAAWNPTATP AHIQDNWEPT STAGTGWGAG NVGYTPGGYG DSSALAGPTP NAYGVGATPG AAYGQTPGGY GQTPGGYEAD EYVAPPAAAS WPEDYKGRFL PGVVVRLTSG AQGYITSVAP AGSSFKVKIG TSRPRDGVEV LETVPKSAPE ETVTEQELEI VRPGKKTSVI IVTDSGDASR GDTGELITID GVDGVVRLSS TNDVVLLDMS CLARRWVE
|
| |