Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_28688 |
Symbol | |
ID | 4999580 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009355 |
Strand | - |
Start bp | 143477 |
End bp | 146602 |
Gene Length | 3126 bp |
Protein Length | 936 aa |
Translation table | |
GC content | 61% |
IMG OID | 640415001 |
Product | predicted protein |
Protein accession | XP_001415744 |
Protein GI | 145341286 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0163307 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGAACG AACCCGACGC GCGCGCCGCG ACGGCGACGG GCGCGCTCGC GCTCGACGAC GCCGCGAGCG CCAAGTTCGT GCGGTTTTAC CGCGGGCTTC CGAGCGAGAC GGCGCGCGTG GTGCGGTTTT TCGATCGTAA AGATTGCATC AGCGCGCACG GCGATGACGC GATGTACATC GCGCGGGCGT TCTATAAGGT GCGTCGCGAG CGCGCGGGCG CGCGGGTGCG ATCTTCGAAC GCGTCGGCGC GCGCGCGGGG CGCGGGGCGC GCGCGGAGGC GACGAGAACG CGGATGATGA TTGGTGATTT TTGAATATCG GCGCGCGGCT GTCGCCGAGA CGCGAAATCG GCGACGAGCG CGGCGGCGCG ATCGTCGAGG GTGCGCTCGA GCGATGCGCG GGCGAGATGA CTGACGCGAA GCGTTTGACC GTTTGGTCCA CTTTTGACGT CGGAGCAGAC GACGAGCGTG ATCAAGACGA TGGGATCGGG CGACGACGCG TTGCCGGGCG TCGCGCTCAA CCGATCGATG TTCGAGAGCG CGCTTCGCGA ACTTTTACTC GATGGTGACG GCGCGCGCGT GGAGTTTTAC GAAGAGAGTA AACCGAGCGG GACGTGGACG TGCGTGAAGT CGGCGTCGCC GGGGAAATTG CAAGCGTTCG AGGATGAGCT GTTTCGGTCG AACGAGATGT CGGACGCGTC TGTGGTGTGC GCGGTGCGCG TGGCGAACGG GAACGTGGGC GTGGCGTACG CCAACACGAC GACTCGGGAG CTCGGGGCGT GCGCGTTCGT GGATGATGAG CAGTATTGCA CGCTGGAAAG CGTGTTATGT CAGATTGGAG TCAAAGAGTG CGTCGTGCCC AAGGAAGGGA CCGAGACGCC GGAGGGGCGT CGATTGCGAG ACGTCGTCTC GCGATGCGGT GCGCTCGCCA CGGAGCGTCA GGCCCGCGAT TTCGACGCGC AAGACTTGGA GAATGATTTG GGGCGACTCG TGCGAGGAAA CGTTGAGGCG CATCGGGCGG TGATTGATCA ATCGCACGCC GCAGCGTGTC TCGCGGCGGT TTTGCGATTC AGCGAAATGT TGGCGGATAG CGCCAACCAC GGTCGATGCA CGCTGAGTAT GTACGACACC GGTCGGTACA TGCGACTCGA TGCTTCGGCA CTGCGCGCTC TGAATGTTCT CCCTGAGCGC TCTGACGGCC CGAGCAGCTT CAGTTTGTAC GGTTTATTGA ACAAGTGCCG AACGCCGATG GGGCGCCGTT TGCTCTCGCG ATGGCTGAAG CAGCCGCTCG TGGACGTTAA CGAAATCGCC ACGCGACACG ATGTCGTGAA CGAATTCGTC ACCAATGCAG AGGTTCGTGA CGCCCTGCGC GGTGCGCACT TGCGAGCGTT GCCGGACATC GAACGGATCA CTCGTAAGCT CGAACGCCGC AAGGCGTCGC TCATGGATTT GTGCCGATTG TACCAAGCGA GCGCGGCGCT TCCACACATG GCCGAGGCGC TCGAACGGTG CGAGGGACGT CATGGCGATT ACATTCGCAA GAAGTATGCG GAAGAACTCA AAAAGCTGAG CGCCCCGAGT CATCTCGGCC GGTTCGAGGC GCTCTTAGAA GCGGCAGTGG ATCTGAGCAA GATTCCCGAC GAGTACGTCA TTTGCGCTTC GTACGACGCC GAGCTGGGCG AGTTGCAAAA ACAGAAGGAT ACACTGGAGA AGCAAATCCG TGATGCGTTT GCGGATGCGA GCGATGATTT GGGCATGGAA CGCGATAAGC AATTAAAGTT AGAACACAAC AACATGCATG GTTGGTTCAT GCGATTGACG AAGAAAGACG AAACGAGCGT GCGAAAAAAG CTCAGCGTGA GTTATCAGAT TCTGGAAGCG AAGAAGGATG GCACGAAGTT TACGAACAAG AAGATTCGTG GTCTTTCGGA GCAGCGTGTA TCTCTCGACA GAAGCTACGA CGCGAAGCAG CGACACTTGG TCGATCGCGT CGTCGACGTC GCGGCGACGT TTTCCGAAAT CTTTCTGAGC GTTTCGGCGA TGACGGCGGA AATCGACGTC CTCGCGTCTT TCGCTGAGGT CGCCGTCAGC GCGCCGGTGC CGTTCGTGCG CCCGATTATG CACGAGAAGA CGTCTGATAC GATCCACCTG GAGAACTCGC GCCATCCAAA CGTCGAAGCG CAGGACAACG TGCGATTCAT CGCCAACACG TGCTCGATGA AGAAGGGCGA GTCTTGGTTT CAAATTATCA CTGGCCCAAA CATGGGCGGT AAGAGTACAT TTATTCGCCA AGTTGGCGTG TGCGTGCTCT TAGCGCAGGT TGGAAGCTTC GTGCCGTGCG ACGACGCCGT GATTGCTGTA CGAGATGCCA TCTTCGCGCG CGTCGGCGCC GGCGACTGTC AACTCCGCGG CATTTCCACC TTCATGGCGG AGATGTTAGA AACCGCCGCC ATCTTGAAGG CCGCGACTTC ATCGAGTCTC GTTATCATCG ATGAACTCGG CCGCGGAACG AGTACATACG ACGGATTCGG TCTCGCGTGG GCGATAAGCG AGCACATCGT CAATGAGATT CAAGCGCCGT GTCTGTTTGC CACTCACTTT CACGAGCTCA CCGCGCTCGA AGGCCCGAGT GGCGTCTCCA ACTTTCACGT CGAGGCGCTC ATCGACCAGG AGAGCCGAAA GCTCACCATG TTGTACCAAA TCAAGCCGGG GGCGTGCGAT CAGTCGTTCG GGATTCACTG CGCCGAGTTC GCGCGGTTCC CCGAAGAAGT GCTCAAAATC GCTCGCGCAA AGGCGGACGA GCTCGAAGAC TTTTCCAAAT CCGGCGCCGA GCGCGCCGTC GCCGACATCT CCGACCCCAA GCGCCAACGA ACCGATGAAC CCGGTGTATC CGACGACATG GCCCGCGGCG TCGTCCGCGC TCGCCAATTC CTCTCCGACT TCGCCGCCGT CCCTCTCGAC CGCATGACGC CCGCCGAAGC CGTCGCGCGC GCTCGACAAC TCAAATCCGA GCTCGAGACC GACGCCAAGC ACTCCCCTTG GCTCCTCGAC GTCCTCTCCA ACGCCGCGTG ATCACTCAGC CGTCCGCGAT CCGCGCGCGC GTCGCGTTCG ACCGCGTCGC GCCCAG
|
Protein sequence | MSNEPDARAA TATGALALDD AASAKFVRFY RGLPSETARV VRFFDRKDCI SAHGDDAMYI ARAFYKTTSV IKTMGSGDDA LPGVALNRSM FESALRELLL DGDGARVEFY EESKPSGTWT CVKSASPGKL QAFEDELFRS NEMSDASVVC AVRVANGNVG VAYANTTTRE LGACAFVDDE QYCTLESVLC QIGVKECVVP KEGTETPEGR RLRDVVSRCG ALATERQARD FDAQDLENDL GRLVRGNVEA HRAVIDQSHA AACLAAVLRF SEMLADSANH GRCTLSMYDT GRYMRLDASA LRALNVLPER SDGPSSFSLY GLLNKCRTPM GRRLLSRWLK QPLVDVNEIA TRHDVVNEFV TNAEVRDALR GAHLRALPDI ERITRKLERR KASLMDLCRL YQASAALPHM AEALERCEGR HGDYIRKKYA EELKKLSAPS HLGRFEALLE AAVDLSKIPD EYVICASYDA ELGELQKQKD TLEKQIRDAF ADASDDLGME RDKQLKLEHN NMHGWFMRLT KKDETSVRKK LSVSYQILEA KKDGTKFTNK KIRGLSEQRV SLDRSYDAKQ RHLVDRVVDV AATFSEIFLS VSAMTAEIDV LASFAEVAVS APVPFVRPIM HEKTSDTIHL ENSRHPNVEA QDNVRFIANT CSMKKGESWF QIITGPNMGG KSTFIRQVGV CVLLAQVGSF VPCDDAVIAV RDAIFARVGA GDCQLRGIST FMAEMLETAA ILKAATSSSL VIIDELGRGT STYDGFGLAW AISEHIVNEI QAPCLFATHF HELTALEGPS GVSNFHVEAL IDQESRKLTM LYQIKPGACD QSFGIHCAEF ARFPEEVLKI ARAKADELED FSKSGAERAV ADISDPKRQR TDEPGVSDDM ARGVVRARQF LSDFAAVPLD RMTPAEAVAR ARQLKSELET DAKHSPWLLD VLSNAA
|
| |