Gene Information |
Locus tag | OSTLU_30439 |
Symbol | |
ID | 5001085 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | - |
Start bp | 194230 |
End bp | 197052 |
Gene Length | 2823 bp |
Protein Length | 940 aa |
Translation table | |
GC content | 56% |
IMG OID | 640416506 |
Product | predicted protein |
Protein accession | XP_001416871 |
Protein GI | 145344713 |
COG category | [R] General function prediction only |
COG ID | [COG5210] GTPase-activating protein |
TIGRFAM ID | |
|
|
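The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the terminal stop codon. Neither convention is stated in the record, although both are consistent with the values shown. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 194230
end_bp = 197052
gene_length_bp = 2823
protein_length_aa = 940


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length, computed_protein_length)  # prints: 2823 940
```

Under these assumptions the 2823 bp span corresponds to 941 codons, that is 940 residues plus one stop codon, which agrees with the Protein Length field.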
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0509505 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGGAGA CGTACCCGGG GCACTATCGC GCGCTGGTGG ATCTGAGAGG GGACGAGGAG ACGCACAGCG CGGAGACGTC GGGGCAGATT GATAAGGATT TGCCGCGCGT CGGCGGCGCG TTTCGAAACG CGTTGGATCT GAGCGAGCCG GGAACGAAGG ATTGGAACGC GCTGAAGCGA GTGCTGCTGG CGTTCGTGTC GCACGAACCG GAGATCGGGT ACGTGCAGTC GATGCACTCG ATCGCGGCGT TTTTGTTGCT GGCGGGATTG GACGAGGAGG ACGCGTTTTG GTGCCTGGTG CAGCTGGTGG GGGAAATCGT GCCGGGGTAC TTTTCGGAAG GGATGACGGC GGCGAAACTC GATCAGAGGG TGTTTATGAG GATATTGCGC GAGCGTTTGC CGAGCGTTGG CTTGCACGTG GGGGCGCTGG GGCCGGATGA TATCATCGCG GCCATCATGA GCGGGCAGTG GTTGCTCACG CTCTTCGTCA ACGTGTTACC GACGCGAGCG ACGATGGAGG TGTGGGATGA GATGTTTAGG CACCGACACC GCGCGCCGCT ATTTGCGGCG TGCGTCGCGC TGCTCGAACG GAACGCGCAG GCAATTTTAG CGACCACGGA GATGGGCGAG GCGATCGAGC TGCTGCAACG ATGCAGCGAG AGTTTGCGCC GAGCGCCGGC AAGCGAAGGC GAGGGCGACG CGTGCGATGA GGCGGAGTGC GACGCGTTTC TAGCGCGAGT TCGCGAGTTA TTGAGCAACG AACTTTCGCC CGCGAAAGTA GACGAACTTA CGGCGCGCGT GCGAGGGAAG TTTCGACGTC CGAGCGACGT GCGGTTGCCC GCCGCGATTA CAAACGTGTC GGCGTTGACG GATGTGGATG ACTTGTACGT CGGTCTACTC TCGGGAGATC TACGAGATAA AATGACGGCC GCGCACGAAA ATATGTGCGC GACTGAAAAT TTGAAAGATG AGTTATCCAT GCTCAAGGAT GCATCGCTCG CGAGCAGTGT TCATGATTCC GACATGAATG GCGAACCCGT CGAAAACGAG TCGCAGCGTG AGCGACACTC GAGTGATGAC AGGAACTCAC CCGAATCTAT TCTCAAAACG AGTGAGCTGA ATTCAATCTT TGCCAAAGTG GTGTCGATTG AGACGCACGC GAATGCGCTC ACAGAGCATG GCGATAAAGT GTTGACGTGC GTGAAACAAA TCGTTTTGCG ACCGTTGCGC GCCATTCTGG AGTCGCGACT GAAATCATTT TGCGAAGAGA TTCGCTTTCT TCAAGGTGAA TTCGCGACAA AAGTCATCAC ACTTCGCAAC ACTATGGATC AACAGTTGGA AATGGCGCCA GATTTGAGCG TGTTTCTCGC TTCGCAATCG TACAACAGAT CGATGTGGCC GCTTTGGACA GAGTCAATTT TTGAGGTCAC AATCGAACAG GCGGAAATTG TCCTCGAGTC CCTCGAACAA ATCCGTGCTG AACTCGCGTG GATGGTGAGC GTGCTCGTCG GTGAGCGTAA AAAAAGTGTC ACACCGAAAA TATCCCGAGC AGATGGTTGG GAAGACCTCG GAGAGGACAC AAACGCGGAG GCTTCAATCA CCGCCGAACC GGACGAGGAA TATCTCGCCG CGTCTTTTCA GTCTCATAAG AACGAGACGG AGAATAGGTT GAACGAAATT AGAATCATGG TGAAGCGCAC GCACACGGAA GCGATCGAAG ACTTGCCGAG GTTGAGGCAA AATTTAAGCG CGACTACGGC AAAACTCAAG GACGAATTAT CCGCGGAGGA GGACGCGGTG 
AACGATTGGG CGGCGAACGC GGAGAGCCGC AACAAGATAA AGCAGGAATC AACGGAGAAA AAGCTCGAGC AGGTCTCACG CGAACTTATG AAGGCGTATG ATAAGTCGCA TATTGCCATT GATGACGCGT TGGCGGCGAG CGATAGCGAG ACTGGCAGTG TGAGCGCAAA AGACGGCTTG GAGCAATCGC ACGCGGATGA AGAAAGACGT CTTCGAGCGG CTTTGCAAAG CGTCAAGACG CTCGAATCGG TTCTGAGTCG ACGCTCGAAC GCGCTGGAGC GTGAGAAGCA AACATCACAA AGCGTGCGGG CGATATCTTC GCGCACGTTA TCTCAGACCA ACGCGTCGTT ATTGCTCGTC TTTGAAGCGG GCGAGTGGTT GGAGCACGAG CTCGTCGCAC GGGGCTCAAA GGATTTCGTC GACGATTTTG AGCGACTCGC CGACATCATC GAAGAATTGA GTGCGAGCGC GAGAAATTCT TGCGTGAGAA TCCTTCGTGA GTGGTCATCG TTCGTTCGCA AAATCACGAG CCAGGTCGTT CTGGAGTACG TCAGCATCAT CGACTCGTCC GCACAGGCGC TCAGCGACAC GCAAGCCGCA CTCGCCGCAT CGACGAGTCG GCTAGACGCG AGCTCTTCGC TCTCTACATC ACCTGGATCA ATACGTGAGC TAAATCCATC GCACGCATCG CCGATGTCGG GCAACTCTCT CCGCTCGCTG ACGAGTCAGA TGGAAAAACT AACCGACATC GCCGGATCCA AGCTCGGCAG CTTTGGCAAT CGTCTGCGCA GTTTTGCCTC GCCGACGCCG TCAAAGGCTT CGATGAATCA AAACGCGCCA AATGATGTTG AAACGGTCTC GTCAACGAGC CCTGGTAACG ACGCGTCATC GACGAATTCG AAATTCTCCA GGCTGGCTGC CTCCGCGCGC GAGGACGGCG CCAGACTGGA ATCGCGCAAA GCGCAGTTGC TCGCGAAGAA GCGCTGGCTG CGCGAGCACC TCGTCGCGAG AGGTGAAGCT TAG
|
Protein sequence | MRETYPGHYR ALVDLRGDEE THSAETSGQI DKDLPRVGGA FRNALDLSEP GTKDWNALKR VLLAFVSHEP EIGYVQSMHS IAAFLLLAGL DEEDAFWCLV QLVGEIVPGY FSEGMTAAKL DQRVFMRILR ERLPSVGLHV GALGPDDIIA AIMSGQWLLT LFVNVLPTRA TMEVWDEMFR HRHRAPLFAA CVALLERNAQ AILATTEMGE AIELLQRCSE SLRRAPASEG EGDACDEAEC DAFLARVREL LSNELSPAKV DELTARVRGK FRRPSDVRLP AAITNVSALT DVDDLYVGLL SGDLRDKMTA AHENMCATEN LKDELSMLKD ASLASSVHDS DMNGEPVENE SQRERHSSDD RNSPESILKT SELNSIFAKV VSIETHANAL TEHGDKVLTC VKQIVLRPLR AILESRLKSF CEEIRFLQGE FATKVITLRN TMDQQLEMAP DLSVFLASQS YNRSMWPLWT ESIFEVTIEQ AEIVLESLEQ IRAELAWMVS VLVGERKKSV TPKISRADGW EDLGEDTNAE ASITAEPDEE YLAASFQSHK NETENRLNEI RIMVKRTHTE AIEDLPRLRQ NLSATTAKLK DELSAEEDAV NDWAANAESR NKIKQESTEK KLEQVSRELM KAYDKSHIAI DDALAASDSE TGSVSAKDGL EQSHADEERR LRAALQSVKT LESVLSRRSN ALEREKQTSQ SVRAISSRTL SQTNASLLLV FEAGEWLEHE LVARGSKDFV DDFERLADII EELSASARNS CVRILREWSS FVRKITSQVV LEYVSIIDSS AQALSDTQAA LAASTSRLDA SSSLSTSPGS IRELNPSHAS PMSGNSLRSL TSQMEKLTDI AGSKLGSFGN RLRSFASPTP SKASMNQNAP NDVETVSSTS PGNDASSTNS KFSRLAASAR EDGARLESRK AQLLAKKRWL REHLVARGEA
|
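The sequences above are printed in space-separated groups of 10, so they need cleaning before any analysis. The sketch below is a minimal illustration that strips the whitespace, computes GC content, and translates a short fragment. It assumes the standard genetic code (NCBI table 1), because the Translation table field in this record is empty, and it assumes the Gene sequence is already given on the coding strand, which is consistent with it starting with ATG and ending with TAG. The helper names are hypothetical.

```python
from itertools import product

# Standard genetic code (assumed; the record does not name a translation table).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the printed sequence."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the Gene sequence field.
fragment = clean("ATGCGGGAGA CGTACCCGGG GCACTATCGC")
print(translate(fragment))  # prints: MRETYPGHYR
```

The translated fragment matches the first 10 residues of the Protein sequence field. Applying `clean` and `gc_percent` to the full Gene sequence would give a value to compare with the GC content field.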
| |