Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_43971 |
Symbol | |
ID | 5004500 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009365 |
Strand | + |
Start bp | 570214 |
End bp | 573162 |
Gene Length | 2949 bp |
Protein Length | 962 aa |
Translation table | |
GC content | 60% |
IMG OID | 640419921 |
Product | predicted protein |
Protein accession | XP_001420386 |
Protein GI | 145352079 |
COG category | [K] Transcription |
COG ID | [COG0557] Exoribonuclease R |
TIGRFAM ID | [TIGR00358] VacB and RNase II family 3'-5' exoribonucleases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACCGCG TCGGGAAGAC GTTCGTCAAG ACGACGAAGC GCGGGCAGGC GGTGACCAAG GTGCGACGGG AGCGGTACCT GCGCGACGAC GTGTACGTCG GATGCGGGGA CGCCGCCGTC GAGGCGCGGT ATCGAGGACG CGACGAGGCG CTGTACGTGC TTCGACCGGA GCGCCGAGGC GCGAAAAGCG AGCGAACGAT CGCGCTGATC GATTCGAACG TCGCGCTGCA TCAGATGGAC GCGCTGGCGG ACGCGAGGGT GACGGATGTG GTGATTTGCG CGACGGTGAT GGAGGAGACG CGAAACAAGT CGCGGACGTC GTACGAGCGG TTACGAGGAT TGTGTAAGGA TAAAGAGAAG CGGTTTTACG TGTTTAGTAA TGAGAATCAC GCGGAGACGT ACGCGGAAGA CGTCGCGGGG GAGTCGGCGA ACGATCGGAA CGATCGAGCG ATACGCAAGG CGGCGGCGTT TTATCGCCGA GCGATTCCGC GCGCGGCGTG CGAACGCGTG ACGCTCGTGA CGAATGATCG AGGAAACGCG AGAAAGGCGC GAGAGGAGAA TATAGACGTG ATGAGCGTGA TTGAATGGAT ACAGTCGATC TCAAACGACG CGAGCTTGGT GGATTTGGTC GTCCCGGGCG GAGGGGCGGA GATTCCCGAG GACGAACGCG ACGCCAAGCG AGCGAAGAGC GGCGGCGGCG CGAGCGCGAG CGTTTTCGAG GCGTATCTCG ACGCGAATGA CATCAAGGAA GGTTTAGCGA GTGGAAAATT GATCAAAGGC GGCGTTCGAA CGTCTAGATA TAACCCTTTT GAGGCGCACG TGATGAATGA AGGTACCGGA GAAGGCGTAC TGCTGAGCGG ACGCGCGGCG ATGAATCGCG CCATCGACGG CGACTTGGTC GCCGTGGAAA TCTTGCCCGA ATCCCAATGG ATTCGTCCGG GTGGTATGAT CATCACCGGT ACCGCAGAAG AAGACAAAGA GACGTCCTCC GCGGACGTCG ACGCGAAGGA CGACGCCGGC GGACTGGCGC CGGAGACTGC GGATGAAACG GACGTCGCTG TCGCGGCGAC GAAGGCGTCG GGCGTCGTTC CCACGGGAAA AGTTGTAGGT ATCATTCGGC GCAACTGGCG CGAACGCGGT TATGCGGCGT CGGTCGACAT GGGAAGAGAC GGCAAAGGTC CAACCGCAGC CGGTGGCGGC TTCGCATCGC GCGTCTTGTG CGTGCCGAGC GATAGACGAT TTCCAAAGAT TCGCATCCAG ACGCGCCAAC TCGAAGGCTT GCTTGACCAG CGCATCGTCG TCGTCATCGA CGATTGGCCC GCGGATAGCA TGTACCCCGA AGGGCACTAC GTCAAGTCGC TCGGATTGAT CGGTAGCGTC GATGCCGAAA CACAGGCACT TCTGCTCGAA AACGACGTTG ACGATCGACC GTTCGCGCCG GCGGTGTACG CGTGCGTGCC CGCCTTGCCG TGGAAAGTCA CCGACGACCA TCTCGCCGAA CCCGGTCGCG AGGACTTGCG CGATTTATTG GTGTGCAGCG TCGATCCTCC GGGGTGTAAA GACATCGACG ATGCGCTGAG CGCGCGCGAT ATCGACGACG AGCGCATTGA AATCGGCGTG CACATCGCTG ACGTGACGTC GTTTCTGTTC CCAGACACCG CCATGGACGA AGAGGCGGCG CGTCGAGGGA CGACGACGTA CTTGGTGCAA CGCCGTTTGG ACATGCTTCC GGGAGCGCTG ACGACGGACA TATGCTCTTT GGTTGGAGGT CAAGAGCGCT TGGCGTTTTC GGCGTTTTGG ATCGTGCAAA AGTCCACGAT GCTTCCGGAT GAAACCGTCA AACCACGATT TACGAAGAGC GTCATCAAGA GCGCGGCGGC GTTGACGTAC GAGCAAGCGC AGACGCGAAT CGACGATGCA TCTTTGAACG ATGATTTGAC GCTCAGCTTG AGGCGACTTC GTGATGTCGC AAGACAGCTA CGCAAGCGGC GCATGGCGAA CGGCGCGCTG ACATTGGCTT CGCCCGAGGT GCGATTCGAG ATGGATCAGC AGTCGAGCGA TCCGCTCGAC GTGGGCATGT ACGTCACGCG CGAGACGAAT CAAATGGTTG AAGAAATGAT GTTGTTAGCC AACGTCAGCA CCGCGGAGCG TATTTTGCAA GCATATCCCG CGGCGGCGAT GTTACGCCGT CACCCGATTC CCGAGCAGAA AATGTTTGAG CCCTTACTCA AGGCGGCCAA GGCGTGCGGC GTGGACATTG ACACGCAATC CAACCGCACG CTCGCGGCGA GTCTAGACGC CGCCGTGCGA CCGGAAGACG CGTATTTCAA CACGCTATTG CGCTTACAAG CGACGCGGTG CATGTCACAG GCGGTGTACT GCAGCTCGGG CCAATACGCT GGACCAGAGC GCATGCACTA CGGTCTCGCC ATGCCGTTGT ACACGCACTT TACGTCGCCG ATTCGTCGGT ACGCCGACGT AATCGTACAC AGATTACTGA GTGCGGCGAT CGGGCTGTGT CCTCGTCACA AGTCGTTGGA GGACTCCGAT CACGTCAAAT CCGTCGCAGA CGTCTGCAAC GTGCGTCATC GCAATTCTCA GCAAGCCGGT CGCGCGTCCG TAGAGTTACA CACCTTGGTG TTTTTCCGCA AGCGCAAAAT TGTCGCCGAC GCGCGCGTCT TCAAAGTGCG CGCAAACGGA CTCGTGGTTT TTGTCCCAAA GTTTGGCATC GAAGGACCTG TGCTCTTCGT CGAGGGCGAA AATAGCGACA CCGCGACTTG TACGCTCGAC GAAGACGCTA TGACGGTGAC GCACAAAGGA AAGACGTGGA AAGTTTTCGA CCGGCTCACC GTGCGAGTGG AGGTCGAACA ACTCCCTGCG CATCGGAGTC GTTTGCTCAT CACCATCGTT CCTGACGACA CTCCGCGCGG GGAGATCGCG AGCGCGTGA
|
Protein sequence | MHRVGKTFVK TTKRGQAVTK VRRERYLRDD VYVGCGDAAV EARERTIALI DSNVALHQMD ALADARVTDV VICATVMEET RNKSRTSYER LRGLCKDKEK RFYVFSNENH AETYAEDVAG ESANDRNDRA IRKAAAFYRR AIPRAACERV TLVTNDRGNA RKAREENIDV MSVIEWIQSI SNDASLVDLV VPGGGAEIPE DERDAKRAKS GGGASASVFE AYLDANDIKE GLASGKLIKG GVRTSRYNPF EAHVMNEGTG EGVLLSGRAA MNRAIDGDLV AVEILPESQW IRPGGMIITG TAEEDKETSS ADVDAKDDAG GLAPETADET DVAVAATKAS GVVPTGKVVG IIRRNWRERG YAASVDMGRD GKGPTAAGGG FASRVLCVPS DRRFPKIRIQ TRQLEGLLDQ RIVVVIDDWP ADSMYPEGHY VKSLGLIGSV DAETQALLLE NDVDDRPFAP AVYACVPALP WKVTDDHLAE PGREDLRDLL VCSVDPPGCK DIDDALSARD IDDERIEIGV HIADVTSFLF PDTAMDEEAA RRGTTTYLVQ RRLDMLPGAL TTDICSLVGG QERLAFSAFW IVQKSTMLPD ETVKPRFTKS VIKSAAALTY EQAQTRIDDA SLNDDLTLSL RRLRDVARQL RKRRMANGAL TLASPEVRFE MDQQSSDPLD VGMYVTRETN QMVEEMMLLA NVSTAERILQ AYPAAAMLRR HPIPEQKMFE PLLKAAKACG VDIDTQSNRT LAASLDAAVR PEDAYFNTLL RLQATRCMSQ AVYCSSGQYA GPERMHYGLA MPLYTHFTSP IRRYADVIVH RLLSAAIGLC PRHKSLEDSD HVKSVADVCN VRHRNSQQAG RASVELHTLV FFRKRKIVAD ARVFKVRANG LVVFVPKFGI EGPVLFVEGE NSDTATCTLD EDAMTVTHKG KTWKVFDRLT VRVEVEQLPA HRSRLLITIV PDDTPRGEIA SA
|
| |