Gene OSTLU_28809 details


Gene Information

Locus tag: OSTLU_28809
Symbol:
ID: 4999764
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009355
Strand:
Start bp: 344867
End bp: 346033
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table:
GC content: 64%
IMG OID: 640415185
Product: predicted protein
Protein accession: XP_001415461
Protein GI: 145340706
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG1957] Inosine-uridine nucleoside N-ribohydrolase
TIGRFAM ID:
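
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the unspliced CDS is a stop codon that is not counted in the protein length. The function names `span_length` and `protein_length` are hypothetical helpers introduced only for this illustration.

```python
# Consistency check for the record's coordinate and length fields.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the last codon is a stop codon excluded from the protein.

def span_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of an inclusive coordinate span."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the terminal stop codon


gene_len = span_length(344867, 346033)
print(gene_len, protein_length(gene_len))  # prints: 1167 388
```

Both results agree with the Gene Length and Protein Length fields of this record.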


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.00340732
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTGGGTGG ATTGCGACCC GGGACACGAC GACGCGTTCG CGCTGTACCT CGCGCTGCAC 
GGGACGGCGG AGCGCGCGCG CGTCTTGGTC GGCGCGTCCG CGACGCACGG GAACGCGAGC
GCGGGAAAGA CGACGGTGAA CGCGCTGCGC GCGCTGGCGT GGATCGGCGC GCGGGCGTCG
CGAGACGGGG GGGAAGAAGG GGGGCGCGGG ACGCGGGACG CGCGACGCGT GGCGTTCGCG
GCGGGCGCCG AGAGGCCGTT GCTTCGCGCG TCGAGCGCGT GCGAGGAGAT TCACGGCGAG
AGCGGGTTGG ATTCGGCGAG CGCGACGGAT GCGCTGGAGG AGTACGCGTG GGGAGAGGAC
GAGAGAGATT GGATGGGGGC GGGCGGGATC GAGGCGCTGT TTCATCCGCG CGCGAAGACG
GCGGCGGAGG CGATGTACGA CGCGTTCGTG CGGCACGTCG AAACTTCGGG AAGAGGACGG
CCGTTCGTCG TCGTCGCCAC GGGGCCGTTG ACGAACGTGG CGACGATGAT TCTCGCCAAG
CGTTCGCAGA TTGATTTATT TCCCGACGAA TGTCGACCCG TAATTTTTTG CATGGGCGGC
GCCGTCGGCG ACGGCAACAC CGGCGCTCGA GCGGAGTTCA ACATTCAGTG CGATCCCGAA
GCGGCGAAAA TAGTCTTCGA GAGCGGTTTG CGAGTCTACA TGATACCGCT CGAGGTGACG
CACACGGCGA TCGTCACCCC GAGCGTTCTC GACAGCTTGA CCACCGGTGG TCATTTCGAC
TCCGGCAAAG CGGGCGCGAG CGCGCACGCC AAGCAAATCA GATCGCTCTT GACATTCTTC
AAGGACACGT ACGAGAACGT CTTCGATTTC AAGACGGGAC CGCCTTTACA CGACCCCTGC
GCCGTGTGGG CTGCGATTAA TTTTATCGAG GGCGCCACGT ACGATGAATC CGAAACGGAC
GATAGATTTC GAGACTTGTT CGAATTCACG CACGAACGCG TCGACGTCGA GTGCGAGTCG
CGACTGACGT ACGGCCAAAC CGTCATCGAC CGCTGGGGGA CGAGCGCCGA GCCGAAGAAC
GTGTACGTCG CGAGAAGCAT GAACGTCGAT CGCTTTTGGG AAGCCATGCG CGACGCGATT
CAGCACCGTT TGAGCTGGAT GTCTTAG
 
Protein sequence
MWVDCDPGHD DAFALYLALH GTAERARVLV GASATHGNAS AGKTTVNALR ALAWIGARAS 
RDGGEEGGRG TRDARRVAFA AGAERPLLRA SSACEEIHGE SGLDSASATD ALEEYAWGED
ERDWMGAGGI EALFHPRAKT AAEAMYDAFV RHVETSGRGR PFVVVATGPL TNVATMILAK
RSQIDLFPDE CRPVIFCMGG AVGDGNTGAR AEFNIQCDPE AAKIVFESGL RVYMIPLEVT
HTAIVTPSVL DSLTTGGHFD SGKAGASAHA KQIRSLLTFF KDTYENVFDF KTGPPLHDPC
AVWAAINFIE GATYDESETD DRFRDLFEFT HERVDVECES RLTYGQTVID RWGTSAEPKN
VYVARSMNVD RFWEAMRDAI QHRLSWMS