Gene OSTLU_119550 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_119550 
SymbolUtp6 
ID5000524 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009356 
Strand
Start bp531348 
End bp532532 
Gene Length1185 bp 
Protein Length394 aa 
Translation table 
GC content48% 
IMG OID640415945 
ProductU3 snoRNP protein Utp6p 
Protein accessionXP_001416173 
Protein GI145342257 
COG category[R] General function prediction only 
COG ID[COG5191] Uncharacterized conserved protein, contains HAT (Half-A-TPR) repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0217444 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCGATA CGAGCATGGT GCTGGAGCGC GCATCGCAAG ATTTGATGAA ACTAGAACTC 
GCTGGTATCA TCACACCGCA GGAACGTAAG CATATATTTT CAACGCGCTC CAAACATGAG
AATTCGCTGG CACGGCAGCG TGGTTCCAGT GAACGCGCTT TTCTTGCATA CAGTTCGTAT
GAGCTCGCTC TTGAGACTAC GCTGCGGCTA CGAATGAGAA AGAGCCGACA TCACGCTCTC
GCATCGACAG TCCAACAGCA CGTACGCCGA GTCCTGGGTC GCGCTGTTCG TCAAAATACT
TCGAGCCTTC ATCTCTGGTA CGTATTTGTA CAGTACTGCG AGAAGCATGG TAGCAACAAA
TTATTATCTC GTGTGCTTGC GAAAGCGCTG AGATATCACA CCGACAGCGT TGGACTTTGG
TTGTATGCCG CAACCTTCGA GTTTCAGAAA AACCTCAATG TGAATGCTGC TCGAGCTATC
TTGCAACGTG CGCTGCGAAA CTGCGCCGAA AAAGGCGACA TCTGGTTAGC TTACTTCAAG
ATGGAAATTA TGTATGCGAA CGTCATCAAG TCTCGGAAAG CGGCGCTACT ATCCCAAAGT
TTGACCGAAG CGACTGAGAG TAGCAAGGGT GGGATCCAAC TGAAGGTTGA AGATGGCGCG
ATTGCATTTC TGGTCTTTCA CAAGGGCGCG TCTCAACAAG AACGAGATTC AAACTTGTGC
CTGAGGATGC TCTGTGCTGC CATGGGCCTA CATCACGCTA GTACACTAAT CGAGCGCATG
CTCGCATCAC GCATAGCATA CGAGCAAGAG ACAGCTACGG TAGCAAATTT TATTCTTCAT
AAGAAACGAA CTCGTGCTAC TTATCGCAGC TTGCATAAGT TGCTTAGTAC GCGCGACGTA
TCAGCGCACA TGGTGAACCA ACTTCATGCC CTCCGGCGGA CCCTGGAAGA ATCTGAACTC
AGCACCAGTA CCTGGAGTAA ATTATCTTTC CTATTAAAAC GTGTCCATTA TTCGGTGGAA
GAAAAAGCCT TGCAAATGTT GAACAAGGCG CTCGTACTGC ATTCGCTTGG TGTGGACACA
AGATTTTCAA AGAGGCAGGG CGCACACGCA ATGATAACAT GTTTCAACTT TGAAGAACAC
GAATTTCTAC AAAAAGGGCA CTTTTTCAGT GCCCGAATGT ATTGA
 
Protein sequence
MGDTSMVLER ASQDLMKLEL AGIITPQERK HIFSTRSKHE NSLARQRGSS ERAFLAYSSY 
ELALETTLRL RMRKSRHHAL ASTVQQHVRR VLGRAVRQNT SSLHLWYVFV QYCEKHGSNK
LLSRVLAKAL RYHTDSVGLW LYAATFEFQK NLNVNAARAI LQRALRNCAE KGDIWLAYFK
MEIMYANVIK SRKAALLSQS LTEATESSKG GIQLKVEDGA IAFLVFHKGA SQQERDSNLC
LRMLCAAMGL HHASTLIERM LASRIAYEQE TATVANFILH KKRTRATYRS LHKLLSTRDV
SAHMVNQLHA LRRTLEESEL STSTWSKLSF LLKRVHYSVE EKALQMLNKA LVLHSLGVDT
RFSKRQGAHA MITCFNFEEH EFLQKGHFFS ARMY