Gene OSTLU_49070 details

Gene Information

Locus tag: OSTLU_49070
Symbol:
ID: 5000653
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009357
Strand:
Start bp: 24718
End bp: 25903
Gene Length: 1186 bp
Protein Length: 381 aa
Translation table:
GC content: 57%
IMG OID: 640416074
Product: predicted protein
Protein accession: XP_001416824
Protein GI: 145344615
COG category: [C] Energy production and conversion
COG ID: [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases)
TIGRFAM ID:
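
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the unspliced coding region ends in a single stop codon. The function names `span_length` and `coding_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 24718
END_BP = 25903
GENE_LENGTH_BP = 1186
PROTEIN_LENGTH_AA = 381


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length(protein_aa: int) -> int:
    """Bases needed to encode the protein plus one stop codon (assumption)."""
    return 3 * protein_aa + 3


gene_len = span_length(START_BP, END_BP)
cds_len = coding_length(PROTEIN_LENGTH_AA)
flanking = gene_len - cds_len  # bases in the gene record outside the coding region

print(gene_len, cds_len, flanking)
```

Under these assumptions the span matches the listed gene length, and the gene record is somewhat longer than the coding region alone, which suggests that it includes a short stretch of flanking sequence on either side of the open reading frame.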


Plasmid Coverage information

Num covering plasmid clones: 41
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TCACATCCAT CGCACGATGG CCGCCGTGAC TTCTATCCCG CGCACGACGC TGCGTCGAAT 
CCCTCTCGGC AGCGCGAAAG ACGTTTTCGT CACCGACGTG TGCCTGGGGA CGATGACGTG
GGGCGTGCAA AACACCGAAG CCGAGGCGCA CGAGCAGTTG GATTACGCCG TCAAACAACG
AGGCGTGAAC TTCATCGACA CCGCGGAGAT GTACCCGGTG CCGTCGAGCG ATGCGCGATG
GAAACCTGGG ACGACGGAGG AAATCATCGG GAATTGGCTC GCAAAGAACG TCGAGCTGAG
AAAGGAGCTC GTCGTGGCGA CCAAGGTGAG CGGATACCAA GCCAAGAGCG AGACGGCGGG
TAACCGAACG GTGCCTGCGG GCGCGCCGTG CGCGGCGAGA TTGGATAAAC AAAGCATATT
TCAAGCGTGT GATGCGTCGC TGCGACGATT GAGAACGGAT TACATCGATT TATACCAAGT
GCACTGGCCC GACAGGTATC TGCCCATCGG CGCGTTCACG GGATCGACAG AGTACATTCA
GAGCAAGGAG AGATCGGACT CTGTCCCTAT TCGCGAGACG GTCGAAGCGC TCGGTGAGCT
CATCAAGGCT GGGAAGATCA GGCATTACGG GTTATCAAAC GAGTCAACGT TCGGAGTGTG
CGAGTTTGTT CGCGCGGCGG ATGAGCTCGG CGTTCCCCGT CCGGTGTCGA TTCAGAACTC
TTTTTGCCTT CTGCATCGAC AGTTTGACAC TGAAGTCGCC GAGGCGTGCT CGAAGTCAAA
CTACAACATT TTACTCCTTC CCTGGACCCC ACTCGCGGGC GGAGCCTTAT CGGGCAAATA
CCTCGACGGC GCTCGTCCGG AGGGCGCTCG CATGTCTGTC TTCAAACATT TCCACCAGCG
TTACCTGAAC GAAAACTCCG TCAAGGCGAC GAAGCAGTAC AAAGAAATCG CCGATAAGGC
GGGTATGAGT CTCACCACCA TGGCGCTTAA CTGGTGCAAG ACGCGCGCTT TCAACACTTC
CACCATCATC GGAGCCACCA CGCTCGAGCA GTTGAAGGAG AACATCGATG CGTTTGAGCC
CTCGGTTGTG TTGAGCAAGG AAACGCTCAA GGCCATAGAC GCCGTGCATC AGCAGTGCAG
AGACCCGTGC ATCGCCGTTT AAACGTGCGA CTCCTTCGTC GCTGTC
 
Protein sequence
MAAVTSIPRT TLRRIPLGSA KDVFVTDVCL GTMTWGVQNT EAEAHEQLDY AVKQRGVNFI 
DTAEMYPVPS SDARWKPGTT EEIIGNWLAK NVELRKELVV ATKVSGYQAK SETAGNRTVP
AGAPCAARLD KQSIFQACDA SLRRLRTDYI DLYQVHWPDR YLPIGAFTGS TEYIQSKERS
DSVPIRETVE ALGELIKAGK IRHYGLSNES TFGVCEFVRA ADELGVPRPV SIQNSFCLLH
RQFDTEVAEA CSKSNYNILL LPWTPLAGGA LSGKYLDGAR PEGARMSVFK HFHQRYLNEN
SVKATKQYKE IADKAGMSLT TMALNWCKTR AFNTSTIIGA TTLEQLKENI DAFEPSVVLS
KETLKAIDAV HQQCRDPCIA V
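
As a rough consistency check between the two sequences above, the opening of the gene sequence can be translated and compared with the start of the protein sequence. This is a minimal sketch, not the database's own method: the Translation table field is blank in this record, so the standard genetic code is an assumption, and `translate_from_first_atg` is a hypothetical helper. Only the first 60 bases of the gene sequence are used.

```python
from itertools import product

# Standard genetic code (assumed; the record leaves "Translation table" blank).
# Codons are enumerated in TCAG order, the layout used in NCBI code tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# First line of the gene sequence above, with the spacing removed.
GENE_PREFIX = "TCACATCCATCGCACGATGGCCGCCGTGACTTCTATCCCGCGCACGACGCTGCGTCGAAT"


def translate_from_first_atg(dna: str) -> tuple[int, str]:
    """Return the 0-based offset of the first ATG and the peptide read from it.

    Translation stops at the first stop codon or when fewer than three
    bases remain.
    """
    start = dna.find("ATG")
    if start < 0:
        return -1, ""
    residues = []
    for i in range(start, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return start, "".join(residues)


offset, peptide = translate_from_first_atg(GENE_PREFIX)
print(offset, peptide)
```

The peptide read from the first ATG agrees with the opening residues of the protein sequence, and the offset shows that the gene record begins a few bases upstream of the start codon.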