Gene OSTLU_26228 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_26228 
Symbol 
ID5004099 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009364 
Strand
Start bp455971 
End bp457977 
Gene Length2007 bp 
Protein Length668 aa 
Translation table 
GC content63% 
IMG OID640419520 
Productpredicted protein 
Protein accessionXP_001420009 
Protein GI145351277 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.281249 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.273766 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCCG CGTCGACGCT CCGCACCGCG TCGACGCGCG AACGCGCATC GACCCGACGC 
GTCCCGGCGC GCGCGCGCGC GCTTCCGTCG TCGTCCTCGC CCCGCCGCCG ACGCTTATCC
ATTACGACGT CACCGGGATT ACGTCTCGAC GCCGCGTCGC CGCGCGCGCC GACGCGCGCG
CGCGCGGCGT CGGACGTCGT CGAATCGTCC TCCGCGCCGC TCGATCCGAT CGGGCTGAAC
CCCGATCGCG CCGCCGTGGT GAACGCCGAC CCCGACGCGT ACTCGTGGAC GAAGCAGTGG
TACCCGGTGG CGGTGGTGGA CGTGCTGGAC GAGACGAAAC CGCACCCGAC GACGCTGCTC
GGGATAGATT TGGTGGTTTG GAAAAATGGT GATGGGTCGT GGAGCGTGTT CGAGGATAAG
TGTCCGCACA GGTTGGCGCC GCTGAGCGAG GGACGGGTGG AGAGCGACGG GACGCTGCTG
TGCGCGTATC ACGCGTGGCG ATTCGACGGC GACGGAAAGT GTACGTCGAT GCCGCAAGCG
TCGAGCGCGG AGGAAGAAGA AAGAATCAAG GCGAACGTGC GATCGTGCGC GTTCAAGCGA
CCGAGCATGG TGGCGCAAGG GTTGGTGTGG GCGTGGGGCG AAGGCGGGAA GGATGCGGAG
ATGGAAGCCG CGATGACGCC GCCGTTGTTC GTGCCTGAAA TAGAGGGCAT CGGTAAGAGT
GGTCGCGCGA GCTGCGGTGG GTTCAGAAAT CACTGGCAAG TGCGCGATTT ACCGTACGGT
TGGAACGCGT TCTTCGAAAA CGCCATAGAT CCCGCGCACG CCGTCGTGAG TCACCACACG
TTGGTCGGTT CGCGATACGA CGACCCAGCC GGGTTTCAGT GCGTCGTCGA GCGTCCGGTG
ACCGACGCCG GTGGGTTCCG ATGCGCCATC GACCCGGCGG TGCCACCGTT CAACTCGATC
GGGAAATACG ACGCGGAGAC GTCTTACGAC TTCCAGCCGC CCGCGCTGTT GAAGATTGAC
TGGCGACACG AGGGGGGGCG ATTTTTGACG TCGCACTACT GCGTGCCGAC GCGTCCGGGG
TGGTGCCGCC ACTTCGTCGT CACCATCGCG CAGCGACGAC CCGAAATGGG GAACAAAATT
CGCGAGCACC GATGGTTCAA GCTAAACCTG TTCACGCTCA CGTCGCCCGC GTGGCTGACG
CACGTGTTGG GGCCGACGTT TTTGCATCAA GACATGGTGT TGTTGCACCA ACAAGAAAAA
ATCATCGCTC AGGGCGACGG ACAGGCGATG GCGCAAAAGT GGAAGGATCA AGTCTTCACG
CCGAGCACGG CGGATAAGAT GACCATCTTC TTTTACAAGT GGTTCGAGAA GAATGGCCCG
ATTCCGTGGG CGCCCGGGAC GGAGCAAATG CCGCCCATCG AGCGCGATTC GAGCAAGCTC
TTCGACACGT ACGAGATGCA CACCAAGTAC TGCACGCACT GCCAAGGCGC GCTTCGCAAC
ACGGAGATCG GGATGTGGGC TACGGGCGCG ATCGCGGGGG CGAAGTTGTT TTGGGTCGGC
GCGAGTGTCG TCTTCACCGC GGCGTTGCTC GGCAGCGGCG ACGACGCGTC GTCGTCGCTC
GACGTGTTCG AGTTAGCGAG CGCCGTCGAC GGTTCGGTGT ACGGTGACTT TTTCAGCGCT
TTGAGCTTGG GCGCGACGTC ATTCTTTCTG TGGGGTTTCG CGCAAATGTT TCGCACGTAC
CCGTTTTCGC ACTCTGAGGA CGACATCGTC ATGGAGGGTA CGGCGAAAAT CGGTTTGTCC
AACGACGGAC CGAGCGCGTA CATCGATTTC GTAGATTCGA CGCTGTTCAA GGAAAAAGGT
GGCGATCACA ACCGCGGTTG CGAGTGCAGC ACGTGCTCGC CGCATTTCAA GGATTTGATC
AAGAGCACCA TGTTGGCGCG CGCGAAAAAG TCACCCGCCG TCGTCGAGGA GGCGGAGGAA
GAGCGATCGA TCCCGGTCGC GCGATGA
 
Protein sequence
MTAASTLRTA STRERASTRR VPARARALPS SSSPRRRRLS ITTSPGLRLD AASPRAPTRA 
RAASDVVESS SAPLDPIGLN PDRAAVVNAD PDAYSWTKQW YPVAVVDVLD ETKPHPTTLL
GIDLVVWKNG DGSWSVFEDK CPHRLAPLSE GRVESDGTLL CAYHAWRFDG DGKCTSMPQA
SSAEEEERIK ANVRSCAFKR PSMVAQGLVW AWGEGGKDAE MEAAMTPPLF VPEIEGIGKS
GRASCGGFRN HWQVRDLPYG WNAFFENAID PAHAVVSHHT LVGSRYDDPA GFQCVVERPV
TDAGGFRCAI DPAVPPFNSI GKYDAETSYD FQPPALLKID WRHEGGRFLT SHYCVPTRPG
WCRHFVVTIA QRRPEMGNKI REHRWFKLNL FTLTSPAWLT HVLGPTFLHQ DMVLLHQQEK
IIAQGDGQAM AQKWKDQVFT PSTADKMTIF FYKWFEKNGP IPWAPGTEQM PPIERDSSKL
FDTYEMHTKY CTHCQGALRN TEIGMWATGA IAGAKLFWVG ASVVFTAALL GSGDDASSSL
DVFELASAVD GSVYGDFFSA LSLGATSFFL WGFAQMFRTY PFSHSEDDIV MEGTAKIGLS
NDGPSAYIDF VDSTLFKEKG GDHNRGCECS TCSPHFKDLI KSTMLARAKK SPAVVEEAEE
ERSIPVAR