Gene OSTLU_31267 details

Gene Information

Locus tag: OSTLU_31267
Symbol:
ID: 5001509
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009358
Strand:
Start bp: 584982
End bp: 588124
Gene Length: 3143 bp
Protein Length: 965 aa
Translation table:
GC content: 60%
IMG OID: 640416930
Product: predicted protein
Protein accession: XP_001417292
Protein GI: 145345598
COG category:
COG ID:
TIGRFAM ID:
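
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (the record does not state the convention, but the reported gene length is consistent with it) and that the coding region ends with a single stop codon. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 584982
END_BP = 588124
PROTEIN_LENGTH_AA = 965


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return (protein_aa + 1) * 3


gene_len = span_length(START_BP, END_BP)
cds_len = coding_length(PROTEIN_LENGTH_AA)

print(gene_len)            # 3143, matching the Gene Length field
print(cds_len)             # 2898
print(gene_len - cds_len)  # 245 bases not accounted for by the protein
```

Since the record lists the gene as unspliced, the 245 bases left over would be flanking sequence around the coding region rather than introns.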


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.533629
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGCGCGCGTC GCGCACGCCG CGCGCGCGAA CGCACGCGAA ACGCGGACGC GCGCGCGATC 
GACGCGCTCG GACGCCCGTT CGGTGCGGCG CGCGCGCGCG ATGCGGCGCC GTTTCGACGA
CGAAGCCGCG GGGTGCGCGC GAAGATGCGA ACGCGCGCGC GGGCGAACGC GCGGCGCGCG
GTGGTCGCGC GGCGCCGCGG TGGCGATGCT CGTCGTCGTC GCGCGCGTCG TCGTCGCGCA
GCGCGTTGAA CCGATGAAGG CGCCGACGGT GACGTGGTTG GAACCGCCCA AGGGGCACGT
CGCGGGCGGG ACGACGCTCG CGGTGTACGG CGGCGGTTTC CTAAACTCGG CGCGGTTGAA
GGTGAGGTTC GCGAGGGGCG AGGAGACGAC GGAAGCGTAC GCGACGTATC ACTCGTCGAC
GATGATCACG GTTGTGACGC CCGCGCGCGT CGGCGCGGGA TGGACGCAAG TCACGGTGGC
GAACGACGGG GAGACGTGGA GCGGGACGCC GAACGTGTAC ACAAAGGGAT CTGGGACGTT
TTTGGCGTAC GTCTACGACG ATTCGTTGGC TGGATTTTAC GATGGCGTGC GACAAGGGGA
GAATACGAAC GCGTATCACG AGTTGTGGAG CGCGAGTAAC ACGACGGGGC CGTACATCGG
CGGTACCATC GTGCGCGTCT ACGCGCGCAA CTTGGATTTG GGCTCGAATA CGTTGTCGTC
AAGCGCTACG TACGTAGGCC CGACAGACGG CGCGGGGTCG CCGAATCCGA ACTTTCCAGA
CCCTAATGCG GTGATGAATA GTCCGCCGGT GCATGGGACG TTCTATCCGG GCTCGAAGTT
GACGTGCAGG ATGACGTGCA ACATCGATGT AAATCAAGAC TCCAGCATCG CTAGCGACGG
GTCGGAAACC TTTGTCACGG CGCAACCAGC AATTTGGCAT AGCTACACGT CGGTTGAGTG
CGAAACACCA CCGATGCCGG TGCCAGTGGG CGATCCGATT CCGTCGACGG CGTGCCACAT
GCACATCTCC AACGATGGGA TCAACTACGA TTACGCCAAC GTTACATTTA CGTACGCGGA
TCCGTTGCCG ACAGTGACTA GCATTAGCAC GGCGCAGTCT TCGGTTTGGG GCGCTCGAGG
CCCTTTTGAT GGAAACACCG AAGTCATCGT CAAGGGAACG AACTTCCTTC CTAGCAAGTA
TCTCAAATGC AAATTTGGGG GTATACCGGA GAGCGGTAAA AATTTATGGG AAAGCGACGA
CGTGTCGCAC GTCGTCGGCA CGCCCGGCGG ACGTGTTCGG TGGGTTTCGA GCACGGAAAT
TCGTTGTATA ACCCCGGAGT TCGGTCCTGC GTCACAACAA CGTCAATATC CAGCTGGTTC
GACGTTAGCG GCCTCCATCG GGTGTTGCGC CGTGCTAGAT GTCACCTTTG ATGCCAACAA
CGGCATTTCG TCCACCGTCA CCATAGTCGA CGGCGGGCGA GGCTACGCCA CCGCGCCGGT
GCTGACAATA ATCGGTGGTG GAGGAGGGGG TGCGACGGCG ACAGCGACCA TAGACTCGAG
CGGAGTCGTC AACGCGGTGA CGATCACGAA CGCGGGGCAC TCTTATAATC AGGGCTCTGG
CGCCACCGCC ACGGCGACGC TCGACGCCTC TGGTGGAACG TTGAGCGCGC TTACGCTCAC
GGCTGCGGGA AGCGGTTACA CCGTGCCGCC TGACGTGACG TTTTCGTGTA GCGGCGGGGG
TGACTCGTGT CTGGGTACGC ATATGCAGCA CGCTCGTGCT GTGGCAACTC TAGGTCCCGC
CGCCAATTGT TTCCCCGAGT ATGGATGCTA CGACAAAGTT GTCGTATCGC TGCGACTGAC
GTTCGTCGGT AATACGTACA CTTCCGCGCC TGTGGTTACG ATATCACCTC AAAAGCCGTA
CGTCTTGGTG CAGGCGAATG AACTTTTCAC ATCGGCGAAT GATCCCCCGT CGATTGGGGC
GTACACCATG CAAGGGAAAG GGGAAACCGA GCTCGATCCG CAGCTCTCGC ACGGCTCGTG
GGTGGACGAC GGAAGCGGAG GGCGAATTTA CAACGTCACC GACGCGCCTT ACGACGTTGT
CGCGGTCTCG GGTGTACCAG GAGCGAATCC TGGGCGACGT CTGTACGCGG TCAATCAATA
TAGTCAGAAT AGCCCGGAAG ATCGCGGTGA TCTTCTCGCA CCGCTGGGCC AAGCGGGAAG
CGATGGGTCG ATTAAACCGG CGCACCAGGA TCTCGTACAA GTGAGCAACA ATTACAACAA
GTTCGGCGTT CCTGTGATCG GTAACGCGGC TGCAACGCCC ACGCACGTTG GATATCGCGA
CAACACCGCA GCGGTAAAGG GATATTGGAT GTGGAGTCGC AGTACGGGCT CAGCGAGCGA
TTGCCAAGTT TCCAACAATC CGCCGTTGAA CTTTTATGAT GGCACAACGC TTATGGGACA
TAACCTCGAT AAAGGCACGG GGTGGGTCGT CGCTGGCTCC TCAAACGAGT ACTTGGGCAT
TCCTGGAAAC TCCGACTTGG GTCATCCGTC GAAATCGTGC TTGTACTTTT TGTACGGCGA
CATTTACGTA TCGCCGTCGG GGAGCGACGC AACCGGACAA GGCACCGCGG CAAGACCGTA
CGCCACGATT CAAAAGTGCA TCGATTCGGC GTTGACAGAC GTGCGTGATT ATCACGTGAA
CGCCGTCGGT GAGTCAAACC CGAGCGTGCC CGCGAGGATA TCGACGCAAA TCACTAAATT
CAACGCTGGG CGCTCGCAAA AGCGCGATGG AGGCGGCGGC TACGCGTACA CTGTGAATCG
CGATCGTTGC ATCCTGAAAG ACGGCACGTA CTTCGGCCCG GGCAATCGAC AACTTCGTGC
CAACGGTCGC GTCATCCAGC TATGGGCAGA AAACGAGGAG CGAGCCGTCA TCGACTGCGA
AGGCTTCCCC GTGGGCAAGC AAGTCTACGC CAAGCGCGAG CCGACGCGCG TGAGCGCGCC
GGGTTCGATC GCCACCCAGG GCGTCGTGCT CAAGCGATGC GGATTCAAGA TGCCTTACGC
CGCGGCGGAT AAATTATTTT ACCCCGGTCG TCCGACGACG TGATCAAACG CGCGCGCAAC
GCGATAAAAC AATCATTTGA ATT
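
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before it can be measured. The following is a minimal sketch of that clean-up together with a GC-percentage calculation. The helper names are hypothetical, and the fragment used is just the first two blocks of the listing above.

```python
def clean_sequence(grouped: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence above.
fragment = "CGCGCGCGTC GCGCACGCCG"
seq = clean_sequence(fragment)

print(len(seq))         # 20
print(gc_percent(seq))  # 90.0
```

Applied to the full listing, `len(clean_sequence(...))` should agree with the Gene Length field, and `gc_percent` should presumably land near the reported GC content.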
 
Protein sequence
MLVVVARVVV AQRVEPMKAP TVTWLEPPKG HVAGGTTLAV YGGGFLNSAR LKVRFARGEE 
TTEAYATYHS STMITVVTPA RVGAGWTQVT VANDGETWSG TPNVYTKGSG TFLAYVYDDS
LAGFYDGVRQ GENTNAYHEL WSASNTTGPY IGGTIVRVYA RNLDLGSNTL SSSATYVGPT
DGAGSPNPNF PDPNAVMNSP PVHGTFYPGS KLTCRMTCNI DVNQDSSIAS DGSETFVTAQ
PAIWHSYTSV ECETPPMPVP VGDPIPSTAC HMHISNDGIN YDYANVTFTY ADPLPTVTSI
STAQSSVWGA RGPFDGNTEV IVKGTNFLPS KYLKCKFGGI PESGKNLWES DDVSHVVGTP
GGRVRWVSST EIRCITPEFG PASQQRQYPA GSTLAASIGC CAVLDVTFDA NNGISSTVTI
VDGGRGYATA PVLTIIGGGG GGATATATID SSGVVNAVTI TNAGHSYNQG SGATATATLD
ASGGTLSALT LTAAGSGYTV PPDVTFSCSG GGDSCLGTHM QHARAVATLG PAANCFPEYG
CYDKVVVSLR LTFVGNTYTS APVVTISPQK PYVLVQANEL FTSANDPPSI GAYTMQGKGE
TELDPQLSHG SWVDDGSGGR IYNVTDAPYD VVAVSGVPGA NPGRRLYAVN QYSQNSPEDR
GDLLAPLGQA GSDGSIKPAH QDLVQVSNNY NKFGVPVIGN AAATPTHVGY RDNTAAVKGY
WMWSRSTGSA SDCQVSNNPP LNFYDGTTLM GHNLDKGTGW VVAGSSNEYL GIPGNSDLGH
PSKSCLYFLY GDIYVSPSGS DATGQGTAAR PYATIQKCID SALTDVRDYH VNAVGESNPS
VPARISTQIT KFNAGRSQKR DGGGGYAYTV NRDRCILKDG TYFGPGNRQL RANGRVIQLW
AENEERAVID CEGFPVGKQV YAKREPTRVS APGSIATQGV VLKRCGFKMP YAAADKLFYP
GRPTT
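
The protein sequence can be related back to the gene sequence by translating codons. The sketch below assumes the standard genetic code, because the Translation table field in the record is empty. The function name and table construction are illustrative, and ambiguous bases such as N are not handled.

```python
from itertools import product

# Standard genetic code (an assumption; the record leaves the
# translation table unspecified). Codons are ordered with T, C, A, G
# cycling fastest in the third position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored so block-formatted text can be passed directly.
    Trailing bases that do not fill a codon are dropped.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Fragment from the fourth line of the gene sequence, starting at its ATG.
print(translate("ATGCT CGTCGTCGTC GCGCGCGTCG TCGTCGCGCA"))  # MLVVVARVVVA
```

The fragment reproduces the first eleven residues of the protein listing. In the gene sequence, the matching ATG sits at 0-based offset 205, and the final codons `ACG ACG TGA` near the end of the listing give the closing `TT` followed by a stop.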