Gene OSTLU_32403 details


Gene Information

Locus tag: OSTLU_32403
Symbol:
ID: 5002200
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009360
Strand:
Start bp: 816433
End bp: 818433
Gene Length: 2001 bp
Protein Length: 666 aa
Translation table:
GC content: 56%
IMG OID: 640417621
Product: predicted protein
Protein accession: XP_001418342
Protein GI: 145347785
COG category:
COG ID:
TIGRFAM ID:
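
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes inclusive 1-based coordinates and an unspliced CDS whose final codon is a stop codon (the record marks the gene as not spliced, and the gene sequence ends in TAA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon occupies one codon but adds no amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length(816433, 818433)
print(length)                  # 2001
print(protein_length(length))  # 666
```

Both results match the Gene Length (2001 bp) and Protein Length (666 aa) values listed in the record.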


Plasmid Coverage information

Num covering plasmid clones: 69
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGTAGGT GCTTTCCCAC GTGGAGTCGC GGCGTCGGTG CCGCGGAAGA GGCGCGAAGA 
GGAGAGTCTG ACGTGAAAAA GACGAAGAAA GAGCCGCGCG CGCCCTTGTT GTCTTTGAAA
GAATACAAAG AGAACATGGA AGAGAAGATG GCGCAGAAGC AGCGCGAGAA GGAGGCAAAA
CAGCGCGAGA AAGAAGAGAA GGAGCGCAAG CGCAAGAAGG AGGAAGATGA GGCATTGCGA
CGTTTAGTAG AGGTGAATGT GACGCACGGC GCCGTTTCCG AAAACGAAGA CGCAGAGACA
AAAGGCGAGA CGTTGGAACC GAATAGCACG GAGACGACGA CGGTGGACGA AGAACCCGCG
CCAAGCGAGG TTTCGATCGA AGTAGAAGGC GGGCAGCAGC AGGCGGAGAC GATGGACGGA
GCGTCGACGG CGGAGACTGG CGAGCTCGCT CGCGACGCCG TGAGCGAAAC GTCGCAGGCG
GCGCCCGCGC CGGTCGAACC GCCCGTGATG GAAGAAGAAG ATACCGACGG TGAAGCAACT
TTCGCTGAGC TCGTGATCAA ACCTGAACGC CTCACGGAAG CAGACGCGGA GATGTACAAT
TACGCCGCGA GCTTCAACGG AGCAAAGGTG GTGGCGAGCG ATAAGGATTC AAAACACGCG
AGCGCCGCTT TGAAGGAAGA TAAAGATGTT TATTACATTT CTCCATGCGC TTCGGAAAAG
TTTGTCACTG TTGAGCTGAG CGAGGAGGTG ACGGTGACGA GCTTGGTTTT AGGAAACTTC
GAGTTTCATT CGTCTCGCGT CAAAGATTTC GAAGTTTGGG GCACGGACGG GCACCACGCT
ATTGAGGAAG GTTGGAAGAG ACTGATGATT GGACGTGCGG ATAACACGCA AAACTATCAA
AAGTTTGCCG TGCCTTCGCC AGCGTGGGTG CGTTACGTAC AAATTCGCAT GACTGGTCAT
CACGATCAGC AACACTTTTG CACGTTGAGC CTGCTGCGCA TCCACGGTAA AGACGCCAAG
GAGACGTTGA AAGAAGAGAT GGAGCGTTTG CAAGCGGAGG TGCAAGAGGT AGAGTCGTTA
TTGTCAGACG AGGACGAGGA CGAGGACGAG GACGAAGACG TGGATGTTCG CGAAAGTTCT
GCAGAAGTTG TGCTAGACGT TGAGGAGCAA AATCGTGAGG AAACAAACGC GAGCGCTGTG
GTGGGCGAAG AAAACGAGCG CGCGTCGACT GGTGACGACA GAGATGTCTC TACTTCGAGC
GAAACAGATC ACTCGGCAAA TGTCAACACA TCGATCGCGG AGGGTGCACC GAGCGAAACG
ACGTCGAACT CTGATGAGGA TGACAACGCC GCACAGGAGA GAGCGACGAA GATTGATGCG
TCGACAACGT CTCGTCCAAA TGCGACCGCC GCGGGCGCGA CCGCCGTGAA CGCGACGAAC
TCGAACGCTA CGGGCGTCGC GACGGCTAAA CCCAAGATGG CGACATCGAC GAATGAACTC
GCCAAGGGTG GCGGCGATGC GAACGTGTTT CGGTTGCTGG CGCAGAAGAT CAAGGATTTA
GAGCTCAACC AATCGCTTTT GTCGCGGTAC GTGGAGTCGC TCAACGTGCG ATACGGCGAA
ACGTTGGAAG ACTTCGGGAA AGAGATTGAC GAGATTGAAG AATCAGTGTC GAATTCCACT
GGCAAGCTCG ACGAAGCCAG TCGCCAAGCG CGAGCGAGCT CGAAAGCGTG CGATGACGCC
GTCGCGCGCG TCAACGATAG CTCCGAAAAG CTCGTCGCCG CAGCCGTGTC TGAGTTGGAC
GCGTATCGCA CGACTGTCGC GAAGCGGGAC ACCGTTCTCG CACTCGCGCT CGCGCTCACA
GCAGGCGCGC TCGTGGCGTC GCGCAGATCG TCTGGTGCGA TCGAACGCGT CTTGAGCGCG
CTCTCATCAT TCGCTTTGCT CGTCATCGTC GTGGCGAACA TCGTCCTCAT AGCACAAAAT
TTCTTGTTAA AGTCGATGTA A
 
Protein sequence
MCRCFPTWSR GVGAAEEARR GESDVKKTKK EPRAPLLSLK EYKENMEEKM AQKQREKEAK 
QREKEEKERK RKKEEDEALR RLVEVNVTHG AVSENEDAET KGETLEPNST ETTTVDEEPA
PSEVSIEVEG GQQQAETMDG ASTAETGELA RDAVSETSQA APAPVEPPVM EEEDTDGEAT
FAELVIKPER LTEADAEMYN YAASFNGAKV VASDKDSKHA SAALKEDKDV YYISPCASEK
FVTVELSEEV TVTSLVLGNF EFHSSRVKDF EVWGTDGHHA IEEGWKRLMI GRADNTQNYQ
KFAVPSPAWV RYVQIRMTGH HDQQHFCTLS LLRIHGKDAK ETLKEEMERL QAEVQEVESL
LSDEDEDEDE DEDVDVRESS AEVVLDVEEQ NREETNASAV VGEENERAST GDDRDVSTSS
ETDHSANVNT SIAEGAPSET TSNSDEDDNA AQERATKIDA STTSRPNATA AGATAVNATN
SNATGVATAK PKMATSTNEL AKGGGDANVF RLLAQKIKDL ELNQSLLSRY VESLNVRYGE
TLEDFGKEID EIEESVSNST GKLDEASRQA RASSKACDDA VARVNDSSEK LVAAAVSELD
AYRTTVAKRD TVLALALALT AGALVASRRS SGAIERVLSA LSSFALLVIV VANIVLIAQN
FLLKSM
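
The relationship between the gene sequence and the protein sequence above can be reproduced with a short translation routine. This is a minimal sketch, not the method used to produce the record: the Translation table field is empty, so the standard genetic code (NCBI table 1) is an assumption here, and the function names `translate` and `gc_content` are hypothetical. Both functions ignore the spaces and line breaks used to group the sequence into blocks of ten.

```python
BASES = "TCAG"
# Standard genetic code (NCBI translation table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1 until the first stop codon; unknown codons become 'X'."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE.get(dna[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    dna = clean(seq)
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First two blocks of the gene sequence; the trailing partial codon is ignored.
print(translate("ATGTGTAGGT GCTTTCCCAC"))  # MCRCFP
```

The first six residues agree with the start of the listed protein sequence (MCRCFPTWSR). Passing the full gene sequence to `gc_content` gives a value that can be compared against the GC content field.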