Gene OSTLU_33125 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_33125 
Symbol 
ID5003446 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp429510 
End bp433574 
Gene Length4065 bp 
Protein Length1354 aa 
Translation table 
GC content54% 
IMG OID640418867 
Productpredicted protein 
Protein accessionXP_001419161 
Protein GI145349481 
COG category[S] Function unknown 
COG ID[COG4946] Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.517999 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0400566 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATCACA ACCGTCATCG CGCGCGATCC GGACGCTCGA CGCGCTCAAC GACGACGAAT 
CGCGCGCGAA CGCGCACGAC GGTCCGGTCG GCGGCTCGAG CGCATTATCA AGACGGCGAC
GCGCACGCGG CGTCGCAGGA GGGTTACTAC AGGTTCCCGG TGATCCGTGG TAACGAGTTG
TTCTTCGTGT GCGAAGACGA CGTGTACGCG ACCACGATCT CGGGGCTCGA TAAACGCGAG
AGCGGCGCGG AGACGACGCC GCCGCGACGG CTGACGCAGG CGCACGGCGC GGTGCAGCGG
TTAGTCGTCT CTCCCGATGG ATCACGCGTC GCGTTCGCGT GCGCAGAGGA TGGATACACC
GAAATATACG TCGTGGACGC GCGTGGAGGT CCTATGAAAC AGTTGACACA CATGGGGGCT
TCGTACGCGC GGGCGTGCTG CTTTTCAGAG GATGGTCGAC GGGTGTACTT TACGTCGAGC
GGGGCGACGG CGGAACCGAA TGGAGACGAG CTTTGGGTGG TGGACTGCGA TGGTGGAGCG
CCAATGAGAA TGAACCTCGG TCCAGTGCAT GATTTCGACG TGCGCAACGT GAACGGGAAA
GAATTGGTTG TCCTCGGTCG TAATACTGAA GACACGGCGA CGAAACACTG GGACGGATAC
GCGGGCGGCG CCGGTGGAGA GATTTGGTAT GGGACCTTAG ATAATTTGCT GCGACTTGAC
TTGCGTCTTC CAAACGAGCG TTTGTTGCGA AATGTTGGCA ACGTGTCGTG GTTTGACGAC
GAGCACGTCG CGTTCACGGA AGCGGGCGGG CGTTCGACAG CGTTCAAGGC TAAAATTGAT
ACCAATGAAT TCAACACTCG CATCGTAGAT TGTCAAGGAT TCAGTGACTC CACAGCAGAC
AACGAAGACA TTAAGTACTT TCCCGTGCGT CATCTGAGCA TTGATGTCAG TAGGCAAAGA
ATCGTGTACA CCAGAGGCGG GGATGTATTC GTGGCAGAAG TTGTAGGTGG CGAACATAAG
TCACCTATAA GGTTGCCTAT TGAGTGGCGT GGACCTCGTA CACAGCTTGC AAAACGATTT
GTTCACGCGG ATGATTGGAT CGAAGACTGG GATTTACACC CAGAAGGCTT GACGATGATG
GTTCTTGTTC GCGGCCAACC GTTCACGATG GGCATCTGGG ATGGACCGGT GTTGAGTTAT
CCGCCAGCGA CGAAACTGCG ATCACCCCAA AGCTCAGTGA TAGCACCACT GGCGTCTCTT
GCACAAAAGA GTCAAGCGCG TGTCCGGCAT GGGGCTTACT TGTACGACGG TGAACGTCTC
GTTTTTGTAT CGGACGCCAG TGGTGAGGAG GACATTGAAG TGCATTGGGA AGAAGCTGAG
CGACCGGCAA AACGATTAGG ATTGCATCAC GAGTTGCTTG GAAGAGTAGA ACGCCTGATA
CCTAGTCCAG AGGCGCCATT GGTGGCGATC GTGAACCACA GAAACTCACT GTTGATAGTC
AACGTTGAAA CTGGGGAAAT GCGAACGGCT GACACGTCGA GTGAAGTGGA CGGAATCGAC
GATTTGACGT GGAGTCCTTG CGGGAATTGG CTCGCTTACA CGTACTACTT AAACAACGAA
AGATCCTGTA TTCGCATTCT CGATGTTCGG AACGGAAAGG TGTTCGATGC CACAAACCCT
GTGCTCGGAG ATCACTCACC TGCGTGGGAT CCCGACGGCA AATATCTGTA CTTTTTAGGA
TCTCGAGAAC TCGAACCGGT GTACGATGCC GCCACGTTTG GCTTAAATTT TCCAACTGTT
GAACGCCCTC ATTTGATCAT ACTGCAAAAG GATTTACGTA ATCCACTTCT CAAGGAGCTC
AGACCGCCGT ACGATACGGA ATCGTCGTCT GGCTCTGACT ATGACTCTGA AACAGATAGT
GATGTGGGAA GAAAGATGGA TAAACGAGGC AATGGAAGGG ACTACATGCC TCGCAAGAAG
CCAGTTGTGG ATAAGGATGA TGATGGCAGC GACTGGTCAA CTGTGGATGG AGACAGTGAT
GACCAAATCT TGTCGAGCGG CGACGAGGAT GACGATTACG AATCAGACGC GTCATATTAC
CATGAAGACG CGCCCCCTGC GATTGAGATT CACATCGATG GCTTGACAGA AAGGGTAGTT
GCGCTCCCAA TGCCGATATC ACGTTATGAT TGCCTTTGCG GTCTTGAAGA CGGACGATTT
ATGGTTGTTG AATACCCTCC GAGTCGCGGC TCTCCCGGAA GCGTTGGTTT GGACTACTCG
TCCGATGAAG ATGATGACGG TTTGGGTTCC CTTATTTCTT ACAGTATCCG AGATTTGCGT
CGAAGTATCT TAATACACGG TGGCGTGAGC GAAGTGTCAC TGTCGATGGA TCGCAAATGC
ATGGTTGTGG AAAAAGAGTC GGATGGATTC CTAGAGCTTC GCGTTTACAA AGCTGGCGTG
CGGCCGGAGG AAGAAGGGAG CGACAGCGAA GAATTGGATC AAATGCGGTG CGATCGTAGG
ACTGGCTTGG TGAATCTAGA TGGTCGCATT CGTGTACTAG TTGATCCTGC GAGAGAGTGG
GCACAGATGC TCGGGGAAGT GTATCGTCGT CTTCGCGACG ACCTCTGGAC TGAGAAAATT
TGGAATGAAA CGATTGGCGA TGACTGGGAA GTCATGTTTG AGGAGTATGT GAAAGTGCTT
CCAAAAGTGA GTACGCGGAC CGAGTTTGGG GATTTATTGC GTGAAATTGC CGCTGGCGTT
TGTTACTCGC ACGTGGCTAT CACGTCTGGT GATCCTGGAC GCTCCCATCG CAGACACTCT
GCTGGGTATC TCGGCGCTGA CTTCACTTGG GACGGCAAAG TCGGCGGTTA TCGCATCTTG
AACATCGTGA AGGGTGACAT ATGGGACGAC ATGCGAGGTG GTGTGCTCAG CAAACCGGGC
GTCAATATCC ACGAGGGTGA CATACTTCTC TCGATCGATA GAGTACCGCT CACCGAAGAT
GTTCCGCCGG CTGCGTTGTT GATTGAGAAG GGTGGCGTTG AAGTTCTGTT GACGGTCAAA
ATTGATAGCG ACGGCAAAGG TGGTATTGAC GAAGCGCTCG ACAGACTCAT GCTGAAAAAA
CAAAAGAACA AGAAAAAGGA CAAGAGAGAT GACAACGCAC CGAAGAAAGG TGATGTCATC
CCGGTGCGTG TCCGAGCCAT GCACTCTGAA ATCGATGCGA GGTATCGCGA TATGATTCAG
AAGCGCACGG AGCGAGTCCA CAGCCTGAGC GATGGCGTCG TCGGCTACTT GCACATTCCA
GACATGGAAA GCACAGGGTA CTCCGAATTT TGGCGTCACT ATGCGTCTGA AGTTCGCAAG
GGAAGCTTGA TTCTCGACTT GCGAGGTAAC ACAGGCGGAC ACATTAGTGA ATTGTTGCTC
GCTAAGCTCT CGCAGCGCGC ATTGGCTTGG GACATCCCGC GACGCGGCGA GGTGCAAGTT
TATCCATCCA ACACGCCTGG CCCGCTCGTG ATGCTGGTCG ATCAACGCAC AGGCTCTGAT
GCTGAGCTCA TGGCGGAATC TTTCAGAAAA CTAGGTTTAG GACGAGTCGT TGGGATGCGC
ACTTGGGGTG GTTTGCTCGC CATCAACGGC GTCGCCGAAC TCATCGATGG GTCCGAGTTG
AGTTTGCCTT CACAAAATGT GCTCCTTGTC GACGAGGCGA AGGGCGTAGA CGCGAGATCC
GACGCGACAC AAGCGTACAC GAACGCGGTG GAAAACCGCG GCGTCATTCC GGACGTCACA
GTTGACATAT CTCCCGCTGA ATATTCTCGC CGCGAGGACC CGCAGCTCGA CACCGCCGTG
CGCGAGGCGT TGCAGTTACT CAAAGACACC GGCGCCGCCG GCGTCGCGAC CTACCTTCGC
AAGATCCGCG AGGACGAGAC AACGGCCGCT GAATTAGAAC GCAAACTGAC GCGCAAACCT
TGGTCGTTTT CCACGTGGGC GCCGCTTCCG CCGACCAAGG AGGAAGAAGA GAAGCAACTT
CGAGCCAAGC GCCGCGCCGG GCGAAATAAT ATCCCGCGCC CGTGA
 
Protein sequence
MHHNRHRARS GRSTRSTTTN RARTRTTVRS AARAHYQDGD AHAASQEGYY RFPVIRGNEL 
FFVCEDDVYA TTISGLDKRE SGAETTPPRR LTQAHGAVQR LVVSPDGSRV AFACAEDGYT
EIYVVDARGG PMKQLTHMGA SYARACCFSE DGRRVYFTSS GATAEPNGDE LWVVDCDGGA
PMRMNLGPVH DFDVRNVNGK ELVVLGRNTE DTATKHWDGY AGGAGGEIWY GTLDNLLRLD
LRLPNERLLR NVGNVSWFDD EHVAFTEAGG RSTAFKAKID TNEFNTRIVD CQGFSDSTAD
NEDIKYFPVR HLSIDVSRQR IVYTRGGDVF VAEVVGGEHK SPIRLPIEWR GPRTQLAKRF
VHADDWIEDW DLHPEGLTMM VLVRGQPFTM GIWDGPVLSY PPATKLRSPQ SSVIAPLASL
AQKSQARVRH GAYLYDGERL VFVSDASGEE DIEVHWEEAE RPAKRLGLHH ELLGRVERLI
PSPEAPLVAI VNHRNSLLIV NVETGEMRTA DTSSEVDGID DLTWSPCGNW LAYTYYLNNE
RSCIRILDVR NGKVFDATNP VLGDHSPAWD PDGKYLYFLG SRELEPVYDA ATFGLNFPTV
ERPHLIILQK DLRNPLLKEL RPPYDTESSS GSDYDSETDS DVGRKMDKRG NGRDYMPRKK
PVVDKDDDGS DWSTVDGDSD DQILSSGDED DDYESDASYY HEDAPPAIEI HIDGLTERVV
ALPMPISRYD CLCGLEDGRF MVVEYPPSRG SPGSVGLDYS SDEDDDGLGS LISYSIRDLR
RSILIHGGVS EVSLSMDRKC MVVEKESDGF LELRVYKAGV RPEEEGSDSE ELDQMRCDRR
TGLVNLDGRI RVLVDPAREW AQMLGEVYRR LRDDLWTEKI WNETIGDDWE VMFEEYVKVL
PKVSTRTEFG DLLREIAAGV CYSHVAITSG DPGRSHRRHS AGYLGADFTW DGKVGGYRIL
NIVKGDIWDD MRGGVLSKPG VNIHEGDILL SIDRVPLTED VPPAALLIEK GGVEVLLTVK
IDSDGKGGID EALDRLMLKK QKNKKKDKRD DNAPKKGDVI PVRVRAMHSE IDARYRDMIQ
KRTERVHSLS DGVVGYLHIP DMESTGYSEF WRHYASEVRK GSLILDLRGN TGGHISELLL
AKLSQRALAW DIPRRGEVQV YPSNTPGPLV MLVDQRTGSD AELMAESFRK LGLGRVVGMR
TWGGLLAING VAELIDGSEL SLPSQNVLLV DEAKGVDARS DATQAYTNAV ENRGVIPDVT
VDISPAEYSR REDPQLDTAV REALQLLKDT GAAGVATYLR KIREDETTAA ELERKLTRKP
WSFSTWAPLP PTKEEEEKQL RAKRRAGRNN IPRP