Gene OSTLU_31833 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_31833 
Symbol 
ID5001773 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp608427 
End bp611828 
Gene Length3402 bp 
Protein Length1133 aa 
Translation table 
GC content55% 
IMG OID640417194 
Productpredicted protein 
Protein accessionXP_001417820 
Protein GI145346696 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0358806 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.410614 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGATGGA GGACGAGCGG AAGGTTATGG ACGGCGGTGG TGTGGACGCT GGCGACGATT 
GCGCGAGGGA GGACGGCGAC GGCGCAGTAT CAGACGACGG CGACGACGCT GACGTTGGAG
CACGTGTGGC CGGATACTGG AGCGGTGTAC GCGTCGACGG TGGTGTCGTT GTATGGGAGT
GGATACGCGA ACGCGTTGCC GCCGATGGGG TGTCGGTTCG GGGAGATTGT CGCGAATAAG
GACTCGGCCG CGAGCACGAC TTCGAAAGTC GTGTGCGCGA CGCCGACCAA CGTCTTCGCG
GGTTTCGTCG CCGTTGGGTT GGCGCAGGCG ACGGGAAAGA GATACGTTCC CGGGTCAGAT
GATTTAGTCG TGGACAACGG GCAGCATTCT TTCGAATTCG TCGTGCCTTG GAAACTATCC
AAAGTGAATC CAGAGTACGC GTACAAGAGC GGCGGCGAAG TTCTGCGACT TTCAGGGACA
CATTTTCGTC CAGGAATGCT GTGCACGTTC AAAGACTCTT CGCTGACATC CGAGTACAGA
TTCATTTCTT CTGCGTTAGC CATGTGTGAG ACCCGTGCGT CGAGCGAAGC GGAAGGAACG
GTGGATTTGA ATCTCACTCC AACTCACGCC GTCGGAGGCG GCGGCGCGAG CGTCGAGTAC
CAAACCGCGC CCATCATAGA TGGGCCTTTA GTGTCCACCA CCGCAGTTGG TGGTGACGTA
GTGATACAAG CATCAAGTAG TACTCCGCTG AGCGGCGCCA TAGCATCGTT CACGGCGAGT
CCGATTCGTA TCGGATGCTG GTTTGACGGC ATTTGGGTGG CGGCGACTCT GCGCAGCGAA
AGAGAGTTGG TGTGCAAAGC GCCTTTGCAG TCATTCGGCA CTCCATCGCT GAGTGTCGTG
GATATGTATG CTCAAAGAAT GTTTCCGACG AATGCAACTC AGACTGGTTG GTTCACCTCA
TTCACCGTGA GCAAAGACGA AGTTGTCGAC GTCGTGTTGC CTTCGGTTGG CAATGCGATG
CGAAATACGG TGGCCGATTC TTCGACTCTG GTCGACTTTT TCGGCCGCAA CCTTATTGCT
GGCTCGGGTT CTGTTGGTGC ACGCATTTGC CAAGCCTTGA ACCCGGTTAC ATCTCCCGTT
TTAGTGACGA ATCAGGTCCA AAGCAAATGT GATTTTCCGG CGACACCATC CGAGACAGCG
GCGCTGTCCA CGCTCAGATA CGGCTTTCAC GCTGTGAGTG CAGGAGCCGG TGCAAGTGCC
TCAGCGCAAT TTCTGATTGT CTCGCCACCA CAAATTACGT CGGTGGTACC GGGATTTCTT
CGTGCGGGTA CTGTCGCGAC ATTTTCGGGA CAAAATCTCA TGGACCCGTT TCGGCAAACA
TGGTGTGGGC ACGACGGTAT AGCTCTCGTC GCGCACGCGG TGTCGAGCGC GCTCATCCGA
TGCTCAGTCC CGTACCATCA TCAGCTGCCT TCGGGTTCCT CTAGTTCTGC GCACGATCTA
ACCATAGACG TGCTTTCTGA TTTATCGTCG CCGGGGGCCG GAGGAACCAC CATGGGCTGG
CTACCAGTAG CGAATGATTT AGAGAGCATC ACGCCAAACG TCGGAGCGAC GAGCGGTGGC
ACTCGAACGG TGCTGAAACT CACGGGTGGA ACTATCCCAA CTTCATCTTA TTACACACCC
ACGTGCAGAT TTGGCACGAT AGTAACGTCG GCGATACACG TCCCTGGAGG TGGAGTTGCG
TGTAGCTCTC CTGCATACGC CGCGCGTAAC GTGACGGTGG GCATTGACGT CGAGTCGACG
ATAGAATTTC AGTACGTCGC ACAGATTGGT GTGAGTGCCA TAGTACCAGG AGCTCTTCCA
CAGAATGGGG GCTCGATATC TGTGTACTTG GATGCGGCAC TTTCGTCTTC GTATTCAGCT
GACTGCGTTT TCGTCACCAG CGGCGGGGAT AAATTAAAGT CTTCTCTCGC CGATTCCTCG
GGAACGCTCA AGTGCGCCTC GCCGCAGACT GGTATTGGCT TTGCAACGAT GGCGATCGTC
GTATCTGCGG CTAGCACGAA CAACACTGCG TTTATAGACG AAGCAGCTGG GCAGTACTTC
GACCTTGAAG TTCAAACCAG AACGCCTGGG GTGGAAGTTT TCCTCCCAAC GGGCGCGAAT
TGGGTACAGG CGGAGGAAGT CATACACGCT GTGACGTCTG ATGGATCTCT TCTCGGCGAC
AATTCGGGAG ACGACGATTT CTGGTGTGTG TTCGGAAAAG CTGGAATATA TGGAGGGTCA
ATATATTTGA GTGCGGCGAG TAGAGTATCG GGAACTATCC TGAAGTGCAC CGTACCAGAC
CTCGATACGC TAATAGTTTC TGGGCAGAGT GGATTTGAGA TTGTCATCGG TATTTGTTCT
TCGACTGAAG TTTCGGCGAG CAGTTGCACC AGTTACGCGG GAACACGCGT GAGATACGAG
AAAAAGCTGG GAGTTTCTTC AGTACTTCCA GTGAACGGTA CGCAAGCAGG AGGCGATGTT
GCAGTTCTCG TTGACTCTGC ATCCGTGAAA GGTTTCGGCG CAAACGTGCC AAGCTGCAGA
TTTGGCACAA TCTATCCCGT TGCGGGAACT TCGGTAACTG GAACGGGCGA GATAAATTGC
GTGACACCGG CGCATGCTGC GGGTGTCGTT CCTGTGAGTG TTCCTCCGCT CGATTTGGGG
TTGACGGCTT TGACATTTGA GTATATTTCC GTGTCCACTT CGCAATCGGT CTTGTCCACG
ACGTACGGCG CCGATCCGTA CGTCGTTGCA TACCTGATTG AACCGACACC AATGATCACA
GAGGTCGTGC CTTGGGTTGG TTGGAGTGGA AGTGTTGTCA CCTTGCTAGG CACAAACTTT
CCAACTGGAT CCGCCGTCAA GTGCAGATTC GGATCTGTCT CCGTCGATGC TCAGGTTGTT
TCTACGGCTG TGATACAGTG CGGCGATACT TCGCCGATTA CCTCTGACGA CGTCGAAGAG
CAGCGCGTCG CTGTCACGAC GAGTTCGGGC GATTCAAATC CGAATGTGAC GACGTTAGCA
CACTATGTCA TTACTCAAGG AGATATCAGC GCCATTGATG CCGCTGACGG TTGGCAACAG
GGTGGAAACG TAGTCGGCGT CACCGTTGCC AAGTGGGTCC CTGAAGGCTA CACATCCTGT
CGCTTTGGCA CGATAACCGT TCAGAGTAGA GGCGGAGACG GCTTCGGTGC GATTGGCAAG
GCGTCGGTAT CGCAGTCATC GCAGTGGTGG TCAGATTCGA CTGATGGCAA GAAAATAGAG
TGCGTATCTC CAGCAGGAGC GCAAGGAAAC GTAAATTTGG GAGTATCCAT CCTCGGAAGC
ACTGCATCGT CTTTCATTGG TACTACGTTT ACCTACATTT AG
 
Protein sequence
MRWRTSGRLW TAVVWTLATI ARGRTATAQY QTTATTLTLE HVWPDTGAVY ASTVVSLYGS 
GYANALPPMG CRFGEIVANK DSAASTTSKV VCATPTNVFA GFVAVGLAQA TGKRYVPGSD
DLVVDNGQHS FEFVVPWKLS KVNPEYAYKS GGEVLRLSGT HFRPGMLCTF KDSSLTSEYR
FISSALAMCE TRASSEAEGT VDLNLTPTHA VGGGGASVEY QTAPIIDGPL VSTTAVGGDV
VIQASSSTPL SGAIASFTAS PIRIGCWFDG IWVAATLRSE RELVCKAPLQ SFGTPSLSVV
DMYAQRMFPT NATQTGWFTS FTVSKDEVVD VVLPSVGNAM RNTVADSSTL VDFFGRNLIA
GSGSVGARIC QALNPVTSPV LVTNQVQSKC DFPATPSETA ALSTLRYGFH AVSAGAGASA
SAQFLIVSPP QITSVVPGFL RAGTVATFSG QNLMDPFRQT WCGHDGIALV AHAVSSALIR
CSVPYHHQLP SGSSSSAHDL TIDVLSDLSS PGAGGTTMGW LPVANDLESI TPNVGATSGG
TRTVLKLTGG TIPTSSYYTP TCRFGTIVTS AIHVPGGGVA CSSPAYAARN VTVGIDVEST
IEFQYVAQIG VSAIVPGALP QNGGSISVYL DAALSSSYSA DCVFVTSGGD KLKSSLADSS
GTLKCASPQT GIGFATMAIV VSAASTNNTA FIDEAAGQYF DLEVQTRTPG VEVFLPTGAN
WVQAEEVIHA VTSDGSLLGD NSGDDDFWCV FGKAGIYGGS IYLSAASRVS GTILKCTVPD
LDTLIVSGQS GFEIVIGICS STEVSASSCT SYAGTRVRYE KKLGVSSVLP VNGTQAGGDV
AVLVDSASVK GFGANVPSCR FGTIYPVAGT SVTGTGEINC VTPAHAAGVV PVSVPPLDLG
LTALTFEYIS VSTSQSVLST TYGADPYVVA YLIEPTPMIT EVVPWVGWSG SVVTLLGTNF
PTGSAVKCRF GSVSVDAQVV STAVIQCGDT SPITSDDVEE QRVAVTTSSG DSNPNVTTLA
HYVITQGDIS AIDAADGWQQ GGNVVGVTVA KWVPEGYTSC RFGTITVQSR GGDGFGAIGK
ASVSQSSQWW SDSTDGKKIE CVSPAGAQGN VNLGVSILGS TASSFIGTTF TYI