Gene OSTLU_33832 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_33832 
Symbol 
ID5000870 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp461962 
End bp465132 
Gene Length3171 bp 
Protein Length1056 aa 
Translation table 
GC content56% 
IMG OID640416291 
Productpredicted protein 
Protein accessionXP_001416669 
Protein GI145344290 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0525] Valyl-tRNA synthetase 
TIGRFAM ID[TIGR00422] valyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.828638 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGAGGG CGACGATCGA TCCGGAGAAG GCGGCGAAGA AGGCGGCGAA GCTGGCGGAG 
AAGGAGGCGA AGAGGGCGAA GGCGGCGGCG AAGGCGGCGA AGGCGGCGGA GGAAAAGGCG
AAGAGGGACG CGGGCGGCGG CGGCGAGGGC AAGGCGAAGA GGGAGAAGAA GGCGAGCGGG
CCGAGCGAGG AGGACGTGCG AGCGCTGGAG AGCGCGCTGA AAACGCCGAG GGGGGAGATG
AAGGATTTGG TGGGGACGCC GATGGCGAAG TCGTATAATC CGGTGGCGGT GGAGGCGGCG
TGGTACGACT GGTGGGAACG GTGCGGGATG TTTACGCCGA CGATGGGGAC GAATAAGCCA
AAGTTTGTCA TCGTGATTCC GCCGCCGAAC GTCACGGGTG CGTTGCACAT TGGACACGCG
CTGACGAACG CTATTCAAGA CACCATCGTG CGGTGGCGAC GAATGCAGGG ATATGAAGCG
CTGTGGGTTC CGGGGACGGA TCACGCGGGC ATCGCGACGC AAACCGTGGT GGAGAAGAAA
TTGCAGCGCG AGGAAGGCGT CACGAGGCAC GATCTCGGGC GAGAAAAGTT CCTCGAGCGC
GTGTTCGAGT GGAAAGACGT GTATGGGGGC AAGATCTGCA ATCAGCTTCG CCGAATCGGC
TCTTCTATGG ACTGGACTCG AGAGGCGTTC ACGATGGATG AAAAGTTGAG CAAAGCCGTC
AAGGAAGCCT TCGTGCGCAT GTTCGACGAA GGTTTGATTT ATCGCGACAA TAGATTGGTG
AACTGGAGTT GTCAGTTGAA GACTGCGATC AGTGATATCG AGGTGGATTA CATCGAGCTC
GATGGGCCGA CGATGCTCGC GGTGCCGGGG CACACCAAAA AGGTGGAGTT CGGCGTCATC
ACATCCTTTG CGTACCCGTT CGAAGACGGG CAAGGCGAAG TTGTCGTGGC GACGACGCGT
ATTGAGACCA TGCTTGGTGA CACCGCTGTT GCGGTGCACC CCGAAGACGA ACGCTACAAG
AGCTTACACG GAAAATTCGT TCTTCACCCG TTCAACGGTC GTCGTATTCC CATCATTTGC
GACGCCGAAC TTGTCGACAT GGAATTCGGT ACAGGTTGCG TGAAGATCAC CCCAGCCCAC
GATCCGAACG ATTTCAACAC CGGTAAGCGT CACGATCTTG AGTTTATCAA CGTATTCACC
GAGGAAGGAT TGGTGAACGA GCAAGGCGGC GAGCAATTCC AGGGCATGAA GCGCTTTGAG
TGCCGCGTCG CCATCACGGA GGCGCTCGAC AAGCTTGGTC TTTACCGGGG CAAGGCGAAC
AACCCGATGC GTTTGGGTCT TTGTAGCCGT AGTAAGGATG TCATCGAACC GATGCTCAAG
CCGCAGTGGT GGGTCAATTG CCAAGACATG GCAAAGGAGG CGTGCGATGC CGCTCGCGAC
AAGCGACTTG AAATTCTCCC TACTTTCATG GAACCGACGT GGTTCAGATG GTTGGAGAAC
ATTCGTGACT GGTGCATCTC CCGTCAACTT TGGTGGGGTC ATCGTATTCC AGCCTTCTAC
GTTCGTTTCG CCGGCGAAGG TGACGACGAC TGTGGGATGC CAGGCGGTAG CTCTGAGAAG
ATGGATCGAT GGGTCATCGG TCGTGACGCC GATGAAGCGC GCGTTTCGGC CGAGAAGAAG
TTCCCGGGTC GAGAATTCAC ACTCGAACAA GACGAAGACG TGCTCGACAC TTGGTTCTCG
TCCGGTTTGT TTCCGTTTTC CGTCTTTGGC TGGCCCGACG AGACACCGGA TTTGGCCGAG
TTTTACCCAA CGTCCCTCCT CGAAACTGGA CACGACATTT TGTTCTTCTG GGTCGCTCGT
ATGGTAATGA TGGGCATGAA ATTGACTGGT AAGGTGCCGT TCAAACAAGT GTACCTGCAC
GCCATGGTTC GTGACGCGCA TGGTCGTAAA ATGTCAAAGT CTCTCGGTAA CGTCATCGAT
CCTCTCCATG TCATTGAAGG TATCGACTTG GCCGCCCTCA ACGAGACTCT CGTCGGCGGT
AACCTCGACG AGAAGGAACG CAAAAAGGCG CAAGCGGGTC AAAAGGCTGA CTTCCCCGAT
GGCATTCCAG AATGTGGGAC CGATGCTATG CGCTTCGCGC TCGTGTCTTA CACCGCTCAA
GGTCGTGACA TCAACCTGGA CGTTCTCCGT GTCGTTTCGT ACCGCCACTG GTGTAATAAG
CTTTGGAACG CGACTAAGTT TGCGATGATG AATTTGGGTG ACGAATACGT GCCGCCGGCC
GATTTCAACG CTACTTTCGA TGTACAAAGT CTTCCGCTCG CGGCCAAGTG GGTTCTCAGC
CGCTTGAATG CAACATGCGC GTCGACAAAC GCGGGTATGG AAGTGTACGA TTTTAACACC
GCCACGAACT CTGTGTACGC GTTCTGGCAA TACGAACTCT GTGACGTTTA CATCGAGATC
ATCAAGCCTG TTATGAGTGG ATCGGATGAA ATGGCGAAGA AACAACTGCG AGATGCGCTT
TGGATTTGCC TCGACGCTGG TTTGAGACTT CTCCACCCGT TCATGCCCTT CGTCACCGAA
GAGCTCTGGC AGCGACTTCC GCGCACTCGC AATGAGAACA CACCGAAGAG TATTATGATT
GCTGAGTACC CAGTCGCGGT CGATTCTTGG GCCAACGCCA CGGCTGAGTC GCAGATGCAA
ATCATAATGG ACTCTGTTAA GGCGTTCCGA TCGTTGAAGT CCAACTACAA CCTTCCTCCC
AGAGCGAGAC CGGATGTTTT CTACAGTGTC AAGACGGATG ACAGCGAAGC GGCGATGAAG
ATTGATCCAG AAGGTTTGGT CTCTCTCGCG GGCGTCGGTG AGTTGAAACA GCTGGCTCAA
GGTGAATCTG CGCCGCCGGG TTGCGCGGTG AGCATCGTGA ACGAGTCCGT GACGGTCTAT
GTTCTCTTGA AGGGAATCGT CGACGCCGCG ACGGAAATTG CCAAGCTCGA TAAGAAGCTT
GATTTGCTCA CCAAATCGAC AGAAGCGCTC GTCAAAAAGA CGCAAGAAGA TGGCTACGAG
ACCAAGGTTC CGGAGAAGGT TCGCAACGAA AACGCCGACA AGATCGCTAA GCAAGCTGAA
GAAATTGATT CCATCAAGGC CGCACGTGCG GACTTTGAAG CGTTGCTGTA G
 
Protein sequence
MERATIDPEK AAKKAAKLAE KEAKRAKAAA KAAKAAEEKA KRDAGGGGEG KAKREKKASG 
PSEEDVRALE SALKTPRGEM KDLVGTPMAK SYNPVAVEAA WYDWWERCGM FTPTMGTNKP
KFVIVIPPPN VTGALHIGHA LTNAIQDTIV RWRRMQGYEA LWVPGTDHAG IATQTVVEKK
LQREEGVTRH DLGREKFLER VFEWKDVYGG KICNQLRRIG SSMDWTREAF TMDEKLSKAV
KEAFVRMFDE GLIYRDNRLV NWSCQLKTAI SDIEVDYIEL DGPTMLAVPG HTKKVEFGVI
TSFAYPFEDG QGEVVVATTR IETMLGDTAV AVHPEDERYK SLHGKFVLHP FNGRRIPIIC
DAELVDMEFG TGCVKITPAH DPNDFNTGKR HDLEFINVFT EEGLVNEQGG EQFQGMKRFE
CRVAITEALD KLGLYRGKAN NPMRLGLCSR SKDVIEPMLK PQWWVNCQDM AKEACDAARD
KRLEILPTFM EPTWFRWLEN IRDWCISRQL WWGHRIPAFY VRFAGEGDDD CGMPGGSSEK
MDRWVIGRDA DEARVSAEKK FPGREFTLEQ DEDVLDTWFS SGLFPFSVFG WPDETPDLAE
FYPTSLLETG HDILFFWVAR MVMMGMKLTG KVPFKQVYLH AMVRDAHGRK MSKSLGNVID
PLHVIEGIDL AALNETLVGG NLDEKERKKA QAGQKADFPD GIPECGTDAM RFALVSYTAQ
GRDINLDVLR VVSYRHWCNK LWNATKFAMM NLGDEYVPPA DFNATFDVQS LPLAAKWVLS
RLNATCASTN AGMEVYDFNT ATNSVYAFWQ YELCDVYIEI IKPVMSGSDE MAKKQLRDAL
WICLDAGLRL LHPFMPFVTE ELWQRLPRTR NENTPKSIMI AEYPVAVDSW ANATAESQMQ
IIMDSVKAFR SLKSNYNLPP RARPDVFYSV KTDDSEAAMK IDPEGLVSLA GVGELKQLAQ
GESAPPGCAV SIVNESVTVY VLLKGIVDAA TEIAKLDKKL DLLTKSTEAL VKKTQEDGYE
TKVPEKVRNE NADKIAKQAE EIDSIKAARA DFEALL