Gene OSTLU_42662 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_42662 
Symbol 
ID5003137 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp615767 
End bp618580 
Gene Length2814 bp 
Protein Length924 aa 
Translation table 
GC content57% 
IMG OID640418558 
Productpredicted protein 
Protein accessionXP_001419436 
Protein GI145350050 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0525] Valyl-tRNA synthetase 
TIGRFAM ID[TIGR00422] valyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAAGC CCGGAAAACG ACACGGAGAC CTGCCGAAGA ACTTTGAGCA CCAAAACGTG 
GAGGAAAACC TGTACGAGTG GTGGGAGTCG CGCGGATACT TTGCCCCGAA CGATGAGACG
GCCACGGGAC CGCCGTTCGT GATCCCGATG CCGCCACCGA ATGTCACGGG AGCGCTGCAC
ATGGGACACG CGATGTTCGT GACGCTGCAG GACGTGATGA CTCGAAGCGC GCGAATGCGT
GGTCGCAAGA CGCTGTGGCT GCCGGGGACG GACCACGCGG GCATCGCCAC GCAATTGGTC
GTGGAGAGAA AGTTAGAGAG CGAGGGAGTG AAACGAACGG ATATGACGCG CGATGAGTTC
GTGGAGCGAG TGTGGGAGTG GAAGGCCGAG TATGGGGGAC GGATTCAGCA GCAGATTAAG
AGGTTGGGGG CGTCTTGCGA TTGGTCTCGC GAGAGATTTA CGCTCGACGA GGGGTTATCT
GAGAGCGTGC TCGAGGCATT CATCACGCTG CACGATCGCG GTCTCATTTA TAAAGGAACC
TACATGGTGA ACTGGGCGCC GAAGTTGCAA ACCGCGGTGA GTGACTTGGA GGTCGAGTAC
ACCGAAGAGC CGGGGACGTT GTACTTTTTC AAGTACCCGG TCGAGGGCGG CGGCGCGGAC
GACTACCTTC CGGTGGCAAC GACGCGTCCG GAAACGATTC TTGGTGACAC TGCAGTCGCG
GTCAACCCAG AAGACGATCG CTTCAAGCAC ATGATCGGGA AGAGATGCGT CGTGCCCTTC
ACCAACGGTC GCACCGTGCC CATCATCGGC GACTCGTACG TGGACATGGA ATTCGGCACG
GGGGCGTTGA AGATCACGCC GGGACACGAC CCGAATGATT ACGAAATCGG CAAGCGTGTG
GGCTTGGATT TGATCAATAT TATGAACAAA GATGGTTCCA TGAACTCTAA CTGCGGCAAG
TATGCCGGCA TCGATCGCGC GGATTGCCGA AAGCAACTCT GGGCGGATAT GGAGGCGGAA
GGTTTAGCAA TCAAGGCGGA GCCGTATACG AACCGCGTGC CTCGCTCGCA GCGGGGTGGC
GAAATCATCG AACCAATTGT TTCTGAGCAA TGGTTCTGTA AGATGGAAAC CATGGCGGAG
CCCTCGCTGA AGGCTGTGGA GACTGGCGAA CTCACCATCA TCCCGCAGCG TTTTGAGAAA
ATTTACAAGT CTTGGTTGAC GGATATCCGA GATTGGTGCA TCTCGAGACA GCTGTGGTGG
GGTCATCAAA TCCCAGTGTG GTACGTGCAC GATTCTGAAG AAGATCTCGC GCGTGCGAGA
GAGGGGGAAG GTAAAGGCAC GAATAAGCGA TACGTAGTGG CGCGAAATGA CGCCGACGCC
GCAGAAAAAG CGAAGGCGCA ATACGGCGAA AACGTCGTGT TGTACCGTGA AGAAGACGTG
CTCGACACTT GGTTTTCTTC TGGTCTGTGG CCGTTCAGCA CGTGTGGGTG GCCCAACGAA
GAAGCTCCTG ACATGAAAAA TTTCTTTCCG GCGAGCGTGC TTGAGACTGG GCACGATATT
TTATTCTTTT GGGTGGCGCG TATGATCATG ATGTCGTACG GGATGACGGG GAAATTACCC
TTCCACACTG TCTTTCTCCA CGGCCTCGTC CGCGATTCGC AAGGTCGCAA AATGAGCAAA
TCGCTTGGTA ACGTCGTGGA TCCGTTAGGC GTCATCGCTG AGCAAGGCTG CGACGCTTTA
CGATTCACGC TCGCCACCGG CACCACGCCC GGTCAAGACT TGAACTTGAA CCTCGAACGT
TTGGCGTCGA ACAGAAACTT CACCAACAAG ATTTGGAACG CCGGTAAGTT TGTGCTGTAT
AGTATGGAGG ACATGACAGA CGATGAGCGC ATGGCGCTCG TCGACGAAGG TGCTGCGCTT
TGCGCCGATG AAGCGTCAAT TGCCGAGCTT CCGCTCGCCG AGCGTTGGAT CGTGAGCAAA
TTGAACGCGA CGGTGGATCA CGTCACCGCC GCGCAAGACA AGTACGACTT TGGTGAGGCT
GGTAGATCGA CGTACACGTT CTTCTACGAC TCCTTTGCCG ATTGGTTCAT TGAAGGCGCT
AAAAGTCGCA CGTACGGAGA CGATGCCGAC GCCGCGCGAG TCACCAAAGC GGTGACGCTG
TACGTTTTGG ATCAAACGCT TCGATTGCTC CACCCATTCG TGCCGTACGT GACGGAGGAA
GTTTGGCAAG CGCTTCCGCA TCGCGGCGAG GCGTTGATAG GCCAAGACTG GCCCGCGCTT
GAAGCTTACG TGGACAAATC CGCAGTGTCG ACGTTTGAAA ACTTACAATC GATCGTCACG
CGTATTCGCA ACGCGCGCGC CGAGTACTCG GTGGAGCCGG CGAAGCGTAT TCCCGCCGTC
ATCGTAGCGA CGGATGCCGA AGCGAACGCG GCGTACAGCG CCGAGTTGAA TCTTATCGCC
ACGCTCGCGC GTTTAGACGC CGAGCAAACG TCCGTCGCGA GCGCGCCACC CACCGAAGTC
GCGAGCGCGC CGGAGAATTT CGTTCAAATC ATCGTCAACG AAAGCCTGGA GGTGTACTTG
CCGCTCGCGG GTCTCGCGGA TCCGGTTAAA GAAATCGCCC GCCTCACGAA GCAAGAGACA
AAGATGCAAA AGGAAATCGA TGGGCTGGCT GATCGCGTCA ACTCGCCCGC GTTCGTCGAC
AAAGCCCCCG CGGCGGTCGT AGAAAAAGCG CAGAAGGAGT TGAGCGAGTT GCAAGAGCAA
CTCGCCGTCG TTCGATCTCG CATCGAACAA ATGCAGGCGC TCGTCGCCAA GTGA
 
Protein sequence
MPKPGKRHGD LPKNFEHQNV EENLYEWWES RGYFAPNDET ATGPPFVIPM PPPNVTGALH 
MGHAMFVTLQ DVMTRSARMR GRKTLWLPGT DHAGIATQLV VERKLESEGV KRTDMTRDEF
VERVWEWKAE YGGRIQQQIK RLGASCDWSR ERFTLDEGLS ESVLEAFITL HDRGLIYKGT
YMVNWAPKLQ TAVSDLEVEY TEEPGTLYFF KYPVEGGGAD DYLPVATTRP ETILGDTAVA
VNPEDDRFKH MIGKRCVVPF TNGRTVPIIG DSYVDMEFGT GALKITPGHD PNDYEIGKRV
GLDLINIMNK DGSMNSNCGK YAGIDRADCR KQLWADMEAE GLAIKAEPYT NRVPRSQRGG
EIIEPIVSEQ WFCKMETMAE PSLKAVETGE LTIIPQRFEK IYKSWLTDIR DWCISRQLWW
GHQIPVWYGE GKGTNKRYVV ARNDADAAEK AKAQYGENVV LYREEDVLDT WFSSGLWPFS
TCGWPNEEAP DMKNFFPASV LETGHDILFF WVARMIMMSY GMTGKLPFHT VFLHGLVRDS
QGRKMSKSLG NVVDPLGVIA EQGCDALRFT LATGTTPGQD LNLNLERLAS NRNFTNKIWN
AGKFVLYSME DMTDDERMAL VDEGAALCAD EASIAELPLA ERWIVSKLNA TVDHVTAAQD
KYDFGEAGRS TYTFFYDSFA DWFIEGAKSR TYGDDADAAR VTKAVTLYVL DQTLRLLHPF
VPYVTEEVWQ ALPHRGEALI GQDWPALEAY VDKSAVSTFE NLQSIVTRIR NARAEYSVEP
AKRIPAVIVA TDAEANAAYS AELNLIATLA RLDAEQTSVA SAPPTEVASA PENFVQIIVN
ESLEVYLPLA GLADPVKEIA RLTKQETKMQ KEIDGLADRV NSPAFVDKAP AAVVEKAQKE
LSELQEQLAV VRSRIEQMQA LVAK