Gene OSTLU_18002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_18002 
Symbol 
ID5005396 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009368 
Strand
Start bp112339 
End bp113685 
Gene Length1347 bp 
Protein Length448 aa 
Translation table 
GC content56% 
IMG OID640420817 
Productpredicted protein 
Protein accessionXP_001421209 
Protein GI145353842 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0442] Prolyl-tRNA synthetase 
TIGRFAM ID[TIGR00408] prolyl-tRNA synthetase, family I 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value0.938893 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGATCC GGCCGTACGG GTACGCGATA TGGGAGGGGA TTCAAAAGTA TATGGATGCG 
AAGTTTAAGG CGACGGGGGT GCAGAACGCG TACTTTCCGC AGTTGATACC GTATTCGTTC
ATAACGAAGG AGGCGTCGCA CGTGGAGGGT TTCGCGCCGG AGTTGGCGTT GGTGACGCGA
GGGGGAGGGA AGGAGTTGGA GGAGCCGCTG GTGGTGCGAC CGACGAGTGA GACGATTGTG
AATAATATGC TATCGCAGTG GATTCAGAGT TATCGGGATT TGCCGATGCT GTTGAATCAG
TGGTGCAACG TGCACCGGTG GGAGATGCGC ACGCGGCCGT TCATTCGCAC GTTGGAGTTT
CTTTGGCAAG AGGGTCACAC CGCGCACGCG ACGGCGGAGG AAGCGGAGGA GAGAGCTATG
CAGATGATTC GTGTTTACGC CGAATTCGCC CAGACGCAGG CGGCGATGCC GGTGATTCCT
GGGAGAAAGT CGCGCGTCGA ATCCTTCGCC GGGGCCAACG TGACGTACAC CATCGAGGCG
ATGATGGGAG ATAAAAAGGC GCTTCAAGCG GGAACATCGC ACAATTTGGG CGATAACTTC
GCCAAGGCGT TCGACACTAC GTTTTTAGAC GACAAGGGCG AGACGCAGTA CGTGCATCAG
AGCTCTTGGG GGGTCTCTAC GCGCTTGATT GGCGGTATTC TCATGACGCA CGGCGACGAT
TCCGGGTTAA TTTTGCCCCC GCGTTTGGCG CCGATTCAAG TCGTCGTGGT GCCAATTTGG
AAGAAGGACG AAGAGAAGGA AGCGGTCATG GCATCTGTCG ATAGCATCAT TTCTTCTCTC
TCCAACGCGG GCGTTCGAAC CCATCTTGAC GCGGATCAAA GTAAGTCGCC GGGGTGGAAA
TTCAACCAGT ACGAAATGAA GGGCGTGCCG ATTCGCATTG AAGTCGGTCC GAAGGATGTC
GCGAAGGGTG CGTGCGTCGT CGCTCGTCGC GATGTTCCGG GCAAGGAGGG TAAGGAGTTC
GGCGTGAGTA TCGAGCCAGC CGCGCTCGAG ACCAAGGTCA ACGACGTGCT GAATGACATT
CAAAACTCGA TGTTGCAAAA GGCGACCGAG TTTCGCGACG CCAACATCGT CGACGTTAAA
ACTATGGACG AATTAAAGGC GACGATTGAG GCGGGGAAGT GGGCGCGATG CGGCTGGGAA
GGTACCGACG AAGAAGAAAA AGCCATCAAG GAGGAGACCG GGGCAACGAT TCGGTGCTTC
CCGTTCGATC AACCCGCGGG CGAGCACACG TGCTTGATGT CGGGTAAGCC GGCGAAGGAG
GTGTGTATCT TTGCAAAATC GTACTAA
 
Protein sequence
MVIRPYGYAI WEGIQKYMDA KFKATGVQNA YFPQLIPYSF ITKEASHVEG FAPELALVTR 
GGGKELEEPL VVRPTSETIV NNMLSQWIQS YRDLPMLLNQ WCNVHRWEMR TRPFIRTLEF
LWQEGHTAHA TAEEAEERAM QMIRVYAEFA QTQAAMPVIP GRKSRVESFA GANVTYTIEA
MMGDKKALQA GTSHNLGDNF AKAFDTTFLD DKGETQYVHQ SSWGVSTRLI GGILMTHGDD
SGLILPPRLA PIQVVVVPIW KKDEEKEAVM ASVDSIISSL SNAGVRTHLD ADQSKSPGWK
FNQYEMKGVP IRIEVGPKDV AKGACVVARR DVPGKEGKEF GVSIEPAALE TKVNDVLNDI
QNSMLQKATE FRDANIVDVK TMDELKATIE AGKWARCGWE GTDEEEKAIK EETGATIRCF
PFDQPAGEHT CLMSGKPAKE VCIFAKSY