Gene OSTLU_24923 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_24923 
Symbol 
ID5003079 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp286989 
End bp289233 
Gene Length2245 bp 
Protein Length652 aa 
Translation table 
GC content55% 
IMG OID640418500 
Productpredicted protein 
Protein accessionXP_001419119 
Protein GI145349392 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0018] Arginyl-tRNA synthetase 
TIGRFAM ID[TIGR00456] arginyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.321104 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.342415 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCCGA AGGAGGTGGC GGAGGAGGTG GCGAGGGAGT TCGGGGAGGA GGTGGCGAAG 
CGGCCGCGGC GAGGCGCGAT CGTCGGGTGC GAGGCGGCGG GGGTGTACGT GAACTTGAAA
ATGAATCGAC CGATGGTGTT CGAGGCGACG ATGAAGGCGG TGGCGAGACG GGGAGGAAAG
TTTGGGCACA CCGATGCGGC GAAGGGGAAG CGGGTGATCA TCGAACACAC GAGCTCGAAC
CCGAACGCGC CGCTGCACAT CGGCAACCTG AGAAACGTCA TGATCGGCGC GCACTTGTCG
AGGATGATGA AGGCGTGCGG ACACGACGTG AAGGAGGCGT TTTACGTCAA TGATTTGGGC
GCGCAAATCG GGTTGACGGC GTTGGCGTAT TCGAGGATTT ATGACAAGAT GACGCCATTC
ATGAAGATTG ACCACTGGAT CGGGGCGATG TACGCGGTGA TGAACACGTG TCAAGAGTTG
CAACAAGTGG GCGTGAAACC GGGCGAGCTC GAGGTGCGTG CGCGATCGAG ATGCGCGTGA
TGTGCGTTCG CGTCGCGTGT ACGGCTATGA TTAATGTGTT TTTTAACTCT CTTTCTTACT
TGAACCGTGG AGAAAAACCA TATTCACCTC TGACAATATG AGCCGTCGAC GCGTCGCGAG
ACGAGGCGAA CCGTTATGCG GACGAGTCGG GCCTTGACGA GAAAAATGTT TTTGACTAAC
GATGAATACG TTTTCGATAT CACCGCAGGA CGCGTGCAAA GCCGGGCAAG AAGCGGTGGA
CGCGCTCTTG AAGACGTCGC TCGCGGCGGT GGCGGGCGAC GAGAAGAAGG AGAAGGGCGT
CGCTGAATAC TTTGATATTT ACCAAGATCT TCGCGGACGA TTCGAAAAGA TGATGGTCGT
CATGCTCGAA GACATTCGCA CGATCGACGA TATCAAGGTT GAGGCCGGTA AGCTTAACTT
GGCGTACGAG AAGCAAGAGC CTTGGGCTAT TAAGATTTTC CGCAAAATGG TGTGCGACTG
TCTCACCGGC GTGCAAGAAA CGCTCTCGAC GTACGGCGTG CAGCACGATC GATTCGACTT
CGAGTCCGAA CTCGGCTGGG AGGGCTCGAA TGCGAAAGTA TTGGAGATCA TGCAAAATAG
CGACTATTAC GTGCCGCAGA CGCAGAGCAA CGAAAAGGGC GTGCCGCAAG GCGCGTACCT
CGACATGGCT GGCTTCATCA CGGATATGGG TTTCAAGGTT GGTAAGGGGG GGTACCAAAA
GGAGTACCCG CCTCTCTACG TCCTCCGTCC CGACGGATCG ACGCTTTACA CCTACCGTGA
CATCGTGTAC TCGTTCAAGA AGGCTTCGAT GAGCGATTTG ATTTTGAACA TTATTTGCAG
CGAGCAAGAT TTGGCGCAGC AAAAAGTGTC GTTGGCCATG GCGATGATGA ACCCGGCAAT
GGAGGGTCGC CAATACCACT TATCATACGA CCTCGTAAAG CTCACCACGG GGAAGATGAG
TGGTCGCCGC GGTCGCTACT TGTTGGCCGA TGACTTGTAC GAAGACCTCA AGACGGTCAT
TCGTGAAAAG ATGGACAAGA AGTACAAAGA GAAGGGCGAG GTCATCTCCG CGGAGATGTT
CGACACGGTG ACGCACGAAG TATCTACGGC AGCGATGAAA TACGCGCTTT TATCAGTGAG
TTGCATGACG CAAATTAACT TTGACATCGC AAAGATTACC GATTTCGAAG ACGCTTCGGC
GCCGTTCATC CTGTACAACT CCACCCGCCT CACCTCTGTC ATTCGCAAGT TTGACGAACG
TTCCGCCGCC GGTGTGTTGG AAAAGCTCTG CCCTCTCGAT GAAGTAGACT TCACCAAGCT
TGACGACGAT CGCGAATGGG CCTTGCTCTT AGACTTTGTC CTGCCCTTCG CCAGTATGAT
CACCGACGCG GCGATGCCGA CGTTACCCAA GCCCCCGGCT TTACCCTCGT ACGGCATGCA
CAGGGTGTGC GATTTCCTCA ACATGTTCGT TCGCGCCCTC TCGGGTTACT ACGGCCCCGC
GGGCGTTCGC ATGATGCCCG TTCAAAGCCA ACTCGACGCC GGCTGGAACG AGACACCCTC
GATGCACGCT CGCATCCACG CGTGCAAGTG CTTCAAGCAG GTCATCGACA ACGGTTTGCG
TTTGCTCATG ATCGAGCCAC TCGAGCGCAT GTGATTCCAT CGCACCTACT CATTCGTTGT
CATTCAATCA ATCACAATCG ATCAT
 
Protein sequence
MNPKEVAEEV AREFGEEVAK RPRRGAIVGC EAAGVYVNLK MNRPMVFEAT MKAVARRGGK 
FGHTDAAKGK RVIIEHTSSN PNAPLHIGNL RNVMIGAHLS RMMKACGHDV KEAFYVNDLG
AQIGLTALAY SRIYDKMTPF MKIDHWIGAM YAVMNTCQEL QQVGVKPGEL EDACKAGQEA
VDALLKTSLA AVAGDEKKEK GVAEYFDIYQ DLRGRFEKMM VVMLEDIRTI DDIKVEAGKL
NLAYEKQEPW AIKIFRKMVC DCLTGVQETL STYGVQHDRF DFESELGWEG SNAKVLEIMQ
NSDYYVPQTQ SNEKGVPQGA YLDMAGFITD MGFKVGKGGY QKEYPPLYVL RPDGSTLYTY
RDIVYSFKKA SMSDLILNII CSEQDLAQQK VSLAMAMMNP AMEGRQYHLS YDLVKLTTGK
MSGRRGRYLL ADDLYEDLKT VIREKMDKKY KEKGEVISAE MFDTVTHEVS TAAMKYALLS
VSCMTQINFD IAKITDFEDA SAPFILYNST RLTSVIRKFD ERSAAGVLEK LCPLDEVDFT
KLDDDREWAL LLDFVLPFAS MITDAAMPTL PKPPALPSYG MHRVCDFLNM FVRALSGYYG
PAGVRMMPVQ SQLDAGWNET PSMHARIHAC KCFKQVIDNG LRLLMIEPLE RM