Gene OSTLU_34104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_34104 
Symbol 
ID5000899 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp850459 
End bp852699 
Gene Length2241 bp 
Protein Length746 aa 
Translation table 
GC content58% 
IMG OID640416320 
Productpredicted protein 
Protein accessionXP_001416778 
Protein GI145344518 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0215] Cysteinyl-tRNA synthetase 
TIGRFAM ID[TIGR00435] cysteinyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.121587 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGTCG ACGCGCGCGT GCGCGAGGGT TACTTTTTCG GGCGCGCGTG CGCGTGCGCG 
CCGCCCACGA GCTGGTGGCG CGATGACGCC GAACGCGCCG CGCGCGACGC CGCGGACGCG
CGTCGACGCG CGCCGGCGCT GCTGAACACG TTCACGAAGA CCAAAGTACC GTTTAAACCG
CTGAGCGGGA ACTCGGTGGG GTGGTACATC TGCGGACCGA CGGTGTACGA CTCGGCGCAC
GTGGGACACG CGCGCAACTA CGTCAACTTT GACGTCTTGC GCAGGGTGAT GATGGAGTAC
TTTGGCTACG ACGTGCGGTT CGTGATGAAC GTGACGGACA TAGATGATAA AATTATCATG
CGCGCGCACA CGAGACGGGC GGAGGCGGTG GTGAAGGCGG CGAGGGAAAC GGGCGAGACG
AGACTCGGAG CGGAGACGCT GGCGGTGGAA AAGTTGTTGG CGGAGGGTGG GAAACCGTTA
GGGGCGCTCG ATAGCGCCAC GCGGACGCTG GCGAACGCGG TGAAGGCGGC GATAGGAAGC
GGGATCGATG CCGAGCACTG TTGCGCGAAA GATTGGACGA TTCAGGATGG ATACCTGACG
CTCGCGCACC AATTCGAAGC CGAGTTCATG GAGGATATGA AATCCTTGGG CGTGGCGCGA
CCGGACATGC TGACGCGGGT TTCGGAGTAC GTGGATAAGG TGATTTTGTA CATTCAAGTC
ATCATTAACA AAGGATTCGC GTACGAGTCC AACGGCAGCG TGTACTTTGA CGTCAAGGCT
TTCGAGGCGG CGGAGAACCA CAAGTACGGC AAGCTGAATC AAAACGCCAT GGAAAACATC
ACGGAAGCCA TGGATGGAGA GGGCGCACTC GAGGCGGAAA AGAGCGAAAA GAAGTGCGAT
TTTGACTTTG TGCTTTGGAA GATGAGCAAA GACGGTGAGC CGTGCTGGAG CTCGCCCTGG
GGTATGGGTC GCCCGGGGTG GCACATCGAG TGCTCCGCGA TGTGCAGCGA CATTCTCGGT
CAATCCGTCG ATATCAACGG TGGCGGGATC GATTTGAACT TTCCTCATCA CGAAAATCAG
CTCGCGCAGT CGGAGGCGCA TTACGATACC GAACAATGGG TGAACTTTTT CATCCACACC
GGTCACTTGC ACATCGACGG GTTGAAGATG AGCAAGAGTT TGAAGAATTT CATCACCATT
CGCGCGGCGC TCAAGATGTA CTCGGCGAGA CAAATTCGGT TCTTATTCCT GCTTAACCAG
TGGTGCGATC CGATGGAACT CACCCCAGTC GCTGCGCCTG ATGGATCTGG CGTCATCGGC
TTTAAGCAGA TGGATCTCGC GCTGAGCATC GAACGCTTGT TTGTGGAATT TTTCCACTCC
ATCAAGGGCG TGTTCCGCAC CTCTGGAAGT TACCACGTCG ACAAGCAGCA GACTTGGAAC
GAGCGCGAGC GCGAACTTAG TGATGCGCTC GACACGTCTC AAGCCGCCGT GCACGAAGCA
CTCATCGACA ATATCAACAC CCCCAACACC CTTCTTGCAC TGCAAGATCT CGTCAAGGCG
ACGAATAAGT ACCTCGCGGA GACAGGCAGC GTTGATGTTC GACCACTTCT CCTCGAACGC
GTGGGCAAGT TTGTCACGAA GATCCTTAAC TGCTTGGGCG TGTGCTTAGA CACCGGCGCG
GTCGGGTTCC CGGAGTCTTC GGAGGGTTCG TCCGAAGGTC GTGAAGAAAC GCTTTCACCG
TTTCTTGATT TGATGACGAA ATTCAGAGAC GACATTCGCA AGCTCGCCCA AGGCGGGGCG
TCCGCGAAGG AACTTCTCAC CGCGTGCGAC AACCTGCGAG ACGTCGGGTT ACCAGAGCTC
GGCGTGAAGC TCGACGACAA GGAGGGCGGT GCGTTGTGGA AGCTTTACGA CGCGGACGAG
TTGAAGAAAG AAATTGCGCG CGAGCACGAA GCCAAGGAGG AGAAGGAAAG AGCGAAGCGC
GCAGCCAAGG AGGAAGCGGC GCGCAAGGCG GCGGAAAAAG AAGCCAGGGC TAAGGTTCCA
CCGAGTGAAA TGTTCAAGAC GTTCAGTGAA TACGAAGGAT TGTACTCCAA GTACGACGAC
GACGGAGTGC CGACGCACGA CGCCGCGGGC GAGGCTTTGG CGAAGAGCGC GGCAAAGAAA
CTGCTCAAGT CGCGCCAACA GCAAGAGAAG GCTCACGAAA CGTACCTCGC CAAGGCGGGC
ATGGAAAAGC TCGCAGTCTA A
 
Protein sequence
MPVDARVREG YFFGRACACA PPTSWWRDDA ERAARDAADA RRRAPALLNT FTKTKVPFKP 
LSGNSVGWYI CGPTVYDSAH VGHARNYVNF DVLRRVMMEY FGYDVRFVMN VTDIDDKIIM
RAHTRRAEAV VKAARETGET RLGAETLAVE KLLAEGGKPL GALDSATRTL ANAVKAAIGS
GIDAEHCCAK DWTIQDGYLT LAHQFEAEFM EDMKSLGVAR PDMLTRVSEY VDKVILYIQV
IINKGFAYES NGSVYFDVKA FEAAENHKYG KLNQNAMENI TEAMDGEGAL EAEKSEKKCD
FDFVLWKMSK DGEPCWSSPW GMGRPGWHIE CSAMCSDILG QSVDINGGGI DLNFPHHENQ
LAQSEAHYDT EQWVNFFIHT GHLHIDGLKM SKSLKNFITI RAALKMYSAR QIRFLFLLNQ
WCDPMELTPV AAPDGSGVIG FKQMDLALSI ERLFVEFFHS IKGVFRTSGS YHVDKQQTWN
ERERELSDAL DTSQAAVHEA LIDNINTPNT LLALQDLVKA TNKYLAETGS VDVRPLLLER
VGKFVTKILN CLGVCLDTGA VGFPESSEGS SEGREETLSP FLDLMTKFRD DIRKLAQGGA
SAKELLTACD NLRDVGLPEL GVKLDDKEGG ALWKLYDADE LKKEIAREHE AKEEKERAKR
AAKEEAARKA AEKEARAKVP PSEMFKTFSE YEGLYSKYDD DGVPTHDAAG EALAKSAAKK
LLKSRQQQEK AHETYLAKAG MEKLAV