Gene OSTLU_94354 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_94354 
Symbol 
ID5001895 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp525742 
End bp526905 
Gene Length1164 bp 
Protein Length387 aa 
Translation table 
GC content60% 
IMG OID640417316 
Productpredicted protein 
Protein accessionXP_001417800 
Protein GI145346654 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0031] Cysteine synthase 
TIGRFAM ID[TIGR01136] cysteine synthases
[TIGR01139] cysteine synthase A 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.25713 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00194835 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGATGC GAGCGCATCA CGCGCGCGCG ACGGTGACGA CGACGAAGAG AGGGCGATGC 
GAGCGACGCG ATAGGGCGCG AACGACGCCG CGGGCGCGGA TTTATGAAAA TATTCTCGAG
ACGGTCGGGG ACACGCCGGT GATCAAGGTC AACCGACTGG CGCCGGCGGG GATCGATATG
TACGTCAAGT GCGAGTATTT TAATCCGTTG AGCAGCGTGA AGGATCGATT GGCGGTGGCG
GTGATCACGG ATGCCGAACG ACGGGGGTTG TTGAAGCCGG GGGATACGGT GGTGGAGGCG
ACTTCGGGCA ACACCGGGAT CGCGGTGGCA ATGGCGTGCG CTCAACGCGG CTACAGATGC
GTCATCTGCA TGGCCGAGCC GTTTTCTGTG GAACGTCGGA AGATCATGCG CATGCTCGGC
GCGAAAGTCA TCGTGACGCC GAAGGGGGGT AAAGGTACGG GGATGGTGGC CAAGGCGGAG
GAATTGGCGG AAAAGAATGG TTGGTTTTTG TGCCGACAAT TCGAGAACGA AGCCAATCCC
GCGTACCACG CCTCGACTAC GGGGCCGGAA ATCTTGCGAG ACTTCGCGGG TAAGAAGCTC
GACTATTTCG TCACCGGTTA CGGCACCGGC GGTACGTTCC AGGGCGTCGC GCGAACGCTC
AAGGAATCTC GTCCGGACAC CAAGGTGATT TTGCTCGAAC CCGAAGCCGC GGCGTTGGTG
ACTTCCGGCA TCAAGACCGA GCGTAAGCCC ACGGGCGCCC CGAATGGGTC TCATCCGGCG
TTCGCGGCGC ACCCTGTGCA AGGTTGGACG CCCGATTTCA TCCCTTTGGT TCTCGAAAAT
GGTTTGAACA TGAACCTCTA CGACGAACTC GTGAAGATCG AAGGCGGCGA CGCCGTCAAG
ACGGCGCAAG CGTTGGCGAG AAGCGAAGGT ATCTTCACCG GTATTTCTGG TGGCGCCACG
TTCGCCGGTG CGCTCAAGGT TGCCGAAAAG GCGCCGAAGG GCTCGGTGAT CTTGGCGATG
TTGCCGGATA CGTCTGAGCG TTACATGAGC ACGCCACTTT ACGACTCGAT CGAGGCGGAC
ATGAACGAGG AAGAGCTCGA GATCGCGAAG TCGACGCCGT CTTTCCAACT TATCCCGGGC
CAAGAACCGA CGCTGCAAAT GTAA
 
Protein sequence
MTMRAHHARA TVTTTKRGRC ERRDRARTTP RARIYENILE TVGDTPVIKV NRLAPAGIDM 
YVKCEYFNPL SSVKDRLAVA VITDAERRGL LKPGDTVVEA TSGNTGIAVA MACAQRGYRC
VICMAEPFSV ERRKIMRMLG AKVIVTPKGG KGTGMVAKAE ELAEKNGWFL CRQFENEANP
AYHASTTGPE ILRDFAGKKL DYFVTGYGTG GTFQGVARTL KESRPDTKVI LLEPEAAALV
TSGIKTERKP TGAPNGSHPA FAAHPVQGWT PDFIPLVLEN GLNMNLYDEL VKIEGGDAVK
TAQALARSEG IFTGISGGAT FAGALKVAEK APKGSVILAM LPDTSERYMS TPLYDSIEAD
MNEEELEIAK STPSFQLIPG QEPTLQM