Gene OSTLU_45194 details

Gene Information

Locus tag: OSTLU_45194
Symbol:
ID: 5001052
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009357
Strand:
Start bp: 377405
End bp: 378693
Gene Length: 1289 bp
Protein Length: 395 aa
Translation table:
GC content: 54%
IMG OID: 640416473
Product: predicted protein
Protein accession: XP_001416640
Protein GI: 145344231
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes
TIGRFAM ID: [TIGR02006] cysteine desulfurase IscS; [TIGR03402] cysteine desulfurase NifS
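
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that the start and end coordinates are both inclusive and that the protein is followed by a single stop codon; the function names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 377405
END_BP = 378693
GENE_LENGTH_BP = 1289
PROTEIN_LENGTH_AA = 395


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    return end_bp - start_bp + 1


def coding_bases(protein_length_aa: int) -> int:
    """Bases needed to encode the protein plus one stop codon (assumption)."""
    return (protein_length_aa + 1) * 3


# Does End bp - Start bp + 1 reproduce the reported gene length?
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)

# Bases in the reported gene span that are not accounted for by the
# protein-coding region under the assumptions above.
print(GENE_LENGTH_BP - coding_bases(PROTEIN_LENGTH_AA))
```

A nonzero difference on the last line would suggest that the reported gene span includes sequence outside the coding region.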


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.0622302
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.609433
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CGGCCGCTGT ACTTGGACAT GCAGGCGACG ACGCCGTTGG ACCCGCGAGT GCTGGACGCG 
ATGTTGCCCT ATTTCACAGA GCAATACGGG AATCCGCACT CGAGAACGCA CATGTACGGC
TGGGAGACGG AGGATGCCAT CGAAAAGGCG AGAGGAGAAT TGGCGTCGCT CATCGGGGCG
AACGCGAAGG AGATTGTGTT CACGAGCGGG GCGACGGAGT CGAACAACAT GTCGCTCAAG
GGGGTGGCGC GCTTTTACAA GGATAAAAAG AAGCACATAA TCACAACGAC GACGGAGCAC
AAGTGCGTGT TGGACTCGTG CAGACAGCTC GAACGTGAAG GTTTCGACGT GACGTATTTG
CCCGTGAAGG AAAATGGATT GGTAGACTTG AAGGAGCTTG AAGCGGCGAT GCGCGACGAC
ACCGCCATCG TCTCCGTCAT GGCGGTGAAC AACGAAATAG GGGTGATTCA GCCTTTGAAA
GCGATCGGTG AGCTTTGCCG ATCGAAGAAA ATATTTTTTC ACACCGATGG CGCGCAAGCA
GTTGGGAAGG TACCGATGGA TGTGAACGAT ATGAACATCG ACCTGATGTC GATTAGCGGG
CACAAGTTTT ACGGTCCCAA GGGGATCGGC GCTTTGTACG TCCGTCGTCG TCCTCGAGTT
CGGATGGAGC CTATCATCAA CGGCGGCGGT CAAGAGCGAG GGTTACGCTC GGGGACGCTA
CCGACCCCGC TCATCGTCGG TATCGGTGAA GCTGCTCGCG TGGCGCAGAA GGAGTTGCAG
CGCGACGAAG AGCACGTCAA CCGCTTGGCT AAGAGATTGA TAGAGGGCAT CGAATCTCGC
GTCGAGCACA CGCAATTAAA CGGTGACCGT GAAGCGCGCT ACCACGGCAA CGTGAACATG
TCCTTTGCAT ACGTGGAGGG TGAATCCATG CTCATGGGAC TTAAAGAAAT CGCGGTGAGC
AGCGGCAGCG CGTGCACGAG TGCGTCTTTA GAGCCATCCT ATGTTTTGCG TGCGCTCGGT
GTGAACGAAG AGATGGCGCA CACGTCGGTA AGATATGGAT TAGGCCGATT CACTACTGAA
GCCGAGGTCG ATCGCGCCAT CGAAGCCACA GTGCGTCAAG TCGAAAAGCT TCGTGAGATG
TCTCCGCTCT GGGAGATGGT CCAGGAAGGC ATAGATTTAA AGACGATCGA GTGGAGTCAA
CATTAACAAG CTCGCGCGCG CGTCATTTTT CGTAGAATTC AATTAGTTCA GTTCATGTTA
TCATCACCAC GTTTTGTTCG TAATATTCC
 
Protein sequence
MQATTPLDPR VLDAMLPYFT EQYGNPHSRT HMYGWETEDA IEKARGELAS LIGANAKEIV 
FTSGATESNN MSLKGVARFY KDKKKHIITT TTEHKCVLDS CRQLEREGFD VTYLPVKENG
LVDLKELEAA MRDDTAIVSV MAVNNEIGVI QPLKAIGELC RSKKIFFHTD GAQAVGKVPM
DVNDMNIDLM SISGHKFYGP KGIGALYVRR RPRVRMEPII NGGGQERGLR SGTLPTPLIV
GIGEAARVAQ KELQRDEEHV NRLAKRLIEG IESRVEHTQL NGDREARYHG NVNMSFAYVE
GESMLMGLKE IAVSSGSACT SASLEPSYVL RALGVNEEMA HTSVRYGLGR FTTEAEVDRA
IEATVRQVEK LREMSPLWEM VQEGIDLKTI EWSQH
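
The relationship between the gene sequence and the protein sequence can be checked by translating the nucleotides. The sketch below is illustrative only: it assumes the standard genetic code (the Translation table field of this record is empty), assumes the reading frame begins at the first ATG, and uses only the first 60-base line of the gene sequence as input. The names `CODON_TABLE` and `translate` are hypothetical helpers.

```python
from itertools import product

# Standard genetic code (an assumption, see above), written in the
# conventional T, C, A, G ordering with the third base varying fastest.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str, start: int = 0) -> str:
    """Translate dna from offset start, stopping at the first stop codon."""
    protein = []
    for i in range(start, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record, with the spaces removed.
first_line = (
    "CGGCCGCTGT ACTTGGACAT GCAGGCGACG ACGCCGTTGG ACCCGCGAGT GCTGGACGCG"
).replace(" ", "")

# Assumed reading frame: begin at the first ATG in the sequence.
orf_start = first_line.find("ATG")
print(orf_start, translate(first_line, orf_start))
```

The printed peptide can be compared with the beginning of the protein sequence above. Running the same function over the full gene sequence would show whether the whole protein is reproduced under these assumptions.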