Gene OSTLU_41218 details

Gene Information

Locus tag: OSTLU_41218
Symbol:
ID: 5002196
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009360
Strand:
Start bp: 807436
End bp: 808680
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table:
GC content: 63%
IMG OID: 640417617
Product: predicted protein
Protein accession: XP_001418339
Protein GI: 145347779
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0031] Cysteine synthase
TIGRFAM ID:
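The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the unspliced CDS ends with a single stop codon that is not counted in the protein length. Neither assumption is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(807436, 808680)
print(length_bp)                  # 1245
print(protein_length(length_bp))  # 414
```

Under these assumptions both values agree with the Gene Length and Protein Length fields.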


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGGCGT GCGTCGCGCT CGCCGTCGTC GTCGTCGCGA GGGCGTCGGC GCCGGTGCGC 
GGCGACGAAA GCGCCGCGAG GAAAGTGCGC CGTGGCGAAC GCGCGACGAA GGACGACGCC
GACGACGCGC GCGCGCAGTC AAACCGACGC GCGAACGTGC TGAGCGCGAT CGGAAACACG
CCTGTGATGC GCGTGGAGTC GTTGTCGCGC TTGACGCGAT GCGACATCTA CGTCAAGTGC
GAGTTTCTCA ATCCCGGCGG CTCGGTGAAG GACCGCGTGG CTCTGCGAAT CGTCGAGGAC
GCCTTAGCCA GTGGCGCGCT GCGACGAGGA GGGTTGTGCA CCGAAGGTAC CGCGGGGAGC
ACGGGAGTAT CGCTCGCCAT GGTGTGCAAA GCGATGGGGG TGGAATGTTT CGTCGCCATG
CCGGACGACG CCGCGAAGGA GAAATCGGCG CTCGTCGAGG CGTACGGCGC TCGGGTGGAG
CGCGTGCGAC CGGTGTCGAT CGCAAATCGT GGACACTTTG TCAATGTGGC CAGACGCGAG
GCTGAGCGCG CACGCGCGCG CGACGGCGTG GGCGGAGGGT ACTTTGCAGA TCAGTTTGAG
AATTTAGCGA ACTTTCGCGC GCACGCCGAC GGCACGGGAG TGGAAATATT TTCTGAAATC
GGCGCCGAAC TCGACGCCTT TGTGTGCGCG TGCGGTACCG GGGGCACGCT CGCGGGTGTG
GGGGTGGCGC TGAAGGAACG GAAGCCGTCT GTCAAGCTCT TTCTCGCGGA TCCGCAAGGA
AGCGGGTTGT TTAATCGCGT CTCGCGCGGC GTCATGTATA CGAAAGAAGA GGCGGAGGGA
AAGCGCTTGA AGAACCCGTT CGATACCGTG ACGGAAGGTG TGGGAATCAA TCGCATCACG
GAGAATTTCA AAGTTTTGCT CGATCGTCCG GGAATGCTCA CGGGCGCCGT GAAGGTGAGC
GACGCCGAGG CTGTCGCGAT GAGCCGCTTC GTCGCGAGGC ACGACGGGCT CTTCATCGGA
AGCTCAAGCG CCGTCAATCT CGTTTCTGCG GTGCGCGTGG CGCAATCGCT CGGACCAGGA
CATTGCATTT GCACGATCGC GTGTGACAGC GGACTGCGTC ACATGACGAA ATTCTGGGAC
GACGAATATC TCGCCAAGAT CGATTTGACG TCTCACGATG TCGCGTCGGC TGATTCCTTA
TCGTTTCTCG ACGACGACAC GGTGGTGACT GCGGCGCGTT GTTAG
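The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before any calculation. The sketch below shows one way to do that and to recompute a GC fraction that can be compared with the GC content field. The helper names are hypothetical, and the record does not say how its own GC value was rounded.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in the cleaned sequence (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Small illustrative input, not taken from the record.
print(gc_content("ATGC GGCC"))  # 0.75
```

Passing the full gene sequence above to `gc_content` gives the fraction to compare with the reported percentage.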
 
Protein sequence
MVACVALAVV VVARASAPVR GDESAARKVR RGERATKDDA DDARAQSNRR ANVLSAIGNT 
PVMRVESLSR LTRCDIYVKC EFLNPGGSVK DRVALRIVED ALASGALRRG GLCTEGTAGS
TGVSLAMVCK AMGVECFVAM PDDAAKEKSA LVEAYGARVE RVRPVSIANR GHFVNVARRE
AERARARDGV GGGYFADQFE NLANFRAHAD GTGVEIFSEI GAELDAFVCA CGTGGTLAGV
GVALKERKPS VKLFLADPQG SGLFNRVSRG VMYTKEEAEG KRLKNPFDTV TEGVGINRIT
ENFKVLLDRP GMLTGAVKVS DAEAVAMSRF VARHDGLFIG SSSAVNLVSA VRVAQSLGPG
HCICTIACDS GLRHMTKFWD DEYLAKIDLT SHDVASADSL SFLDDDTVVT AARC
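The protein sequence corresponds to a translation of the gene sequence. The sketch below translates a block-formatted DNA string using the standard genetic code. This is an assumption, because the Translation table field of the record is empty. The `translate` helper is illustrative and is not the database's own method.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw_dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = "".join(raw_dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = ("ATGGTGGCGT GCGTCGCGCT CGCCGTCGTC "
              "GTCGTCGCGA GGGCGTCGGC GCCGGTGCGC")
print(translate(first_line))  # MVACVALAVVVVARASAPVR
```

The output for the first line of the gene sequence matches the first 20 residues of the protein sequence, which is consistent with the standard code being the one used here.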