Gene OSTLU_33390 details

Gene Information

Locus tag: OSTLU_33390
Symbol: CGS1
ID: 5003536
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009363
Strand:
Start bp: 139529
End bp: 140963
Gene Length: 1435 bp
Protein Length: 433 aa
Translation table:
GC content: 57%
IMG OID: 640418957
Product: cystathione gamma synthase
Protein accession: XP_001419722
Protein GI: 145350669
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.0131523
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGCGG CGATGAGCGC GAAGGCGCGA ACGGTGGTGA AGCTGAGCGC GACGACGCGG 
AAGAGCCGAG GAGGGACGGG ACGACGAGGC GCGACGACGC GCGCGGCGAA CGGCGAGAAA
CCTTGGGGCG ACTCGACGCA GTGCGTGCAA AACGGTGCGT CGAACGACGA GGAGCGACGC
GCGATGGACG CGCGATGGTG CATAAAAATG TGATGCGCGA GTGAGGAGAC GCGAGGGGGG
AGACGCGCGG ACTGACGGTG ATGATCGCGG TTGTTACTTT GACGCAGGCG AACGCAAGGA
GGGGCGACCG AGGGTATCCG ACTCGTTGAC GACCCCGATC GTGTGCACGT CGACGTACCA
CTTCAAGGAC ACGGAAGAGT TGATCGCGTA TCAAGAGGGA CGATATGGGA GCTTTGAGTA
CGGTCGATAC GGGAATCCGA CGACCAAGGC GTGCGAAGAG AAGATTCGCC AACTCGAGGG
CGGAGAAGAC GCTCTGTTGA GCGCGTCCGG CATGTGCACG GCGACGACTA TGCTGCTCGC
GCTCGTCCCA GCCGGTGGAC ACATTGTCAC CACCACGGAC TGCTACAGAC GCACTCGACA
ATTCATTCAA ACTTTCTTGC CGAAGATGGG CATCACGTCC ACGGTGCTCG ATCCGTCCGA
TTACGACGGC TTGGAAAAGG CGCTCAAGGA GAACAAGTGC TCGCTGTACT TTTCCGAATC
ACCAACGAAC CCGTACCTTC GTTGCGTCGA TATCGAACGA ATCGTGAACC TGTGCAAGCC
CACGGGTTGC CTCGTGTGCA TCGATGGTAC TTTTGCCACG CCGTGCAACT CTCGCGCGCT
CGACTTTGGC GCCGACTTGG TGATTCACAG CGCGTCGAAA TTCATGGGCG GGCACAACGA
CGTTCTCGCC GGTGTCATCG TCGGTAAGAA GGAAGCAGTC GCCGCGTGCA GACAATTTCA
TAATATTTTG GGTGGTGTCA TCGATCCTCA CGCTGCTTAC TTGGTTCTTC GAGGTTTGAA
GACCTTGTCG CTTCGAGTCG AGCGCCACAA CGACAGCGCG ATGAGACTGG CTCAATACCT
CGAGAAGCAT CCAAAGATTG ACAAAGTGCA CTATCCGGGA TTACCGAGCC ACTGCGACCA
CGAAGTTGCG AAAAAATACA TGAAAGGTTT CGGTGGGGTG GTATCTTTCG AGGTCAAGGG
TGATCTTTGG GCGACTGCAA AGTTTATCGA TAGTTGCGAG CTTCCGTACA TCGCGCCGTC
TCTCGGTGGC GTGGAGTCGC TGATTGAACA ACCGACTGTC GTTTCTTATT GGGATCAGGG
TCCTGAAAAG CGCGCTGAGA TTGGCATTAA GGATAACCTC GTTCGATTTT CGACTGGGAT
TGAGGATTAC GTCGACATCG AAGCGGATAT TGCTCAAGCC CTGGAGAAAA TTTAA
 
Protein sequence
MSAAMSAKAR TVVKLSATTR KSRGGTGRRG ATTRAANGEK PWGDSTQCVQ NGERKEGRPR 
VSDSLTTPIV CTSTYHFKDT EELIAYQEGR YGSFEYGRYG NPTTKACEEK IRQLEGGEDA
LLSASGMCTA TTMLLALVPA GGHIVTTTDC YRRTRQFIQT FLPKMGITST VLDPSDYDGL
EKALKENKCS LYFSESPTNP YLRCVDIERI VNLCKPTGCL VCIDGTFATP CNSRALDFGA
DLVIHSASKF MGGHNDVLAG VIVGKKEAVA ACRQFHNILG GVIDPHAAYL VLRGLKTLSL
RVERHNDSAM RLAQYLEKHP KIDKVHYPGL PSHCDHEVAK KYMKGFGGVV SFEVKGDLWA
TAKFIDSCEL PYIAPSLGGV ESLIEQPTVV SYWDQGPEKR AEIGIKDNLV RFSTGIEDYV
DIEADIAQAL EKI
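
The length and GC fields in this record can be cross-checked against the coordinates and the block-formatted sequences. The sketch below is illustrative only: the helper names (gene_length, clean_sequence, gc_fraction) are hypothetical and not part of any database tool, and it assumes the Start bp and End bp values are 1-based and inclusive, which is consistent with the reported Gene Length of 1435 bp.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by dropping spaces and line breaks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already-cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the Gene Information fields above.
    print(gene_length(139529, 140963))  # 1435

    # First two 10-base blocks of the gene sequence, as formatted on the page.
    sample = clean_sequence("ATGAGCGCGG CGATGAGCGC")
    print(len(sample), round(gc_fraction(sample), 2))
```

Passing the full Gene sequence block through clean_sequence and taking its length should reproduce the Gene Length field, and gc_fraction on the same string gives a value to compare with the reported GC content.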