Gene P9303_21201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_21201 
Symbol 
ID4777085 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1880227 
End bp1881213 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content55% 
IMG OID640087628 
ProductO-acetylserine (thiol)-lyase A 
Protein accessionYP_001018120 
Protein GI124023813 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0031] Cysteine synthase 
TIGRFAM ID[TIGR01136] cysteine synthases
[TIGR01139] cysteine synthase A 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.872322 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCGCA TTTACGAAGA CAACAGCCAG GCAATCGGCA ACACTCCACT GGTCAAACTG 
CACGCCGTCA GCAAAAACGC CAAAGCAACT GTTCTCGCAA AGATTGAAGG CCGCAATCCC
GCCTACAGCG TGAAATGCCG CATTGGCGCC AACATGATCT GGGATGCCGA AAAGCGCGGC
TTACTGAGCA AAGAACGCAC GATCATTGAG CCCACCTCTG GCAACACAGG TATCGCTCTG
GCCTTCACAG CAGCAGCCCG CGGCTACAAA CTGATCCTCA CCATGCCCGA ATCAATGTCG
CTAGAACGGC GGCGGGTGAT GGCAGTGCTT GGTGCAGAAC TCATCCTCAC AGAAGCAGCC
AAGGGGATGC CAGGCGCCAT TGCCAAGGCA AAAGAGATCG CCGAAAGCGA TCCCCAGAAA
TACTTCATGC CCGGTCAGTT TGAGAATCCG GCCAATCCCG ACATCCACAG CAAAACAACC
GGTCCAGAAA TCTGGAACGA CTGTGATGGC GCCATTGATG TGCTCGTGGC TGGCGTAGGC
ACCGGCGGCA CGATCACCGG TGTGTCCCGC TACATCAAGC AGGAGAAAGG CAAATCCATC
GTTTCTGTAG CCGTCGAACC AACCCATAGC CCGGTCATCA GCCAAACCCT CAACGGGGAA
GACGTCAAAC CTGGTCCTCA CAAAATCCAG GGCATTGGAG CCGGCTTTAT TCCCAAAAAC
CTCGATCTTG CGTTAGTGGA TCGTGTTGAG CAGGTCAGCA ATGATGAATC CATCGCCATG
GCCCTGCGCT TGGCAAATGA AGAAGGTCTA CTGGTGGGCA TCTCCAGTGG TGCGGCAGTT
GCAGCGGCTA TTCGTCTGGC AGAACAACCC GAATTTGCAG GCAAAACCAT CGTGGTAGTT
CTGCCAGACA TGGCCGAGCG TTATCTCTCC TCAGTGATGT TTGAGAGTGT GCCCACAGGC
ATCATTCAGG ACCCTGTAGC TGCCTAA
 
Protein sequence
MSRIYEDNSQ AIGNTPLVKL HAVSKNAKAT VLAKIEGRNP AYSVKCRIGA NMIWDAEKRG 
LLSKERTIIE PTSGNTGIAL AFTAAARGYK LILTMPESMS LERRRVMAVL GAELILTEAA
KGMPGAIAKA KEIAESDPQK YFMPGQFENP ANPDIHSKTT GPEIWNDCDG AIDVLVAGVG
TGGTITGVSR YIKQEKGKSI VSVAVEPTHS PVISQTLNGE DVKPGPHKIQ GIGAGFIPKN
LDLALVDRVE QVSNDESIAM ALRLANEEGL LVGISSGAAV AAAIRLAEQP EFAGKTIVVV
LPDMAERYLS SVMFESVPTG IIQDPVAA