Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_21201 |
Symbol | |
ID | 4777085 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 1880227 |
End bp | 1881213 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640087628 |
Product | O-acetylserine (thiol)-lyase A |
Protein accession | YP_001018120 |
Protein GI | 124023813 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0031] Cysteine synthase |
TIGRFAM ID | [TIGR01136] cysteine synthases [TIGR01139] cysteine synthase A |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.872322 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCGCA TTTACGAAGA CAACAGCCAG GCAATCGGCA ACACTCCACT GGTCAAACTG CACGCCGTCA GCAAAAACGC CAAAGCAACT GTTCTCGCAA AGATTGAAGG CCGCAATCCC GCCTACAGCG TGAAATGCCG CATTGGCGCC AACATGATCT GGGATGCCGA AAAGCGCGGC TTACTGAGCA AAGAACGCAC GATCATTGAG CCCACCTCTG GCAACACAGG TATCGCTCTG GCCTTCACAG CAGCAGCCCG CGGCTACAAA CTGATCCTCA CCATGCCCGA ATCAATGTCG CTAGAACGGC GGCGGGTGAT GGCAGTGCTT GGTGCAGAAC TCATCCTCAC AGAAGCAGCC AAGGGGATGC CAGGCGCCAT TGCCAAGGCA AAAGAGATCG CCGAAAGCGA TCCCCAGAAA TACTTCATGC CCGGTCAGTT TGAGAATCCG GCCAATCCCG ACATCCACAG CAAAACAACC GGTCCAGAAA TCTGGAACGA CTGTGATGGC GCCATTGATG TGCTCGTGGC TGGCGTAGGC ACCGGCGGCA CGATCACCGG TGTGTCCCGC TACATCAAGC AGGAGAAAGG CAAATCCATC GTTTCTGTAG CCGTCGAACC AACCCATAGC CCGGTCATCA GCCAAACCCT CAACGGGGAA GACGTCAAAC CTGGTCCTCA CAAAATCCAG GGCATTGGAG CCGGCTTTAT TCCCAAAAAC CTCGATCTTG CGTTAGTGGA TCGTGTTGAG CAGGTCAGCA ATGATGAATC CATCGCCATG GCCCTGCGCT TGGCAAATGA AGAAGGTCTA CTGGTGGGCA TCTCCAGTGG TGCGGCAGTT GCAGCGGCTA TTCGTCTGGC AGAACAACCC GAATTTGCAG GCAAAACCAT CGTGGTAGTT CTGCCAGACA TGGCCGAGCG TTATCTCTCC TCAGTGATGT TTGAGAGTGT GCCCACAGGC ATCATTCAGG ACCCTGTAGC TGCCTAA
|
Protein sequence | MSRIYEDNSQ AIGNTPLVKL HAVSKNAKAT VLAKIEGRNP AYSVKCRIGA NMIWDAEKRG LLSKERTIIE PTSGNTGIAL AFTAAARGYK LILTMPESMS LERRRVMAVL GAELILTEAA KGMPGAIAKA KEIAESDPQK YFMPGQFENP ANPDIHSKTT GPEIWNDCDG AIDVLVAGVG TGGTITGVSR YIKQEKGKSI VSVAVEPTHS PVISQTLNGE DVKPGPHKIQ GIGAGFIPKN LDLALVDRVE QVSNDESIAM ALRLANEEGL LVGISSGAAV AAAIRLAEQP EFAGKTIVVV LPDMAERYLS SVMFESVPTG IIQDPVAA
|
| |