Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_26311 |
Symbol | |
ID | 4777222 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 2321992 |
End bp | 2322978 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640088153 |
Product | O-acetylserine (thiol)-lyase A |
Protein accession | YP_001018626 |
Protein GI | 124024319 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0031] Cysteine synthase |
TIGRFAM ID | [TIGR01136] cysteine synthases [TIGR01139] cysteine synthase A |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAATGA CATCGCCCAT GCCTATCGCT GCTGACATTA CGGCCTTAGT TGGGTGCACT CCGTTGGTGC GCCTTAATCG CTTGCCACAG GTCAGTGGTT GCCATGCGGA GGTGCTGGCC AAGTTGGAAA GCTTCAACCC CACTGCCTCG GTTAAAGACC GCATCGCGGG GGCCATGATC CATGCAGCGG AGGAAGCTGG AACCATTGCT CCGGGGCGCA CCGTGCTGGT GGAACCCACC AGTGGCAACA CTGGGATCGC CTTGGCCATG GTGGCTGCCG CCCGAGGCTA TCGCCTCATT CTCACCATGC CGGACACGAT GAGTACAGAG CGTCGCGCCA TGCTTAGGGC CTATGGCGCT GAACTGCAAC TCACCCCAGG CAGCGATGGA ATGACTGGTT CCATTGAGTT GGCAAAGGAG TTGGTGGCAA CCATCCCTGA GGCTTATCTG CTGCAGCAGT TTGATAATCC TGCGAATCCG CAGGTGCATG AGCGCACAAC CGCAGAGGAG ATCTGGAATG ATTGCGAGGG GCGGTTGGAT GGTCTGATTA CTGGAGTAGG AACGGGGGGC ACACTCACAG GTTGTGCGCG TTTGCTCAAG CAACGCAATC CAAGCCTGCG AGTGTTTGCG GTAGAGCCAG CGTTAAGCCC TGTTTTGGCA GGTGGGAGCC CTGCTCCCCA TCGCATTCAG GGGATTGGTG CTGGCTTCAT TCCTAGCGTT CTTGATCTCT CTTTGATCGA TGAGATCCTG CCGGTCTCTG ATGACGACGC GATGGAGATG GGTCGGCGCT TGGCCAGGGA GGAGGGCTTG CTAAGTGGGG TAAGCAGTGG CGCTGCTGTT GCGGCTGCAT TGCAAGTGGG TAGTAGGGCT GAGATGGTGG GCAAGCGTCT TGTTGTTGTC TTGGCAAGTT TTGGTGAGCG TTATCTATCA ACGCCGATGT TTAGCGCTGC TTCGGCGGTG CCGGCGCAGC GTGATGGATA CCTTTGA
|
Protein sequence | MTMTSPMPIA ADITALVGCT PLVRLNRLPQ VSGCHAEVLA KLESFNPTAS VKDRIAGAMI HAAEEAGTIA PGRTVLVEPT SGNTGIALAM VAAARGYRLI LTMPDTMSTE RRAMLRAYGA ELQLTPGSDG MTGSIELAKE LVATIPEAYL LQQFDNPANP QVHERTTAEE IWNDCEGRLD GLITGVGTGG TLTGCARLLK QRNPSLRVFA VEPALSPVLA GGSPAPHRIQ GIGAGFIPSV LDLSLIDEIL PVSDDDAMEM GRRLAREEGL LSGVSSGAAV AAALQVGSRA EMVGKRLVVV LASFGERYLS TPMFSAASAV PAQRDGYL
|
| |