Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_04571 |
Symbol | |
ID | 4717155 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 397385 |
End bp | 398371 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640078169 |
Product | O-acetylserine (thiol)-lyase A |
Protein accession | YP_001008852 |
Protein GI | 123967994 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0031] Cysteine synthase |
TIGRFAM ID | [TIGR01136] cysteine synthases [TIGR01139] cysteine synthase A |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.754639 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAAAA TTTATGAGGA CAACAGTTTT GCTATTGGAA ACACTCCATT AGTAAAATTA AAATCAGTTA CTAAAAACGC GAAAGCTACA GTACTTGCAA AAATTGAAGG TAGAAACCCC GCTTATAGTG TCAAATGTAG GATCGGCGCA AACATGATCT GGGATGCCGA GAAAAGCGGG AAACTTACAA AAGACAAAAC TATTGTTGAG CCAACTTCTG GAAATACAGG AATTGCTCTA GCTTTTACTG CTTCAGCAAG AGGCTATAAG CTGATCCTTA CAATGCCAGA ATCCATGTCA ATTGAAAGAA GAAGGGTTAT GGCAGTGTTG GGTGCTGAAA TTGTTTTAAC AGAGGCATCT AAAGGTATGC CTGGAGCAAT AGCTAAGGCT AAAGAAATTG CAGAAAGTAA TCCTTCTCAA TATTTCATGC CAGGTCAATT TGATAATCCA GCAAACCCTG AAATTCATTT CAAAACTACT GGACCAGAAA TCTGGGATGA TTGCGATGGT GAAATTGATG TTTTAGTTGC AGGTGTTGGA ACTGGCGGCA CAATTACAGG AGTTTCAAGA TACATTAAGC AAGAGAAGGG AAAGAATATT ACTTCTGTCG CTGTAGAACC ATCACATAGT CCTGTTATTA CACAGACGAT GAATGGTGAA GAGGTTAAAT CTGGACCTCA TAAAATTCAA GGAATTGGAG CAGGATTTAT TCCTAAGAAC CTTGACTTAT CAATTGTTGA TAAGGTTGAA CAAGTAACAA ATGAAGAGTC AATCGAGATG GCTCTTAGAT TAGCAAAAGA AGAAGGTCTA TTAGTGGGAA TATCTTGTGG AGCTGCTGCT GCTGCTGCTG TTAGATTAGC TGAACAAGAT GAATATGCTG GGAAGACAAT TGTAGTTGTT CTTCCTGATT TGGCAGAGAG GTATTTATCA TCAATTATGT TTACTGAAGT TCCAAGCGGA ATCATTCAAG AACCAGTCAA AGCCTAA
|
Protein sequence | MAKIYEDNSF AIGNTPLVKL KSVTKNAKAT VLAKIEGRNP AYSVKCRIGA NMIWDAEKSG KLTKDKTIVE PTSGNTGIAL AFTASARGYK LILTMPESMS IERRRVMAVL GAEIVLTEAS KGMPGAIAKA KEIAESNPSQ YFMPGQFDNP ANPEIHFKTT GPEIWDDCDG EIDVLVAGVG TGGTITGVSR YIKQEKGKNI TSVAVEPSHS PVITQTMNGE EVKSGPHKIQ GIGAGFIPKN LDLSIVDKVE QVTNEESIEM ALRLAKEEGL LVGISCGAAA AAAVRLAEQD EYAGKTIVVV LPDLAERYLS SIMFTEVPSG IIQEPVKA
|
| |