Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_21101 |
Symbol | |
ID | 4777062 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 1871058 |
End bp | 1872344 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640087618 |
Product | putative L-cysteine/cystine lyase |
Protein accession | YP_001018110 |
Protein GI | 124023803 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.601707 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTATG TGCTGCTGCA TCGGCTAGTG AAGCGAGCAG CAGATAAAAC GGCAGGCGAT CATGAACACA GGCCCCGACA ACCCATGCTT AGAGACCTCT GCCCAGCACT CGCTAACAAG ACCTACTTCA ACTACGGCGG CCAGGGCCCC TTACCCACGC CTTCCCTTGA AGCGATTACT GCAAGCTGGC AGAACATTCA AGAACTGGGT CCCTTCACCA ACAGCGTGTG GCCCTATGTG GCCAAAGAAG TCCAAGCCAC TCGTTCACAC TTAGCCAAGC TCTGCGGCGT TGCCCCCCAT CGCATCGCGC TCACTGAAAA TGTCACCAGT GGCTGTGTCT TGCCGCTCTG GGGCCTGCCT TTCTCAGAAG GCGATCGCTT GCTCATTAGC GACTGTGAAC ATCCGGGAAT CGTTGCTGCA TGCATCGAAC TTGCCCGCCG ACAACACCTG GAAATAGACA CGCTGCCTGT AAAGAACTTG CGTCATGGTG CTAACGATCA AACGACTAGC GACAGCCTTG TGCTGGAAAG ACTTGAGCAA CACCTCAAAC CAAGCACAAG GCTGGTAGTG CTCTCCCACC TGCTATGGAA TACAGGCCAG GTGATGCCGA TCTCGGCTGT TTCAACAGCC CTCAGCCATC ATCCACAACA GCCTTTCTTG CTCGTGGATG CGGCACAAAG CTTCGCCCAA ATGCCTATAC AGGAAGCCGC TGCCGCTTCA GACATCTATG CCTTTACGGG GCACAAATGG GCCTGCGGGC CTGAAGGGCT TGGTGGAGTC GCCCTCTCAG AACGGGTGCT CGCCGAAGCG AATCCCACCC TGATTGGCTG GCGCAGCTTG CAGAACGAAG GCCATCTTCA AAGCAACCTG GACGAACTCT TCCATCACGA CAGTCGACGC TTTGAGGTAG CAACCTCCTG CGTGCCGCTG ATGGCGGGCC TGCGCTGTTC GTTGGAGCTG CTCGAAGCCG CAGGCTCGCA GCAGGAACGA CTGAGCCAGA TTCGCCAAGG CAGCCGACAC TTATGGAATC AACTACAACA GCTCACAGGC GTCGAAACAC TGCTCAACAG TGCCCCAGCA GCTGGTCTTG TCAGCTTTGA GTTACCCCAA GGCCCCCCAG CTCCTGATGT GGTTAAACAA TTAGGAAACG ATCAGCTCTG GATTCGGCAT CTAGAAGATC CAATCTGCCT ACGTGCCTGC GTGCACATCA CCACTGAAGA GCAAGAACTC AACACACTTA CAACCTCACT CAAGCAGCTA GCTAGCAAAG GAGAGCCAAG CAATTAA
|
Protein sequence | MTYVLLHRLV KRAADKTAGD HEHRPRQPML RDLCPALANK TYFNYGGQGP LPTPSLEAIT ASWQNIQELG PFTNSVWPYV AKEVQATRSH LAKLCGVAPH RIALTENVTS GCVLPLWGLP FSEGDRLLIS DCEHPGIVAA CIELARRQHL EIDTLPVKNL RHGANDQTTS DSLVLERLEQ HLKPSTRLVV LSHLLWNTGQ VMPISAVSTA LSHHPQQPFL LVDAAQSFAQ MPIQEAAAAS DIYAFTGHKW ACGPEGLGGV ALSERVLAEA NPTLIGWRSL QNEGHLQSNL DELFHHDSRR FEVATSCVPL MAGLRCSLEL LEAAGSQQER LSQIRQGSRH LWNQLQQLTG VETLLNSAPA AGLVSFELPQ GPPAPDVVKQ LGNDQLWIRH LEDPICLRAC VHITTEEQEL NTLTTSLKQL ASKGEPSN
|
| |