Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_03051 |
Symbol | |
ID | 4778512 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 320023 |
End bp | 321327 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640085807 |
Product | putative cysteine desulfurase or selenocysteine lyase |
Protein accession | YP_001016323 |
Protein GI | 124022016 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.512887 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTCTT CCGTTAGTTC AGATCAGAGC ATGCCGTTGG CGATGGCTGC TGTAGAAGCT TCAGCCAACC TGGCAGATCT CACGCGAGCG GATTTCCCGC TACTAGGCCA GACCGCCTGC TTAGGTCAGC CCTTGATTTA CATGGACCAT GCGGCGACAA GTCAGAAGCC ACGGCAGGTG CTGGATGCCT TACAGCATTA CTACGACCAT GACAACGCCA ATGTGCACCG TGGTGCCCAT CAGTTGAGTG TTCGGGCTAC CGAGGATTTT GAACGAGCAC GTCAGAAGGT GGCTGATTTC ATCGCTGCCT CAAGCGCGCG GGAAATCGTT TTCACCAGGA ATGCAAGTGA GGCGATCAAC CTGGTGGCTC GCAGTTGGGG TGATGCCAAC CTCCATGAAG GTGACGAGGT GCTGCTTACC TTGATGGAGC ATCACAGCAA CATTGTCCCT TGGCAGATGC TTGCCAAGCG AACAGGTTGC GTGCTGCGCT TTGTCGACCT CACCGATTGC GGGGAGCTTG ATTTAAATGA TCTTCGGCAA AAGCTCTCAG AGCGCACTCG TTTGGTCAGT CTGGCTCACT TGAGCAATGT GCTGGGCTGT TTTAACCCTA TCTCTGAGGT CACTGCAGAG GCTCATCGCT TCGGTGCTCT GGTGTTGCTG GATGCTTGCC AGAGCTTGCC ACATATGCCT GTTGATGTGT CCCGGCTTGG ATGTGACTTT CTCGTGGGTT CTTCCCACAA ATTGTGTGGT CCTACCGGGA TGGGTTTTCT TTGGGCTCGA GAGGAGTTGC TTGATGCCAT GCCGCCTTTC CTCGGCGGTG GCGAAATGAT TCAGGATGTC TATCTCGACC ACAGCAGCTG GGCTGATCTG CCTTACAAGT TTGAAGCAGG TACCCCTGCT ATTGGGGAAG CTATTGGTAT GGGCGTTGCT CTCGACTACC TGAACCAGGT TGGCTTAGAT CGTATTCACG CTTGGGAGCA GCAGCTCACG CTGCAATTGT TTGATCGCCT CCAAGGCATC GATGGGTTGA CGATTCTGGG CCCAACTCCT CAGCAGGAGC CTGATCGGGC GGTCCTGGCG GCTTTCACAG TGGATGGCTT GCATCCCAAT GATATTGGTG CCTTGCTTGA TTCAGCAGGG ATCTGTATTC GTAGTGGCCA CCACTGCACC CAGCCTTTGC ATCGTCACTA TGGGATCCCT GGATCAGCTC GTGCCAGCTT GAGCTTCACC AATACACCAG AAGAAGTCGA TCGTTTTGCT GAGGAATTGG TTTCGACGAT CGGCTTCTTA AGAGAGCACA GCTAG
|
Protein sequence | MTSSVSSDQS MPLAMAAVEA SANLADLTRA DFPLLGQTAC LGQPLIYMDH AATSQKPRQV LDALQHYYDH DNANVHRGAH QLSVRATEDF ERARQKVADF IAASSAREIV FTRNASEAIN LVARSWGDAN LHEGDEVLLT LMEHHSNIVP WQMLAKRTGC VLRFVDLTDC GELDLNDLRQ KLSERTRLVS LAHLSNVLGC FNPISEVTAE AHRFGALVLL DACQSLPHMP VDVSRLGCDF LVGSSHKLCG PTGMGFLWAR EELLDAMPPF LGGGEMIQDV YLDHSSWADL PYKFEAGTPA IGEAIGMGVA LDYLNQVGLD RIHAWEQQLT LQLFDRLQGI DGLTILGPTP QQEPDRAVLA AFTVDGLHPN DIGALLDSAG ICIRSGHHCT QPLHRHYGIP GSARASLSFT NTPEEVDRFA EELVSTIGFL REHS
|
| |