Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_00351 |
Symbol | dhsS |
ID | 5730643 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 33241 |
End bp | 34398 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 641284377 |
Product | soluble hydrogenase small subunit |
Protein accession | YP_001549920 |
Protein GI | 159902576 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0075] Serine-pyruvate aminotransferase/archaeal aspartate aminotransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.269173 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGAAA AACTTGCTCT AATGATCCCT GGACCAACCC CAGTGCCAGA GAGAGTTCTA AAAGCTCTAA GTCAACATCC CATTGGTCAC CGCACTCCAG AGTTCCAAGA AATCGTCAAA AAAACAACTC AGCTACTGCA ATGGCTGCAT CAAACAGAAG GGGATGTTTT AACAATTACT GGCAGTGGGA CTGCTGCCAT GGAAGCAGGA ATCATTAATA CCTTAAAAAA AGGGGATAAA GTTATTTGTG GAGAAAATGG AAAATTTGGT GAAAGATGGG TGAAAATTGC CAAGGCCTAT GGACTCAATG TTGAAATTAT TAAGTCCAAT TGGGGTGAAC CATTAGAACC TGAAAAGTTC AGAAATATAC TTCAAGCTGA TAATGAAATT CGTGCCGTAA TACTTACCCA CTCAGAAACA TCTACAGGAG TTATTAACAA TCTAGAGGCA ATTAGTAAAG AGGTTAGAAA ACACGAAAAA GCAATCACTA TTGCAGACTG TGTTACTAGT TTAGGTGCAT GCAATGTACC TATGGATGAA TGGGGCATAG ACGTTCTTGC ATCTGGCTCT CAAAAAGGAT ACATGATGCC TCCTGGCCTC AGTTTTGTTG CAATGAATCA AAGAGCTTGG AAGGCAAGTG AACGCTCAGA TTTACCAAGT TTTTATTTAA ACCTGAAGTC ATACAAAAAA ACTAGTGATA AAAATAGCAA TCCTTTTACG CCTAGTGTAA ATCTATATTT TGCATTAGAA GAAGCGTTAA ATATGATGAA AGAGGAAGGT TTAGAAAAGA TATTTAGTCG TCATAATAGA CATAAAGAAG CAACCCAAAA AGCAATGGAA GCTATTGGTT TGAAATTATT TGCAGCCCCT GGGTATGGCA GTCCTTCCAT CACTGCAGTA GAGCCTAAAG ATATTGATGC TGATCTAATA AGAAAAGTAG TAAAAGAAAA TTTTGATATA TTACTTGCAG GTGGTCAAGA TCACTTAAAA GGAAAAGTCT TTCGCATAGG TCATCTTGGA TTTGTCAATG ATCGTGACAT TATTACAGCT ATAGCATCTA TAGAATCTGC TCTTAATCAA TTAGGGGCTT TAAAAGAGCC AATTGGTACT GGAGTAGCTA CCGCTTCAAA AATACTTTTT AAAGAAAATA GAGTATGA
|
Protein sequence | MKEKLALMIP GPTPVPERVL KALSQHPIGH RTPEFQEIVK KTTQLLQWLH QTEGDVLTIT GSGTAAMEAG IINTLKKGDK VICGENGKFG ERWVKIAKAY GLNVEIIKSN WGEPLEPEKF RNILQADNEI RAVILTHSET STGVINNLEA ISKEVRKHEK AITIADCVTS LGACNVPMDE WGIDVLASGS QKGYMMPPGL SFVAMNQRAW KASERSDLPS FYLNLKSYKK TSDKNSNPFT PSVNLYFALE EALNMMKEEG LEKIFSRHNR HKEATQKAME AIGLKLFAAP GYGSPSITAV EPKDIDADLI RKVVKENFDI LLAGGQDHLK GKVFRIGHLG FVNDRDIITA IASIESALNQ LGALKEPIGT GVATASKILF KENRV
|
| |