Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_00331 |
Symbol | dhsS |
ID | 4780297 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 32708 |
End bp | 33862 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640083296 |
Product | soluble hydrogenase small subunit |
Protein accession | YP_001013862 |
Protein GI | 124024746 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0075] Serine-pyruvate aminotransferase/archaeal aspartate aminotransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.417582 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGACA AACTCAATTT AATGATTCCT GGACCAACAC CGGTCCCAGA AAATGTTTTG AGTTCTATGA GTAAACACCC CATTGGTCAT AGAAGTGGAG ATTTTCAAAA AATTGTTCAA AAAACAACTG AACAACTCAA ATGGCTTCAC CAAACAACTG CAGACGTCCT AACAATTACA GGAAGTGGAA CAGCTGCAAT GGAGGCAGGA ATAATTAATA CATTAAGCAA AGGTGATCAA GTCATTTGCG GCGACAATGG AAAATTTGGT GAAAGATGGG TAAAAGTAGC AAGGGCATAT GGATTAGATG TAAAAGTTGT AAAAGCTGAT TGGGGAACCC CTCTTGATCC AAATCAATTT AAGAGGATTC TTGAAGAAGA CACCAATGAA AAAATTAAAG CGGTTATTTT AACTCATTCA GAAACTTCAA CAGGAGTGAT TAATGATCTT AAATCGATTA ATAACGAAGT AAAAAATCAT AGTAAAGCTA TTACAATTGC GGATTGTGTA ACAAGTCTTG GTGCATGTAA CATCCCAATG GATGAATGGG GAATTGATGT AATAGCTTCA GGCTCTCAAA AAGGTTATAT GATTCCACCT GGCCTGAGTT TTGTTGCTAT GAGCAAAAGA GCATGGGAAG CAAATAATCA ATCAAATTTA CCTAAGTTTT ACTTAGATCT AAAACAATAT TTAAAGACAG TTAATCAAAA TAGTAATCCT TTTACGCCTG CAATAAATTT ATACTTTGCT TTAGAAGCTT CACTAACAAT GATGCAAAAA GAAGGGTTAA ATAATATATT TGCCCGCCAT GCTCGTCATC AAAAAGCAAC GCAAGAAGGA ATAAAAGCAA TGGGTTTGAA TTTATTTACA AAAGAAAATT TTGGAAGTCC AGCAATAACA GCTGTTAAGC CTGAAAATAT TGACGCTGAA AGTATAAGAA AGGCAATAAA AAATGACTTC GACATACTCC TTGCTGGAGG TCAAGATCAT TTAAAAGGAA AAATCTTTAG AATTGGACAT TTAGGATTTG TCAATAATAG AGACATTATT AGTGTCATAT CAGCTTTAGA AAGCACTCTT GATAAAATGG GCAAACTAAA CGTCCCCATT GGCCAAGGAA TTGCAAAAAC AATTTCAGTA CTAAATAACG AATAA
|
Protein sequence | MQDKLNLMIP GPTPVPENVL SSMSKHPIGH RSGDFQKIVQ KTTEQLKWLH QTTADVLTIT GSGTAAMEAG IINTLSKGDQ VICGDNGKFG ERWVKVARAY GLDVKVVKAD WGTPLDPNQF KRILEEDTNE KIKAVILTHS ETSTGVINDL KSINNEVKNH SKAITIADCV TSLGACNIPM DEWGIDVIAS GSQKGYMIPP GLSFVAMSKR AWEANNQSNL PKFYLDLKQY LKTVNQNSNP FTPAINLYFA LEASLTMMQK EGLNNIFARH ARHQKATQEG IKAMGLNLFT KENFGSPAIT AVKPENIDAE SIRKAIKNDF DILLAGGQDH LKGKIFRIGH LGFVNNRDII SVISALESTL DKMGKLNVPI GQGIAKTISV LNNE
|
| |