Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_00431 |
Symbol | dhsS |
ID | 4778242 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 42440 |
End bp | 43588 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640085543 |
Product | soluble hydrogenase small subunit |
Protein accession | YP_001016065 |
Protein GI | 124021758 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0075] Serine-pyruvate aminotransferase/archaeal aspartate aminotransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCAGGACA AACTCACCCT GATGATCCCG GGTCCTACAC CGGTTCCCGA AACAGTACTC AAAGCAATGG GTCGCCACCC CATCGGCCAT CGCAGTGGAG AGTTTCAAGC CGTCGTTAGA CATACCACCG AGCAGCTGCG CTGGCTGCAT CAGACCAAAT CAGATGTTCT AGTCATCACT GGCAGCGGCA CCGCTGCGAT GGAAGCAGGA ATCATCAATA CCCTCAGCTA CGGCGACAAG GTTCTCTGCG GCGACAACGG CAAGTTTGGA CAGCGCTGGG TAAAGCTGGC TAAGGCTTAC GGCCTGGACG TACAAGTGAT AAAAGCAGAC TGGGGACAAC CGTTAGACCC AGAAGCCTTC AAAAGAGCAC TGGAAGCAGA CAATGGCAAA ACAATCAAAG CGGTAATTCT TACCCACTCG GAAACCTCAA CCGGGGTCAT TAACGACCTT GAAACCATCA GCAAATATGT CCGCACTCAC GGCAAAGCAC TAACCATCGC TGATTGCGTG ACAAGCCTTG GGGCCTGCAA TGTACCCATG GATTCTTGGG GTCTAGATGT GGTTGCCTCC GGTTCCCAAA AGGGATACAT GATGCCTCCA GGTCTCAGCT TCGTGGCCAT GAGTGAACGA GCCTGGCAAG CACATCAACA ATCGGATCTA CCGAAGTTTT ATCTCGATCT GGGGCCTTAC CGAAAAACTG CCGCTCAAGA CAGCAATCCA TTCACCCCTG CTGTGAATCT CTACTTCGCA CTGGAATCTG CGCTGGGAAT GATGCAGTCA GAAGGACTAG AAGCCATCTT TGACCGTCAT GCCCGCCACC GCGCAGCCGC TCAGGCCGGC ATGAAGGCCA TCTGCCTGCC CCTGTATGCA GCTGAAGGAC ACGGCAGCCC AGCGATCACC GCAGTCGCAC CTGAGGGAGT TGATGCAGAG CAACTGCGCA AAACTGTCAA AGAGAAATTC GACATCCTGC TAGCAGGTGG ACAGGATCAT CTAAAAGGGA AGGTCTTCCG TATCGGCCAT CTTGGATTCG TATGTGATAG AGATATCCTC ACTGCCATAG CTGCCATCGA ATCCACCTTG CAATCTCTTG GCTTGCATAA AGGCAACATG GGAGATGGCC TGGCTGCTGC TGCAGCAATT CTGAGATAA
|
Protein sequence | MQDKLTLMIP GPTPVPETVL KAMGRHPIGH RSGEFQAVVR HTTEQLRWLH QTKSDVLVIT GSGTAAMEAG IINTLSYGDK VLCGDNGKFG QRWVKLAKAY GLDVQVIKAD WGQPLDPEAF KRALEADNGK TIKAVILTHS ETSTGVINDL ETISKYVRTH GKALTIADCV TSLGACNVPM DSWGLDVVAS GSQKGYMMPP GLSFVAMSER AWQAHQQSDL PKFYLDLGPY RKTAAQDSNP FTPAVNLYFA LESALGMMQS EGLEAIFDRH ARHRAAAQAG MKAICLPLYA AEGHGSPAIT AVAPEGVDAE QLRKTVKEKF DILLAGGQDH LKGKVFRIGH LGFVCDRDIL TAIAAIESTL QSLGLHKGNM GDGLAAAAAI LR
|
| |