Gene P9211_00351 details


Gene Information

Locus tag: P9211_00351
Symbol: dhsS
ID: 5730643
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 33241
End bp: 34398
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 37%
IMG OID: 641284377
Product: soluble hydrogenase small subunit
Protein accession: YP_001549920
Protein GI: 159902576
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0075] Serine-pyruvate aminotransferase/archaeal aspartate aminotransferase
TIGRFAM ID:
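
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this, but it matches the reported gene length) and that the final codon of the unspliced CDS is a stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return length_bp // 3 - 1


length = gene_length_bp(33241, 34398)
print(length, protein_length_aa(length))  # 1158 385
```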


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.269173
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGAAA AACTTGCTCT AATGATCCCT GGACCAACCC CAGTGCCAGA GAGAGTTCTA 
AAAGCTCTAA GTCAACATCC CATTGGTCAC CGCACTCCAG AGTTCCAAGA AATCGTCAAA
AAAACAACTC AGCTACTGCA ATGGCTGCAT CAAACAGAAG GGGATGTTTT AACAATTACT
GGCAGTGGGA CTGCTGCCAT GGAAGCAGGA ATCATTAATA CCTTAAAAAA AGGGGATAAA
GTTATTTGTG GAGAAAATGG AAAATTTGGT GAAAGATGGG TGAAAATTGC CAAGGCCTAT
GGACTCAATG TTGAAATTAT TAAGTCCAAT TGGGGTGAAC CATTAGAACC TGAAAAGTTC
AGAAATATAC TTCAAGCTGA TAATGAAATT CGTGCCGTAA TACTTACCCA CTCAGAAACA
TCTACAGGAG TTATTAACAA TCTAGAGGCA ATTAGTAAAG AGGTTAGAAA ACACGAAAAA
GCAATCACTA TTGCAGACTG TGTTACTAGT TTAGGTGCAT GCAATGTACC TATGGATGAA
TGGGGCATAG ACGTTCTTGC ATCTGGCTCT CAAAAAGGAT ACATGATGCC TCCTGGCCTC
AGTTTTGTTG CAATGAATCA AAGAGCTTGG AAGGCAAGTG AACGCTCAGA TTTACCAAGT
TTTTATTTAA ACCTGAAGTC ATACAAAAAA ACTAGTGATA AAAATAGCAA TCCTTTTACG
CCTAGTGTAA ATCTATATTT TGCATTAGAA GAAGCGTTAA ATATGATGAA AGAGGAAGGT
TTAGAAAAGA TATTTAGTCG TCATAATAGA CATAAAGAAG CAACCCAAAA AGCAATGGAA
GCTATTGGTT TGAAATTATT TGCAGCCCCT GGGTATGGCA GTCCTTCCAT CACTGCAGTA
GAGCCTAAAG ATATTGATGC TGATCTAATA AGAAAAGTAG TAAAAGAAAA TTTTGATATA
TTACTTGCAG GTGGTCAAGA TCACTTAAAA GGAAAAGTCT TTCGCATAGG TCATCTTGGA
TTTGTCAATG ATCGTGACAT TATTACAGCT ATAGCATCTA TAGAATCTGC TCTTAATCAA
TTAGGGGCTT TAAAAGAGCC AATTGGTACT GGAGTAGCTA CCGCTTCAAA AATACTTTTT
AAAGAAAATA GAGTATGA
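
The gene sequence is printed in blocks of ten bases, so the whitespace has to be removed before any statistic such as GC content is computed. The following is a minimal sketch of that step. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the record does not say how its 37% figure was rounded, so the sketch makes no claim about reproducing it exactly.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Example on the first two blocks of the gene sequence.
fragment = clean_sequence("ATGAAAGAAA AACTTGCTCT")
print(len(fragment), gc_percent(fragment))
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `gc_percent` gives a value to compare against the GC content field.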
 
Protein sequence
MKEKLALMIP GPTPVPERVL KALSQHPIGH RTPEFQEIVK KTTQLLQWLH QTEGDVLTIT 
GSGTAAMEAG IINTLKKGDK VICGENGKFG ERWVKIAKAY GLNVEIIKSN WGEPLEPEKF
RNILQADNEI RAVILTHSET STGVINNLEA ISKEVRKHEK AITIADCVTS LGACNVPMDE
WGIDVLASGS QKGYMMPPGL SFVAMNQRAW KASERSDLPS FYLNLKSYKK TSDKNSNPFT
PSVNLYFALE EALNMMKEEG LEKIFSRHNR HKEATQKAME AIGLKLFAAP GYGSPSITAV
EPKDIDADLI RKVVKENFDI LLAGGQDHLK GKVFRIGHLG FVNDRDIITA IASIESALNQ
LGALKEPIGT GVATASKILF KENRV
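
The protein sequence can be derived from the gene sequence with the genetic code named in the Translation table field. The sketch below builds the codon table from the compact NCBI layout (first base varies slowest, in the order T, C, A, G). Table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. Because this gene starts with ATG, no special handling of alternative start codons is included, which is a simplifying assumption. The function name `translate` is illustrative and is not taken from any library.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in NCBI order.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a block-formatted CDS, dropping one trailing stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(
        CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)
    )
    # Remove the terminal stop symbol, if present.
    return protein[:-1] if protein.endswith("*") else protein


# First line of the gene sequence above.
first_line = (
    "ATGAAAGAAA AACTTGCTCT AATGATCCCT "
    "GGACCAACCC CAGTGCCAGA GAGAGTTCTA"
)
print(translate(first_line))  # MKEKLALMIPGPTPVPERVL
```

The printed residues match the first 20 residues of the protein sequence in the record, and translating the final line of the gene sequence (`AAAGAAAATA GAGTATGA`) gives `KENRV`, the last five residues.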