Gene P9303_00431 details


Gene Information

Locus tag: P9303_00431
Symbol: dhsS
ID: 4778242
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 42440
End bp: 43588
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 53%
IMG OID: 640085543
Product: soluble hydrogenase small subunit
Protein accession: YP_001016065
Protein GI: 124021758
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0075] Serine-pyruvate aminotransferase/archaeal aspartate aminotransferase
TIGRFAM ID:
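
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 42440
end_bp = 43588


def gene_length(start, end):
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))  # prints "1149 382"
```

Both results agree with the Gene Length and Protein Length fields of this record.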


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCAGGACA AACTCACCCT GATGATCCCG GGTCCTACAC CGGTTCCCGA AACAGTACTC 
AAAGCAATGG GTCGCCACCC CATCGGCCAT CGCAGTGGAG AGTTTCAAGC CGTCGTTAGA
CATACCACCG AGCAGCTGCG CTGGCTGCAT CAGACCAAAT CAGATGTTCT AGTCATCACT
GGCAGCGGCA CCGCTGCGAT GGAAGCAGGA ATCATCAATA CCCTCAGCTA CGGCGACAAG
GTTCTCTGCG GCGACAACGG CAAGTTTGGA CAGCGCTGGG TAAAGCTGGC TAAGGCTTAC
GGCCTGGACG TACAAGTGAT AAAAGCAGAC TGGGGACAAC CGTTAGACCC AGAAGCCTTC
AAAAGAGCAC TGGAAGCAGA CAATGGCAAA ACAATCAAAG CGGTAATTCT TACCCACTCG
GAAACCTCAA CCGGGGTCAT TAACGACCTT GAAACCATCA GCAAATATGT CCGCACTCAC
GGCAAAGCAC TAACCATCGC TGATTGCGTG ACAAGCCTTG GGGCCTGCAA TGTACCCATG
GATTCTTGGG GTCTAGATGT GGTTGCCTCC GGTTCCCAAA AGGGATACAT GATGCCTCCA
GGTCTCAGCT TCGTGGCCAT GAGTGAACGA GCCTGGCAAG CACATCAACA ATCGGATCTA
CCGAAGTTTT ATCTCGATCT GGGGCCTTAC CGAAAAACTG CCGCTCAAGA CAGCAATCCA
TTCACCCCTG CTGTGAATCT CTACTTCGCA CTGGAATCTG CGCTGGGAAT GATGCAGTCA
GAAGGACTAG AAGCCATCTT TGACCGTCAT GCCCGCCACC GCGCAGCCGC TCAGGCCGGC
ATGAAGGCCA TCTGCCTGCC CCTGTATGCA GCTGAAGGAC ACGGCAGCCC AGCGATCACC
GCAGTCGCAC CTGAGGGAGT TGATGCAGAG CAACTGCGCA AAACTGTCAA AGAGAAATTC
GACATCCTGC TAGCAGGTGG ACAGGATCAT CTAAAAGGGA AGGTCTTCCG TATCGGCCAT
CTTGGATTCG TATGTGATAG AGATATCCTC ACTGCCATAG CTGCCATCGA ATCCACCTTG
CAATCTCTTG GCTTGCATAA AGGCAACATG GGAGATGGCC TGGCTGCTGC TGCAGCAATT
CTGAGATAA
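
A GC content figure such as the one in the Gene Information section can be recomputed from a sequence printed in this 10-base block layout. The sketch below is one plausible way to do it, assuming GC content means the percentage of G and C bases over all bases; `gc_content` is a hypothetical helper, not the method used to produce this record.

```python
def gc_content(seq):
    """Percentage of G and C bases, ignoring whitespace and case."""
    # Remove the spaces and line breaks used to print 10-base blocks.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Usage on a short made-up string, not the gene itself.
print(gc_content("ATGC GGCC"))  # prints 75.0
```

Passing the full gene sequence above as one string, with its spaces and line breaks left in, should work unchanged because whitespace is stripped first.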
 
Protein sequence
MQDKLTLMIP GPTPVPETVL KAMGRHPIGH RSGEFQAVVR HTTEQLRWLH QTKSDVLVIT 
GSGTAAMEAG IINTLSYGDK VLCGDNGKFG QRWVKLAKAY GLDVQVIKAD WGQPLDPEAF
KRALEADNGK TIKAVILTHS ETSTGVINDL ETISKYVRTH GKALTIADCV TSLGACNVPM
DSWGLDVVAS GSQKGYMMPP GLSFVAMSER AWQAHQQSDL PKFYLDLGPY RKTAAQDSNP
FTPAVNLYFA LESALGMMQS EGLEAIFDRH ARHRAAAQAG MKAICLPLYA AEGHGSPAIT
AVAPEGVDAE QLRKTVKEKF DILLAGGQDH LKGKVFRIGH LGFVCDRDIL TAIAAIESTL
QSLGLHKGNM GDGLAAAAAI LR
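
The protein sequence starts with M although the gene sequence starts with GTG, which is consistent with translation table 11, where GTG is an accepted initiation codon. The sketch below shows how a CDS could be translated under that table. It is an illustration under stated assumptions: the codon table is built from the standard TCAG ordering, the start codon set is the one listed for NCBI table 11, and `translate_cds` is a hypothetical helper name.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (table 11 shares these with table 1).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(zip(("".join(p) for p in product(BASES, repeat=3)), AMINO_ACIDS))

# Initiation codons listed for table 11; any of them is read as M at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq):
    """Translate a complete CDS, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop block spacing and line breaks
    if not dna or len(dna) % 3 != 0:
        raise ValueError("CDS length must be a non-zero multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError("unexpected start codon: " + codons[0])
    protein = ["M"]  # the initiation codon always yields methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above, followed by its final stop codon.
print(translate_cds("GTGCAGGACA AACTCACCCT GATGATCCCG TAA"))  # prints MQDKLTLMIP
```

The output matches the first ten residues of the protein sequence above.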