Gene A9601_00341 details

Gene Information

Locus tag: A9601_00341
Symbol: dhsS
ID: 4716716
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 32388
End bp: 33551
Gene Length: 1164 bp
Protein Length: 387 aa
Translation table: 11
GC content: 33%
IMG OID: 640077731
Product: soluble hydrogenase small subunit
Protein accession: YP_001008429
Protein GI: 123967571
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0075] Serine-pyruvate aminotransferase/archaeal aspartate aminotransferase
TIGRFAM ID:
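
The start and end coordinates, gene length, and protein length listed above agree with one another. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(32388, 33551)
print(length_bp, protein_length(length_bp))  # 1164 387
```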


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGCCATAC AACAAAAATT ATCATTGATG ATTCCTGGAC CCACACCAGT TCCAGAAAAA 
GTATTACAAG CATTAAGTAA ACATCCAATA GGCCATCGCA GCAAAGAATT CCAAGATCTC
GTAGAGAGTA CTACTAAAAA TTTACAATGG CTTCATCAAA CTCAAAATGA TGTTCTAACA
ATTACTGGAA GTGGGACTGC CGCAATGGAG GCCGGAATAA TAAATACCTT AAGTAGAGGA
GATAAAGTAA TTTGTGGAGA CAATGGAAAA TTTGGAGAAA GATGGGTAAA AGTTGCTAAA
GAATTTGGTC TAGAAGTAAT AAAAATTGAT TCAGAATGGG GTACTCCACT TGATCCAGAA
AAATTCAAAA AGGTATTAGA AGAAGATACA CAAAAAGAAA TTAAAGCTGT TATTTTGACT
CATTCTGAAA CCTCAACAGG TGTAATTAAT GACTTAAAAA CCATAAGTTC ATATATTCGC
AAACACAATA CAGCTTTATC AATTGTTGAT TGCGTTACAA GTCTTGGAGC TTGCAATGTT
CCAGTAGATG AATGGGAATT AGATATCGTT GCTTCAGGAT CACAAAAAGG TTATATGATA
CCTCCAGGGC TTAGTTTTAT AGCAATGAGC CAAAAAGCAT GGGAAGCTGC AGAAAAATCT
AATCTACCAA AATTTTATTT AAATTTAAAA TCATACAAAA AAAGTCTTTT AAGTAACAGT
AACCCATATA CTCCAGCAGT TAATTTGGTT TTTGCTTTAG ATGAATCTTT AAAAATGATG
AAAGAAGAAG GCTTAGATAA CATTTTCTTC AGACACAATA AACATAAATT AGCAATGAGC
AATGCTGTAA AGGCTTTAGA TCTTAAATTA TTTGCTGATG AAAAATACTT GAGCCCATCA
ATTACTGCGG TAAAAACTGA AGGAATAGAT GCTGAAGAAT TTAGGAAAAC TATAAAAAAT
AATTTTGATA TTTTACTTGC TGGTGGTCAA GATCATATGA AAGGAAAAAT ATTTAGAGTC
GGGCACTTAG GTTATGTTAA TGATAGAGAT ATTATTACGG TGGTTTCTGC TATAAGTAAT
ACACTTCTCA ACCTCGGTAA AATTACAGCC AAACAAGCTG GTGAAGCATT AAAAGTTGCT
TCAAGATATC TAGCAGAGAA TTAA
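
The GC content reported for this gene is the fraction of G and C bases in the sequence above. The following sketch shows one way to compute it from the space-separated 10-base blocks used here. The helper name `gc_content` is hypothetical, and the example runs on the first 60 bases only, so its result differs from the whole-gene figure.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence only (60 bases), not the full gene.
first_line = "GTGGCCATAC AACAAAAATT ATCATTGATG ATTCCTGGAC CCACACCAGT TCCAGAAAAA"
print(f"{gc_content(first_line):.0%}")  # 40% for these 60 bases
```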
 
Protein sequence
MAIQQKLSLM IPGPTPVPEK VLQALSKHPI GHRSKEFQDL VESTTKNLQW LHQTQNDVLT 
ITGSGTAAME AGIINTLSRG DKVICGDNGK FGERWVKVAK EFGLEVIKID SEWGTPLDPE
KFKKVLEEDT QKEIKAVILT HSETSTGVIN DLKTISSYIR KHNTALSIVD CVTSLGACNV
PVDEWELDIV ASGSQKGYMI PPGLSFIAMS QKAWEAAEKS NLPKFYLNLK SYKKSLLSNS
NPYTPAVNLV FALDESLKMM KEEGLDNIFF RHNKHKLAMS NAVKALDLKL FADEKYLSPS
ITAVKTEGID AEEFRKTIKN NFDILLAGGQ DHMKGKIFRV GHLGYVNDRD IITVVSAISN
TLLNLGKITA KQAGEALKVA SRYLAEN
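
The gene starts with GTG while the protein starts with M. This is consistent with translation table 11, in which GTG is one of the permitted initiation codons and is read as methionine at the start position. The following is a minimal sketch of such a translation, not the database's own pipeline. The codon table is built from the standard amino acid assignments, which table 11 shares. `translate_cds` is a hypothetical helper that raises `KeyError` on ambiguous bases such as N.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(sequence: str) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at a stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is always read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon ends translation and is not emitted
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGCCATAC AACAAAAATT ATCATTGATG"))  # MAIQQKLSLM
```

Passing the full gene sequence to the same function is a quick way to check it against the listed protein sequence.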