Gene P9303_27941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_27941 
Symbol 
ID4778659 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp2460403 
End bp2461404 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content46% 
IMG OID640088317 
Productglutathione S-transferase N terminus protein 
Protein accessionYP_001018789 
Protein GI124024482 
COG category[O] Posttranslational modification, protein turnover, chaperones
[S] Function unknown 
COG ID[COG0625] Glutathione S-transferase
[COG3502] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAATC CGTTGTTGTA TAGCTTCCGT CGCTGCCCTT ATGCAATGAG AGCCCGATGG 
GCTCTTTTGG TTTCTGGCCT TTTGGTGAAC TTGCGGGAAG TGGCCCTAAA CAATAAGCCA
CCAGAGCTGC TGCAGGCTTC TCAAAAAGGA ACGGTGCCAG TGCTGTTGAC TGCAGATGGG
ACAGTGATTG ATGAAAGCAT GGACATCATG CACTGGGCTC TTCAGCAAGC TGATCCCTTC
GATGGGCTAC GCAGCGGAAA AGCCGAAGAA CAACAAACAA TTCAGCAGCT TATCGAACAG
AATGATGGCC CGTTTAAATA TCATTTAGAT CGTTTTAAAT ATGCTTGCAG GTTCAAAGGA
GAAGATGCCG AAGAACATCG CAACATGGCT AGAGACATTC TTGTGGAATG GAATGCGCGA
CTAGCACAAC AAGAATCAAG TGATTGCTAT GGTTGCTTGA TTGGAGAATC TCAGAGCTTG
GCAGACTGGG CTCTATGGCC TTTTGTGCGT CAATATCGTC TCGCTGATCC ATCAAGCTTT
GATTGCGATC AAGACCTTCA AGCCATTAAA AGATGGTTGA AAGCCTTTCT GCAACATCCA
CTGTATGCAA GATTGATGAC ACCAGTTAAG CCTTGGTTGC CAGAACATCA ACCGCAGACG
TTCCCTGCTG ATTCAAGTTT AGTTAAAACA GATCAACCAT TGTTTCATCT GGCTTTGCTT
GAAGACTGGC AAGACGCATG CAATCAAGGG GTTTATCAAT TCTCTACTCG CGGATTAAAA
CTCAAAGAGA TAGGATTCAT CCATTTGAGC TATCAGCATC AACTTGAGTC TACTTATCAT
CAATTTTATC GTGATCGAGG CCAGGTGCTT AGCTTGAAAT TAAACCCAGA GCAACTGACA
ATGCCGCTTC GAGCCGAACC CTCATCAGCA GGGGAGCTTT TCCCTCATCT TTTTGGAGTC
CTGCCTTTGA GTGCTGTAGA ACTTGTGGAA ACTTATCCAT GA
 
Protein sequence
MSNPLLYSFR RCPYAMRARW ALLVSGLLVN LREVALNNKP PELLQASQKG TVPVLLTADG 
TVIDESMDIM HWALQQADPF DGLRSGKAEE QQTIQQLIEQ NDGPFKYHLD RFKYACRFKG
EDAEEHRNMA RDILVEWNAR LAQQESSDCY GCLIGESQSL ADWALWPFVR QYRLADPSSF
DCDQDLQAIK RWLKAFLQHP LYARLMTPVK PWLPEHQPQT FPADSSLVKT DQPLFHLALL
EDWQDACNQG VYQFSTRGLK LKEIGFIHLS YQHQLESTYH QFYRDRGQVL SLKLNPEQLT
MPLRAEPSSA GELFPHLFGV LPLSAVELVE TYP