Gene P9211_02801 details


Gene Information

Locus tag: P9211_02801
Symbol:
ID: 5730217
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 263200
End bp: 265404
Gene Length: 2205 bp
Protein Length: 734 aa
Translation table: 11
GC content: 34%
IMG OID: 641284625
Product: hypothetical protein
Protein accession: YP_001550165
Protein GI: 159902821
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1452] Organic solvent tolerance protein OstA
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
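
The Start bp, End bp, Gene Length and Protein Length values above are mutually consistent. The short Python check below is only an illustrative sketch: it assumes the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and do not come from the record.

```python
# Values copied from the record above.
start_bp = 263200
end_bp = 265404

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the final codon is a
# stop codon, which does not contribute an amino acid.
assert gene_length % 3 == 0
protein_length = gene_length // 3 - 1

print(gene_length)     # 2205
print(protein_length)  # 734
```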


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGTCGAC AGAATGTTTT GGCGCCGATA GTCTCTAATG CTTGGATCTC AGGACTTTTT 
TCTGCTGGCT GCTTAGTTTT TGCGCCTTTA GAAAGTTTTT TGGCTGATGC CAAAGCGTTT
GAAATCCACA CTTCAGTTCA AGGAGTTAAT ACTAACCAAC GAATTCGTTT GCTTAAAGAC
GAGCGCAGAC AATATTTGGC ATCTAATATC TTAATTGCTT CACACTCAGT TAAGGATAAC
TTTCCTCATA AGAAAAAAGA CCTTTCTCAG TTATCTATAG ACGCTTCAGG TAGTCATTCT
CAAATACAAA ATACGCATAA AGAGATTCAT CTCGATAGTG AATCTTCTTT TCTCAGAGTT
GATATTCATG CCGATCGCCA ATATTGGGAA ACGGATAATG TATTTGTAGC TGAAGGTAAT
GTTGTTGTAT CTTTCAATCA AGGTATACTT CGAGCAGAAA AAATTGTTTT TGACCGATCT
AAAAATCTTC TTTTTGCAAC TGGTGATGTT CGCTTTATGC GAGGAGAACA ATACTTTAGG
GCCAGTTATT TTAGATATAA CTTAGTAAGC AAGAATGGAT ACTTAGACGA TGTCTATGGA
GTAATCAAAG TCAACCTTTT AACTAATGAT CTTAACATAA ACTCTTCAAC TAACATTAAT
AAAAAGCAAT CTAGGAATAC TATACCAAAT AATTCAGCTT CTAGGATTAC TCTAAATGAT
GGAGTAGTTA TAGAAGGAGG TAAAATTGAT TTAGGTTTAA ATCCATTTGT TGCTGGTGAT
CTCTCTGATA AAGGTATAAA TAGCTGGCGA TTTAACTCTC CAAAAGTCAT AATTAATCGA
TCAGGTTGGA AAGCTAAAAT AATGACTTTT TCTAATGATC CATTTAACCC TGCTCAAGCC
AAGCTTGTTG CTAAGAATGT AATTGCGAAT GAGAATAAAG ATGGCACTTT GCTAATTAAA
TCTAGTAAGA CAAAGTTAAT ATTAGAGGAT CAATTAAATA TCCCTATTGG TAAAAGATCC
TTTGGAGCTA ATCAAGAAAA TGAAGAACGT TGGATCTTAG GATTTGATAC AAAAGACAGA
GATGGATTAT ATATAGGAAG AAAGTTTAAG CCTATTCAGT TAGATGAGAA TTATGAATTA
TCACTGCAGC CACAATTTCT TTTTCAACGA GCTATTAATG AAAAAACAAA TGCTTACCCT
GAATCAGATT TATCTGTTTT GAGCCCTAAG GTATCACAAT CAACAAAATT TTCAGATTTA
ATAGGAATGA AGGCAAAATT AAAAGGAAAA ACATTTAATA TGCAGTCAGA ATTATCTGCA
AACATAAGCA GTTTCAATCC AGATAGATTT GCTAATGGAA GTCGATATTG GGGTGCCCTT
AAAGATTCTT TTGATCTTGG TGGGATTAAA GATATCAATG CAGTTCTTTT TGCAGCTTAT
CGTTATAAAT CATGGAATGG TTCTTTGGGA AGAAGTGATA TTTATACTTC AGTTGGTGGC
TACGTGGATA AAGAAGTGGA TTGGGGAAAT GGGACTTCCC GTTATGAATA TAGATTTAGA
TCTGGAATCG GTAAATATCA AGCTGAAGCT TTAAAATCTC TTACTTTATC TCATCTATGG
AGAGCAAGCA TCTTTAACTC TTTGAATATT TCATACCCTA TATATATGTT CGAAGATGCC
AGTTCAGTCA ATCAAGTTAA ACCAAGATAT TCCATGGCAA AAATTAACCC TGGCATTATT
CTTAATACAG AAATTTTTTC GACTTATTTT CATTACGAAG GTGGAGATAG TCAGTTTTCA
TTCGGAGTAA ATGCAGGGCC TGAGTTAACA TTAGGAAACT TTAGAAAGCC TTTCCTAGAT
TATACAAAAG TATCAATTAT GCCAGGCTTT ACTGTTAAAG CTGGCGATAG TCCATTTAAA
TTTGACAATG AAGTTGACCT TCAGAAAATT TCTTTTCAAT TAACTCAACA AATATATGGT
CCTTTGCTTC TTTCGGGTAT TTACAATGTC AATATTGACA AAGACTCTGA TCAATATGGA
AAATCTTTAA GTTCTAAATT AGCCATTTTA TGGGAACGCA GATCGTATGC TTTGGGTATA
TTCTATGATA TTAATGATAA CTCTGGAGGT TTGATGTTCA GATTGAATGG GTTTGACATT
GAAAGGGCAC TAATCCCAAA TGATTCAATT GTAGATACTA TCTAA
 
Protein sequence
MRRQNVLAPI VSNAWISGLF SAGCLVFAPL ESFLADAKAF EIHTSVQGVN TNQRIRLLKD 
ERRQYLASNI LIASHSVKDN FPHKKKDLSQ LSIDASGSHS QIQNTHKEIH LDSESSFLRV
DIHADRQYWE TDNVFVAEGN VVVSFNQGIL RAEKIVFDRS KNLLFATGDV RFMRGEQYFR
ASYFRYNLVS KNGYLDDVYG VIKVNLLTND LNINSSTNIN KKQSRNTIPN NSASRITLND
GVVIEGGKID LGLNPFVAGD LSDKGINSWR FNSPKVIINR SGWKAKIMTF SNDPFNPAQA
KLVAKNVIAN ENKDGTLLIK SSKTKLILED QLNIPIGKRS FGANQENEER WILGFDTKDR
DGLYIGRKFK PIQLDENYEL SLQPQFLFQR AINEKTNAYP ESDLSVLSPK VSQSTKFSDL
IGMKAKLKGK TFNMQSELSA NISSFNPDRF ANGSRYWGAL KDSFDLGGIK DINAVLFAAY
RYKSWNGSLG RSDIYTSVGG YVDKEVDWGN GTSRYEYRFR SGIGKYQAEA LKSLTLSHLW
RASIFNSLNI SYPIYMFEDA SSVNQVKPRY SMAKINPGII LNTEIFSTYF HYEGGDSQFS
FGVNAGPELT LGNFRKPFLD YTKVSIMPGF TVKAGDSPFK FDNEVDLQKI SFQLTQQIYG
PLLLSGIYNV NIDKDSDQYG KSLSSKLAIL WERRSYALGI FYDINDNSGG LMFRLNGFDI
ERALIPNDSI VDTI
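
The gene sequence begins with GTG while the protein sequence begins with M, which fits translation table 11, where GTG can serve as an initiation codon read as methionine. The sketch below illustrates this on the first and last lines of the gene sequence. It is a minimal stand-alone example, not the tool that produced this record: the `translate` function, the `ALT_STARTS` set (a deliberately small subset of the initiation codons allowed by table 11) and the other names are assumptions made for illustration. The amino-acid string is the standard codon assignment in TCAG order, which table 11 shares with the standard code.

```python
from itertools import product

# Codon-to-amino-acid assignments, with bases ordered T, C, A, G
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a small illustrative subset of table 11 initiation codons.
ALT_STARTS = {"ATG", "GTG", "TTG"}


def translate(dna):
    """Translate a DNA string (spaces allowed) until the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in ALT_STARTS:
            aa = "M"  # an initiation codon is read as methionine
        if aa == "*":
            break     # stop codon: end of the protein
        protein.append(aa)
    return "".join(protein)


first_line = "GTGCGTCGAC AGAATGTTTT GGCGCCGATA GTCTCTAATG CTTGGATCTC AGGACTTTTT"
last_line = "GAAAGGGCAC TAATCCCAAA TGATTCAATT GTAGATACTA TCTAA"

print(translate(first_line))  # MRRQNVLAPIVSNAWISGLF
print(translate(last_line))   # ERALIPNDSIVDTI
```

Both outputs match the corresponding ends of the protein sequence listed above. The last line can be translated on its own only because the 36 full lines before it hold 2160 bases, a multiple of three, so it starts in frame.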