Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_02801 |
Symbol | |
ID | 5730217 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 263200 |
End bp | 265404 |
Gene Length | 2205 bp |
Protein Length | 734 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 641284625 |
Product | hypothetical protein |
Protein accession | YP_001550165 |
Protein GI | 159902821 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1452] Organic solvent tolerance protein OstA |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGTCGAC AGAATGTTTT GGCGCCGATA GTCTCTAATG CTTGGATCTC AGGACTTTTT TCTGCTGGCT GCTTAGTTTT TGCGCCTTTA GAAAGTTTTT TGGCTGATGC CAAAGCGTTT GAAATCCACA CTTCAGTTCA AGGAGTTAAT ACTAACCAAC GAATTCGTTT GCTTAAAGAC GAGCGCAGAC AATATTTGGC ATCTAATATC TTAATTGCTT CACACTCAGT TAAGGATAAC TTTCCTCATA AGAAAAAAGA CCTTTCTCAG TTATCTATAG ACGCTTCAGG TAGTCATTCT CAAATACAAA ATACGCATAA AGAGATTCAT CTCGATAGTG AATCTTCTTT TCTCAGAGTT GATATTCATG CCGATCGCCA ATATTGGGAA ACGGATAATG TATTTGTAGC TGAAGGTAAT GTTGTTGTAT CTTTCAATCA AGGTATACTT CGAGCAGAAA AAATTGTTTT TGACCGATCT AAAAATCTTC TTTTTGCAAC TGGTGATGTT CGCTTTATGC GAGGAGAACA ATACTTTAGG GCCAGTTATT TTAGATATAA CTTAGTAAGC AAGAATGGAT ACTTAGACGA TGTCTATGGA GTAATCAAAG TCAACCTTTT AACTAATGAT CTTAACATAA ACTCTTCAAC TAACATTAAT AAAAAGCAAT CTAGGAATAC TATACCAAAT AATTCAGCTT CTAGGATTAC TCTAAATGAT GGAGTAGTTA TAGAAGGAGG TAAAATTGAT TTAGGTTTAA ATCCATTTGT TGCTGGTGAT CTCTCTGATA AAGGTATAAA TAGCTGGCGA TTTAACTCTC CAAAAGTCAT AATTAATCGA TCAGGTTGGA AAGCTAAAAT AATGACTTTT TCTAATGATC CATTTAACCC TGCTCAAGCC AAGCTTGTTG CTAAGAATGT AATTGCGAAT GAGAATAAAG ATGGCACTTT GCTAATTAAA TCTAGTAAGA CAAAGTTAAT ATTAGAGGAT CAATTAAATA TCCCTATTGG TAAAAGATCC TTTGGAGCTA ATCAAGAAAA TGAAGAACGT TGGATCTTAG GATTTGATAC AAAAGACAGA GATGGATTAT ATATAGGAAG AAAGTTTAAG CCTATTCAGT TAGATGAGAA TTATGAATTA TCACTGCAGC CACAATTTCT TTTTCAACGA GCTATTAATG AAAAAACAAA TGCTTACCCT GAATCAGATT TATCTGTTTT GAGCCCTAAG GTATCACAAT CAACAAAATT TTCAGATTTA ATAGGAATGA AGGCAAAATT AAAAGGAAAA ACATTTAATA TGCAGTCAGA ATTATCTGCA AACATAAGCA GTTTCAATCC AGATAGATTT GCTAATGGAA GTCGATATTG GGGTGCCCTT AAAGATTCTT TTGATCTTGG TGGGATTAAA GATATCAATG CAGTTCTTTT TGCAGCTTAT CGTTATAAAT CATGGAATGG TTCTTTGGGA AGAAGTGATA TTTATACTTC AGTTGGTGGC TACGTGGATA AAGAAGTGGA TTGGGGAAAT GGGACTTCCC GTTATGAATA TAGATTTAGA TCTGGAATCG GTAAATATCA AGCTGAAGCT TTAAAATCTC TTACTTTATC TCATCTATGG AGAGCAAGCA TCTTTAACTC TTTGAATATT TCATACCCTA TATATATGTT CGAAGATGCC AGTTCAGTCA ATCAAGTTAA ACCAAGATAT TCCATGGCAA AAATTAACCC TGGCATTATT CTTAATACAG AAATTTTTTC GACTTATTTT CATTACGAAG GTGGAGATAG TCAGTTTTCA TTCGGAGTAA ATGCAGGGCC TGAGTTAACA TTAGGAAACT TTAGAAAGCC TTTCCTAGAT TATACAAAAG TATCAATTAT GCCAGGCTTT ACTGTTAAAG CTGGCGATAG TCCATTTAAA TTTGACAATG AAGTTGACCT TCAGAAAATT TCTTTTCAAT TAACTCAACA AATATATGGT CCTTTGCTTC TTTCGGGTAT TTACAATGTC AATATTGACA AAGACTCTGA TCAATATGGA AAATCTTTAA GTTCTAAATT AGCCATTTTA TGGGAACGCA GATCGTATGC TTTGGGTATA TTCTATGATA TTAATGATAA CTCTGGAGGT TTGATGTTCA GATTGAATGG GTTTGACATT GAAAGGGCAC TAATCCCAAA TGATTCAATT GTAGATACTA TCTAA
|
Protein sequence | MRRQNVLAPI VSNAWISGLF SAGCLVFAPL ESFLADAKAF EIHTSVQGVN TNQRIRLLKD ERRQYLASNI LIASHSVKDN FPHKKKDLSQ LSIDASGSHS QIQNTHKEIH LDSESSFLRV DIHADRQYWE TDNVFVAEGN VVVSFNQGIL RAEKIVFDRS KNLLFATGDV RFMRGEQYFR ASYFRYNLVS KNGYLDDVYG VIKVNLLTND LNINSSTNIN KKQSRNTIPN NSASRITLND GVVIEGGKID LGLNPFVAGD LSDKGINSWR FNSPKVIINR SGWKAKIMTF SNDPFNPAQA KLVAKNVIAN ENKDGTLLIK SSKTKLILED QLNIPIGKRS FGANQENEER WILGFDTKDR DGLYIGRKFK PIQLDENYEL SLQPQFLFQR AINEKTNAYP ESDLSVLSPK VSQSTKFSDL IGMKAKLKGK TFNMQSELSA NISSFNPDRF ANGSRYWGAL KDSFDLGGIK DINAVLFAAY RYKSWNGSLG RSDIYTSVGG YVDKEVDWGN GTSRYEYRFR SGIGKYQAEA LKSLTLSHLW RASIFNSLNI SYPIYMFEDA SSVNQVKPRY SMAKINPGII LNTEIFSTYF HYEGGDSQFS FGVNAGPELT LGNFRKPFLD YTKVSIMPGF TVKAGDSPFK FDNEVDLQKI SFQLTQQIYG PLLLSGIYNV NIDKDSDQYG KSLSSKLAIL WERRSYALGI FYDINDNSGG LMFRLNGFDI ERALIPNDSI VDTI
|
| |