Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_03171 |
Symbol | |
ID | 5731587 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 301281 |
End bp | 302789 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 641284664 |
Product | hypothetical protein |
Protein accession | YP_001550202 |
Protein GI | 159902858 |
COG category | [S] Function unknown |
COG ID | [COG0391] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01826] conserved hypothetical protein, cofD-related |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.128055 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCCATC TGGCTTTCTT TAAGTGGACT GAAGGTCAAG TTCCCTTAGA GTTATCTATA AATAATCTCT GTGCGAGTTT TTTAGAGTTG TCCTCGATTA ATAAACCATT TAGAAAAAGA AGGAGAAGAA ATCAGGCCCA TGCTGATTTG TCGAAGAATA GCTTCAGAAG AATCAATGAA ATTAACTTTA TAAGGAGATT GCAAAGAGCA GTGCGTTGGC TATTGCCTGG ATTGGTTGTA AAACGCTGGA TGTTTACTTC AGGCATTGGG TTAATAATAG CTCTTCTTGG AGCTGCTATA TGGGCTGACT TAAACCCTAT TTATTGGGCA GTTGAAAGAT TGTTTTGGTT TCTTGAAGGA ATTACCACGT TCCTACCTAG AAGTTTTACT GGACCAATTG TTTTTTTAAT TGGTATTGGT CTTCTGCTTT GGGGGCAGAG TAGGAGCTTT GATTCAATAC AAAAAGCTGT TGCTCCTGAT AAAGATGCAG TTTTAGTAGA CGCATTAATG GTTAAGAGCA AATTGAATAG GGGGCCAAAT ATTGTTGCTA TTGGAGGTGG AACTGGCTTA GCTTCACTAC TTCAAGGTTT GAAGAGATAT AGCAGTCGCA TAACCGCAAT TGTCACAGTT GCAGACGATG GAGGAAGCAG TGGAATTTTG CGAAGAGAGC TTGGTGTGCA GCCGCCAGGG GATATTCGCA ATTGTCTTGC AGCTCTATCA AATGAAGAGC CTCTTTTAAC AAGACTTTTT CAATATCGCT TTTCATCTGG GACTGGATTG GCAGGTCATA GTTTTGGCAA TCTCTTTCTT TCAGCATTGA CTTCTATTAC AGGCAATATT GATACAGCTA TTACAGCTTC TAGTCGAATC CTGTCCGTTC AAGGCCAAGT TGTTCCAGCA ACTAATGCTG ATGTATGTCT TTGGGCTGAA TTGGAGAATG GAGAGGTTGT TGAAGGGGAG TCATCAATAG GTCGCGCTTC TAGCCCAATA GTTCGTATTG GTTGCTATCC AGAAAAACCT CCTGCAATTA GCAGAGCTTT AGATGCAATA GAGAATGCGG AATTAATCTT GCTGGGTCCA GGAAGTCTTT ATACTTCTCT TTTGCCAAAC TTATTGGTGC CAGAAATAGT CGCTGCGATA CAAAAAAGTA AAGCGCCAAA ATTATATATC TGTAATTTAA TGACTCAGCC AGGAGAAACA GATGGTCTAG ATGTAGCAGG ACATATCAGA GCTATTGAGG CTCAATTAGC AAGTCTTGGC ATTACTAATA GGATTTTCAA TGAAATACTT GTTCAAGAAG CTCTTGCCCC ATCTCCTTTG ATTGAGTATT ACCGATCACG AGGGGCAGAG CCTGTTAAAT GTGATCGTAA TAGCCTTCTT TCTAAGGGGT ACAGGGTTTA TCAGGCATCA CTTCAGGGTT CTAAAGCTAC CCCTACTTTG AGGCATGATC CAAGGAGTCT TTCTTTAGCT GTTATGCGCT TTTATCGAAA ATATAAAAGA AAGAATTAA
|
Protein sequence | MFHLAFFKWT EGQVPLELSI NNLCASFLEL SSINKPFRKR RRRNQAHADL SKNSFRRINE INFIRRLQRA VRWLLPGLVV KRWMFTSGIG LIIALLGAAI WADLNPIYWA VERLFWFLEG ITTFLPRSFT GPIVFLIGIG LLLWGQSRSF DSIQKAVAPD KDAVLVDALM VKSKLNRGPN IVAIGGGTGL ASLLQGLKRY SSRITAIVTV ADDGGSSGIL RRELGVQPPG DIRNCLAALS NEEPLLTRLF QYRFSSGTGL AGHSFGNLFL SALTSITGNI DTAITASSRI LSVQGQVVPA TNADVCLWAE LENGEVVEGE SSIGRASSPI VRIGCYPEKP PAISRALDAI ENAELILLGP GSLYTSLLPN LLVPEIVAAI QKSKAPKLYI CNLMTQPGET DGLDVAGHIR AIEAQLASLG ITNRIFNEIL VQEALAPSPL IEYYRSRGAE PVKCDRNSLL SKGYRVYQAS LQGSKATPTL RHDPRSLSLA VMRFYRKYKR KN
|
| |