Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_01671 |
Symbol | |
ID | 5730242 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 162676 |
End bp | 163566 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 641284511 |
Product | hypothetical protein |
Protein accession | YP_001550052 |
Protein GI | 159902708 |
COG category | [S] Function unknown |
COG ID | [COG1354] Uncharacterized conserved protein |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCCGATTG ACGGATTAAA AACAGCACCA CAGTCAGGCG CAAGGCTTGC CATTCGCCTA ATACAAGATG CTGTAGAGAA AGGTGAAATT GACCCTTGGG ATGTTGATGT AATTCCAGTG ATTGATGGTT TTTTAGATCA ATTAAGGCAG AAGATACAAA GTCCAAGAAA TATTTCTTCT TCAATAGCCT ATAGGGGGGG TACCTTTGAA AAAGATTTAT CAGATAGTAG TGAGGCATTT CTTGCTGCTT CTGTCTTAGT TGGCTTGAAA GCTGAAATTC TGGAATCGGA AACTTTCCCT CCAGAATCTG ATTTCGACGA TATGGATGAT GGTAGTTTTG CTGACCAAGA ATGGATTGAT CCTAAATTAG ATCTTCCCAG ATATCCTGAA CGACATTTAT ATAGACGACC AGTTGCGCCC CCTCCACTGC GACGATCAAT CTCTTTAGCA GAATTAATTG AGCAATTGGA AGCCATTGCA GAAACAATAG AATCTGATGA ATTAAATATT CGAAGGAAAA GACGGGAAAA GAGATTTAGT GATAAACAAA TAATTCAACA AGTCAGCTCA CTGGCACATC GCGAAAAACT TCCAGAGACA ACTGCTGCAT TGGGGGTTTT TTTGAATAGT TGTGAAGAAG CTCTTAATTG GGTAGATTTT GAATTCTTGG TCGAACTATG GAAAGGTATA GCTAGTCAGG AATTAGATTC TGATCGCGTT GGCGTTTTTT GGGCATTGTT ATTTTTATGT TCTCAGGGCA AAGTTGAGCT TCAACAAGAC AATTCTTTAT TTGGCAAGCT AAAGCTTAAA AGGATTCTCG CACCTGGAAC TATTGCTCAA CTACCTCTGA AAACATTTGA CGTGACAGCT GTTGCTCCGG CCGCGGCTTA G
|
Protein sequence | MPIDGLKTAP QSGARLAIRL IQDAVEKGEI DPWDVDVIPV IDGFLDQLRQ KIQSPRNISS SIAYRGGTFE KDLSDSSEAF LAASVLVGLK AEILESETFP PESDFDDMDD GSFADQEWID PKLDLPRYPE RHLYRRPVAP PPLRRSISLA ELIEQLEAIA ETIESDELNI RRKRREKRFS DKQIIQQVSS LAHREKLPET TAALGVFLNS CEEALNWVDF EFLVELWKGI ASQELDSDRV GVFWALLFLC SQGKVELQQD NSLFGKLKLK RILAPGTIAQ LPLKTFDVTA VAPAAA
|
| |