Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9301_14261 |
Symbol | |
ID | 4912035 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | - |
Start bp | 1193571 |
End bp | 1194464 |
Gene Length | 894 bp |
Protein Length | 297 aa |
Translation table | 11 |
GC content | 26% |
IMG OID | 640161017 |
Product | hypothetical protein |
Protein accession | YP_001091650 |
Protein GI | 126696764 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0547193 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGAAA AAATCTTAAT ATCTGGTTGT ACAGGTTTGA CTGGCAAATA CGCAACTCTT AAAATATTAA ATAAATATAA GAATATTGAA GTGATAGGTT TTTCTAGAGA TATTAATAAA TCTTTCTCAA GTAATTCATT CACTTTTTTA CAAGGCGATG CAAATAAAAC GTCATTTTGG CGAGAATTAC TTAAAAAGTT TAGACCTAAT ACAATTCTAA TTAATTCAAA TATTAGGCAT TTTTTACCTT TCCTAGAAGC TATAGATACT TTAGATATAG AAGAATTTCC TAGAGTTGTT ATAGTCTCTA CTACTGGTAT TTTTTCTAAA TTCAACTCAT ATAATTTTTT ATATAAACAA ATAGAGGGTA AAATTAAATC TTATAGAGGC AATTTCTTGA TCTTAAGACC TTCACTAATT TATGGTTCAA AAAACGATAA AAATATCTCA AAATTAATTA GGTTTGTACA TAAATTTAGA TTTTCATTTT CTTTTGGGAG TGGCTTGAAT TATTTTCAAC CAATTTTTTA TAAAGATTTA GGTTGGGCTA TTTCAGAGGT TTTACTTAAT AAAAATATTT CTGGTGAATA TAATCTTACT GGAAAAAACT GCATCACTTT TATTGAGATT CTTAAATTAA TTAGCTTTAA TCTTAAAAAA AATCTTATTA ATATAAAACT TCCTTTGAAA TTAACTGGAA ATATATTGCT TTTTGTTGAG AAATATTTGA GGTTTACTAT TCTTCCTATT ACGAGTGAAC AAGTTTTCCG AATGTCTGAA AATAAATGCT ACAGTCATTT AAAAGCAAAA GAAGACTTCG GATTTGAGCC TATTAGTTTT CAAGATGGTA TTAAGTTACA AATTGATGAA ATGATCGTGC AAGGAGATTT ATGA
|
Protein sequence | MKEKILISGC TGLTGKYATL KILNKYKNIE VIGFSRDINK SFSSNSFTFL QGDANKTSFW RELLKKFRPN TILINSNIRH FLPFLEAIDT LDIEEFPRVV IVSTTGIFSK FNSYNFLYKQ IEGKIKSYRG NFLILRPSLI YGSKNDKNIS KLIRFVHKFR FSFSFGSGLN YFQPIFYKDL GWAISEVLLN KNISGEYNLT GKNCITFIEI LKLISFNLKK NLINIKLPLK LTGNILLFVE KYLRFTILPI TSEQVFRMSE NKCYSHLKAK EDFGFEPISF QDGIKLQIDE MIVQGDL
|
| |