Gene Information |
Locus tag | NATL1_08781 |
Symbol | wza |
ID | 4780032 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 816931 |
End bp | 818178 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640084153 |
Product | hypothetical protein |
Protein accession | YP_001014701 |
Protein GI | 124025585 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1596] Periplasmic protein involved in polysaccharide export |
TIGRFAM ID | |
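The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state the convention, but the numbers agree with it) and that the protein length excludes the terminal stop codon. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 816931
end_bp = 818178
gene_length_bp = 1248
protein_length_aa = 415

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS is a whole number of codons; the last codon is the stop codon,
# which is not counted in the protein length.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # prints: 1248 0 415
```

The strand field does not affect this check, since the span is the same on either strand.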
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.00000165811 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | GTGCACATAA GAAAACCAAC TCAAGTAAAA GCTCTTTCCT TTAAGGTTAT TGGTTTTATT TGCTTGTCTA GCATTATTGG TGTAAACGCG CAGCAAGTCA TTGAAGAGAG ATCAGTTCCC TTAGATACCT CATACTTAGA ATCAAAAAAT GAACTTGAGG ACTATATTTT AGATACTGGA GATGTATTGA ATATTGAATT TGTGAATGTT CCTGAACTTA ATGGCTTATT TAAAATTAAT GAGCTAGGAG AGATATATTT TAAAAGAATA AAATCTACTT ATGTTAGAGG TCTAACCATT AATGAACTAA CACAATTACT AGAGGAACGT TATAAAGAAT TTCTTGTAAA CCCAGAGATT TATATAAGAA TTAATACATA CAAGTCTATT AGAGTTTCTA TAAGAGGTGA GGTGAAAGCA CCTGGAGTGA TATCACTTCC TGCTTATATT TCAACATCTT TTGCAACATC CTTAGATGTT TTTGATAATA AACAATCAAG CTTAGATTCT GATAATAACA TCAGCAAAAG AAATAAAAAC TCAAGCTATC TATCATTGTC TACCAACAAA AATGTGAATG GAGATTCTTT AATTAATTCT AACAATTTAA TTAAAAGAAA TAATGATTAT ATAACTACTC TCTCAAATGC AATTCAAAAA GCAGGTGGTC TAACTTCTTC TAGTGATATT AGCAAGTTAG AAATTACCAG AGAAATACCT CTTGGGAACG GAGGTGGCAA AAAACGAGCG ATAGTTAACT TTCTACCTTA TATCCGAAAT GCAGACGCCT CATCAGATAT GAGACTATTT GATGGAGACG ATATCTTTAT TCCTCGTCTT AAAGAAAAAG ATCTAACTAT TATTCCTGAC TCAATACTGT CCGGTCTATC TCCCAGGTTT ATAAATGTAT CAGTTGGGGG GCGAATAGAA AATCCAAGTA CCGTAAAGAT TCCAATTGAA GGAAGTCTTT CTGATGCAAT GAATTTAACA GGTCCAAGGA AGCCTTTGTC AGGAGAAATT TATTTAATTA GATACAATCA AGACGGAACT TTATTAAGAA AAGGTATTAG TTATTCTTCA AGTGCCCCTC CAGGATCTCC AAAGAATCCA TACTTATTAG CTGGTGATTC AATAACTGTT AAAAATAGTA TTTTAGGAAG AACATCCGGA ACATTAAGAG CAATAACTGA ACCATTTGCG GGCATTTTTG CCACCAAAGA GTTAATGGAG GGTCTCTACG AGAAATAA
Protein sequence | MHIRKPTQVK ALSFKVIGFI CLSSIIGVNA QQVIEERSVP LDTSYLESKN ELEDYILDTG DVLNIEFVNV PELNGLFKIN ELGEIYFKRI KSTYVRGLTI NELTQLLEER YKEFLVNPEI YIRINTYKSI RVSIRGEVKA PGVISLPAYI STSFATSLDV FDNKQSSLDS DNNISKRNKN SSYLSLSTNK NVNGDSLINS NNLIKRNNDY ITTLSNAIQK AGGLTSSSDI SKLEITREIP LGNGGGKKRA IVNFLPYIRN ADASSDMRLF DGDDIFIPRL KEKDLTIIPD SILSGLSPRF INVSVGGRIE NPSTVKIPIE GSLSDAMNLT GPRKPLSGEI YLIRYNQDGT LLRKGISYSS SAPPGSPKNP YLLAGDSITV KNSILGRTSG TLRAITEPFA GIFATKELME GLYEK
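The protein sequence can be related back to the gene sequence by translating it with translation table 11. The following is a minimal sketch rather than the database's own method: it assumes the gene sequence is already shown in coding orientation (it begins with GTG and ends with TAA, which is consistent with that), it uses the codon assignments of the standard code (which table 11 shares), and it reads a recognized table 11 initiation codon as Met. The function name `translate_cds` and the constants are hypothetical helpers introduced for illustration.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence; spaces and line breaks are ignored."""
    seq = "".join(dna.split()).upper()
    if not seq or len(seq) % 3 != 0:
        raise ValueError("sequence length must be a positive multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    # An initiation codon is read as Met regardless of its usual meaning.
    if codons[0] in START_CODONS:
        residues[0] = "M"
    # Drop the terminal stop codon symbol, if present.
    return "".join(residues).rstrip("*")


# First three blocks of the gene sequence above.
print(translate_cds("GTGCACATAA GAAAACCAAC TCAAGTAAAA"))  # prints: MHIRKPTQVK
```

The output matches the first ten residues of the protein sequence above. Passing the full gene sequence should yield a 415-residue string for comparison with the protein sequence field.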