Gene Information |
Locus tag | P9211_14951 |
Symbol | |
ID | 5730416 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 1338307 |
End bp | 1339515 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 641285873 |
Product | hypothetical protein |
Protein accession | YP_001551380 |
Protein GI | 159904036 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03573] N-acetyl sugar amidotransferase |
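The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for locus tag P9211_14951.
length = gene_length(1338307, 1339515)
print(length, protein_length(length))  # prints: 1209 402
```

Both results match the Gene Length (1209 bp) and Protein Length (402 aa) fields in the record.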
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0184708 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAAAGT CACTCTTAAA TATTAGCAAT CTTCCCAAAC TTCCACCCTT AACAGATATT GAGAAGCAAC TTCTAGAAGA AAAAGTAGAT ATTGATTCAA AGTATAAATT GCCAAAAGAA ATAAAATTGT GTCATAAATG TGTAATTACA AACCAACGCC CAAGGATCAC TATTAATGAA GATGGTATAT GTAATCCATG CAAATACTGG GCGAGAAAGC ATTCCTCATT TGACTGGAAT TCATTAGCAG ATGAATTTCG AGAGCTGTGT GACAAGTATC GCTCATCAGA TGGTTCATAT GACGTATTAG TTCCTTCTAG TGGAGGTAAA GATAGCTCCT ATGTTGCCTA TAGATTAAGA GATGAATACG ATATGCATCC TCTTACAGTT ACATGGTCAC CTTCTTTATA TACAGAGATA GGGTTTGAAA ACTTTCAGAA CCATATACAT CATGGCTTAG ACAATGTTTT AGTAACAGCG AATGGATTGG TTCATAGGCG ACTATGTAGA AGTTCAACAA TTATTATGGG TGATCCTTTT CAACCTTTTG TATATGGTCA ATGCAATGTT CCATTAAGAA TAGCCAAAGC CTATGATATC CCTTTAATAG TTGATGGAGA GAATGGAGAA GTTGAGTATG GAGGAGATGA CAATACAGAA CAATTGACTG GTTTTCAGAA TGATGAATCA GTTGAGTTTT GGCAATCTGG TATGGCAGTT GAGGAATGGC AAAAATATGG TTATTCCGAT TCCGAGTTGT TTATTTATCA ACCACCTAAG CAGCAAATTA ATGTTCGTAG AGTATTTTTT AGCTATTACC ATAATTGGAT GCCCCACGAC CATTACTATT ACGCAAGTCA AAATGCAGGT TTCGTCTCTA ACCCTGATAG ATCTGAATGT ACTTTTTCCC GCTATGCAAG CCTTGATGAT TCAATAGATC CATTTCATTA TTACTTTGCG TTGCTAAAAT TTGGTATTGG AAGAGCGACC TCGGATGCCG CCCATGAATT AAGAGAAGGG GTTCTAGAAA GAGATGAAGC GATTCAATTA GTTAACAAGT TTGATTGCCA AGCTCCATCT AAAGAAACTA CCGAGATTTT TCTGAAATAC TGTTCTATAG ACAAAGGTGC TTTACAGAAA ATCGTGGATA GATGGACTAA TAGCAGAATA TGGTCTGCTA GAAATGACTT ACCTTCTCTT CAATTCTAA
|
Protein sequence | MRKSLLNISN LPKLPPLTDI EKQLLEEKVD IDSKYKLPKE IKLCHKCVIT NQRPRITINE DGICNPCKYW ARKHSSFDWN SLADEFRELC DKYRSSDGSY DVLVPSSGGK DSSYVAYRLR DEYDMHPLTV TWSPSLYTEI GFENFQNHIH HGLDNVLVTA NGLVHRRLCR SSTIIMGDPF QPFVYGQCNV PLRIAKAYDI PLIVDGENGE VEYGGDDNTE QLTGFQNDES VEFWQSGMAV EEWQKYGYSD SELFIYQPPK QQINVRRVFF SYYHNWMPHD HYYYASQNAG FVSNPDRSEC TFSRYASLDD SIDPFHYYFA LLKFGIGRAT SDAAHELREG VLERDEAIQL VNKFDCQAPS KETTEIFLKY CSIDKGALQK IVDRWTNSRI WSARNDLPSL QF
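The gene and protein sequences above can be related with a short translation sketch. It assumes the sequence is pasted as shown, in space-separated blocks of 10 characters, and it uses the standard codon assignments, which translation table 11 shares for internal codons. Table 11 also allows alternative start codons; this sketch does not handle them, which is harmless here because the gene starts with ATG. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Standard codon table built in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINOS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINOS
    )
}


def clean(seq: str) -> str:
    """Remove whitespace between the 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases of the gene sequence in this record.
print(translate("ATGAGAAAGT CACTCTTA"))  # prints: MRKSLL
```

Passing the full Gene sequence value to `translate` should reproduce the Protein sequence field, and `gc_fraction` on the same input can be compared against the GC content field.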