Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_21281 |
Symbol | |
ID | 4777116 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 1891477 |
End bp | 1893702 |
Gene Length | 2226 bp |
Protein Length | 741 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640087636 |
Product | hypothetical protein |
Protein accession | YP_001018128 |
Protein GI | 124023821 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.419121 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTCAT CTGAAAAAAT TCGATACATC TCATTCTCTG GTGGTGGCTG GAGTTCACAC ACAGCAAGTT CAGCGTGGAT AAGCGGGGCA CTTCAGGCTT TAAGAAACAA AAAAAAGTCT GAAGATGTCG ATTTTGATGA TCTACTGGGT GGATTGCATG GCGCTGCTGG TAATTCAGGA GGTAGTTGGT TTGTATCGAT GCTTGCATAT TCAGAAGAAT TCAAAGAAGG TCTGAACAAC GTCAACGGTT CAGCGGATTG GTTCTCAAGA GGATATATGG GTCAAGTTAA AGATCAATTT GGTCGTGCTG ATGACGACAA TGGAAAACAG CAAATGATCA ATGATCTCTC GAGTGAGATC ATTGAGGCAA TCTTGATGAC AAGTTCAAAG ATTTCTGATC TTCCTGAAAA TTTTAAGAAG TTATTAATTG AATATCTTTC TGCAAAGATA AAGCCTTATG TAAGCGATGC TGTTGAATTG CTACATAATG ATTACTACGT AAACATCGCT TCACTTGGCA GTAAAGAGCT AAAGCCAGAT TGGTCTGAGG TTACAAAGAA TCTTGTTTAT AGTCCTTATG ATATGAAAGA TCAACTCAAT AGACCATTAG GAGAAAATCC AAACAAATGG GCAGAAGGTA AAGACATCAT CATCTCTGCA TCAATGCCAA CAAAAAATGT AATCGTAACA GGAAAAACCT TTGACATAGA TAATTTCCTC TCTAGCAGTT CATTCACAGA GAATTTCAAC CTCCCAAAAG AAAGTTTCGC CAGCCCAGTC ACAATAGAGT TAGATGGTGG ATCTGCTGGC ACCTCATGGT TAAAATTTCT AGCCGGGGAT CATCAACTAT CTTATCTTCA AACTGATATA AGTTGGGACG AAAAAGCATC CAGAGAAGAA TCCATGGAAA CTTTCCTAGA TTCAAAGCTA TCAATTATTG ATGCTGCAAC AATTTCTTCT TCAGCAGCAG GAATGCTCTC TTCAGTAGAT TTCGTCAATA AAGCATTGGG TACAAAATTA GATGAAGTAG ATTTCCTTAA AGACATAAAT CTAGGGCTAG ACAGCGATGC TATTGGCAAA CTCAGCGATC TATTATCTCC CTACTTAGCA AGATATTTAG ATGGGTTATC AATACCATTA AGTCTTAGCA ACTCCGAAGC TTCAGCTGAA ATCCCTGCAA GGGATGCTAG CTTACAATCA ATTGCTGAAA GTCAGAGCTA TCGCTTAGTA GACGGCGGTT ATGTTGACAA CACAGCTGTA GCCAATATCG TCAGCTCAAT GCAGGATGAG TACTCCAATG AGCAAGAGTT TTCTATTGCC TTATTTTCAA ATTCAACAGG GGGAGACAAG ACAATACCAA ATTCACACTT AAACCTAACG GAAGATGTAG CATTACTATT TGGGGACCCT AAAGAAGGCT CAGCAAAAAC TTGGGACAAG GGTACCGGAT ACGGGTTCAC AATTAAAACA GTAAGTCCAC ATATTTTCGA CATAAAAGCC TGGGAGAATG TAGATGCTCC AGCATGGTCA TATACGTCAG AAAAATACGG AGAAGGCAAA GAAAAAAGAT TGCAGTATTA TCGTTTAGAT GTTGAAACCA TCAGCAATGA TGCTTTCAAC ATTGAAGCTG GCTACAAAGG GGAGCTTCAT GTTTTTGTAA ATGTCGACAA ACAATCCAAT GCAGCACCCG TAGACCTAAG CATGTTTAAT GTCTATAGTG AGATGTTCGG CACTACTCAA GATGCAATTC TCAAAGGAGA TCTCATCCAT GGGCCTGGAG CCGATTTTCT ACTTAATGCA TTAGGACTAA ATCTGGAAAA CAAAGGCAGT TATCAAGCAA GCCTACTAAA AGGTGTCTCG AATGATGATT ACATTTCAGG CAGTGATGGT AATGATATTC TCATCGGAAA TCATAAGAAG AATAAGCTTA CGGGTGGCAG AGGTAGTGAT ATCTTTAAAT ACCAAACCAT CAGTGACTCC AGACCGGGAG AAGAGTCTCG TGATTTGATC ACAGACTTCT CAAGCAGAGA AGAAGACAAA ATAGACTTAT CTATGATTGA GGCAGACTTA TCATTCATTG GATCAGATGA ATTTAGTGGC GAAAATGGTG AGGTCAGGTT CTATAATGGC ATGCTAATCG TCAATATAAA AGGGGCTTCT GATCCAGAGA TGGAGATCCA GCTTGAAGCT GTCAATACCT TCAATCCAGA ACACCTGATT TTGTAG
|
Protein sequence | MASSEKIRYI SFSGGGWSSH TASSAWISGA LQALRNKKKS EDVDFDDLLG GLHGAAGNSG GSWFVSMLAY SEEFKEGLNN VNGSADWFSR GYMGQVKDQF GRADDDNGKQ QMINDLSSEI IEAILMTSSK ISDLPENFKK LLIEYLSAKI KPYVSDAVEL LHNDYYVNIA SLGSKELKPD WSEVTKNLVY SPYDMKDQLN RPLGENPNKW AEGKDIIISA SMPTKNVIVT GKTFDIDNFL SSSSFTENFN LPKESFASPV TIELDGGSAG TSWLKFLAGD HQLSYLQTDI SWDEKASREE SMETFLDSKL SIIDAATISS SAAGMLSSVD FVNKALGTKL DEVDFLKDIN LGLDSDAIGK LSDLLSPYLA RYLDGLSIPL SLSNSEASAE IPARDASLQS IAESQSYRLV DGGYVDNTAV ANIVSSMQDE YSNEQEFSIA LFSNSTGGDK TIPNSHLNLT EDVALLFGDP KEGSAKTWDK GTGYGFTIKT VSPHIFDIKA WENVDAPAWS YTSEKYGEGK EKRLQYYRLD VETISNDAFN IEAGYKGELH VFVNVDKQSN AAPVDLSMFN VYSEMFGTTQ DAILKGDLIH GPGADFLLNA LGLNLENKGS YQASLLKGVS NDDYISGSDG NDILIGNHKK NKLTGGRGSD IFKYQTISDS RPGEESRDLI TDFSSREEDK IDLSMIEADL SFIGSDEFSG ENGEVRFYNG MLIVNIKGAS DPEMEIQLEA VNTFNPEHLI L
|
| |