Gene Information |
Locus tag | P9515_12881 |
Symbol | |
ID | 4718894 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9515 |
Kingdom | Bacteria |
Replicon accession | NC_008817 |
Strand | + |
Start bp | 1131565 |
End bp | 1133268 |
Gene Length | 1704 bp |
Protein Length | 567 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640080972 |
Product | hypothetical protein |
Protein accession | YP_001011602 |
Protein GI | 123966521 |
COG category | [S] Function unknown |
COG ID | [COG0397] Uncharacterized conserved protein |
TIGRFAM ID | |
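The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates (an assumption, though it is consistent with the values in this record) and a single terminal stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one untranslated terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1131565, 1133268)
print(length, protein_length(length))  # 1704 567
```

Under these assumptions the result agrees with the Gene Length (1704 bp) and Protein Length (567 aa) fields.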
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.446292 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCACAAA AACCTGAATC ACTTAAAGAT AAAGAAACTA AAAATTTTTC TGAGTTTTCA CAATTAGCAG ATTTTTCTTT AATGAATTCT CTCAAAGCTG ATCCTCATTC AACAAAAAAC GGTAATGATC ACAGGCCTAG GCCAGTTTAT TCAGGTCATT ATGTTCCTGT CACCCCAACA CCAATTCCAG AACCAAAATA TATCGCTCAT AGCAAAACAT TGTTTGATGA ATTGGGATTA AGTTCTGATC TTACAAAAGA TAAAAAGTTT TGTAGTTTTT TTTCAGGAGA TATCGAAGTT TCAGAGTACC CAATGCGACC CTTTGGCTGG GCAACGGGGT ATGCATTGTC TATTTATGGC ACTGAATATA CTCAGCAATG TCCTTTTGGG ACCGGTGATG GATATGGGGA TGGTCGAGCA ATTTCTGTAT TCGAAGGTTT ATTTAATGAA AAAAGAATGG AGATGCAACT AAAGGGAGGT GGTCCTACAC CTTATTGTCG TGGTGCAGAT GGCAGAGCAG TCCTACGATC AAGTGTTCGC GAATTTCTTG CACAGGAATT CATGCATGCC TTGGGAATTC CCACTTCAAG ATCTTTAATA CTTTATGTAT CAGGAACAGA AATAGTTAGA AGACCATGGT ACTCAAAGGG TTCAAAGTAC TATGAGCCTG ATATCATGCT TGATAATCAT GCGGCGATTA CTACTCGTGT TGCTCCATCA TTTTTACGTG TAGGCCAGAT TGAGCTATTT GCTCGCAGAG TTCGAAGTAA TTCTCATAAT GAAGCTTTTA ATGAGCTAAA ACTTATAGTG CAACACCTTA TAGATAGGAA TTATAAAGAA GAAATTGATC CCAGCAAATC ATTTGCTGAG AAGATTATTA AGTTGGCTTA TTTATATCGA GAAAGACTCA TATCACTTGT CAGTAATTGG ATGAGGGTTG GTTATTGCCA AGGGAATTTT AATAGCGATA ATTGTGCGGC TGGAGGATAT ACTTTGGACT ATGGTCCTTT TGGTTTTTGT GAATTATTTG ACCCAAGATT TCAGCCTTGG ACTGGAGGTG GAGAACATTT TTCATTTTTC AATCAGCCTT TTGCTGCAGA AATTAACTTT AAGATGTTTT GTTTATCTCT CAAACCTTTA CTTTTAGAAA ATAAAAAAGA CATAGAAAAA TTAGAGCAAA TCAAAAATGA TTTTTCTAAA GTAATGAACA AAAAAATTCA ATTAATGTGG GCACGAAAGC TTGGTTTAGA AAAATACGAA GAAACTCTTA CCCACGAACT TTTCAATCTT ATGTTTACCT CTAAGGCTGA CTTCACTATT TTCTTCAGAA AGCTTTCCAA TATTCCCGAA AATATATCTT CTCTAAAAAA GAGTTTTTAC GTACCTCTTA CTGATGAACT TGAACAAAAA TGGAACATTT GGCTCAAAAA ATGGCAAGAC TTTATCAAAA AAGAGATTGA TATTAAAGAA ATATCAAAAT CAATGAGACA AGTAAACCCA AAATTTACTT GGCGTGAATG GATGATAGTT TCTGCATATG AAGATGCTGA GGAGGGTAAC TATAGCAAAA TAAAAGAATT ACAAACTATT TTAAGCAATC CATATGAAGA GCAATCTTTG GAAACAGAGC AAAAATACGA TCGCCTAAAG CCAAATCAAT TTTTCAATTA TGGAGGTATT TCACATTACA GTTGTTCATC TTAA
|
Protein sequence | MSQKPESLKD KETKNFSEFS QLADFSLMNS LKADPHSTKN GNDHRPRPVY SGHYVPVTPT PIPEPKYIAH SKTLFDELGL SSDLTKDKKF CSFFSGDIEV SEYPMRPFGW ATGYALSIYG TEYTQQCPFG TGDGYGDGRA ISVFEGLFNE KRMEMQLKGG GPTPYCRGAD GRAVLRSSVR EFLAQEFMHA LGIPTSRSLI LYVSGTEIVR RPWYSKGSKY YEPDIMLDNH AAITTRVAPS FLRVGQIELF ARRVRSNSHN EAFNELKLIV QHLIDRNYKE EIDPSKSFAE KIIKLAYLYR ERLISLVSNW MRVGYCQGNF NSDNCAAGGY TLDYGPFGFC ELFDPRFQPW TGGGEHFSFF NQPFAAEINF KMFCLSLKPL LLENKKDIEK LEQIKNDFSK VMNKKIQLMW ARKLGLEKYE ETLTHELFNL MFTSKADFTI FFRKLSNIPE NISSLKKSFY VPLTDELEQK WNIWLKKWQD FIKKEIDIKE ISKSMRQVNP KFTWREWMIV SAYEDAEEGN YSKIKELQTI LSNPYEEQSL ETEQKYDRLK PNQFFNYGGI SHYSCSS
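The sequences above are displayed in space-separated blocks of ten characters. As a hedged illustration of how the gene sequence relates to the protein sequence and to the GC content field, the sketch below strips the spacing, computes percent GC, and translates codons using the amino acid assignments of NCBI translation table 11. Table 11 differs from the standard table mainly in its permitted start codons; since this gene starts with ATG, that difference is not modelled here. All function names are hypothetical, and only the first 30 bases of the gene sequence are used as input.

```python
# NCBI codon ordering: bases in the order T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
prefix = "ATGTCACAAA AACCTGAATC ACTTAAAGAT"
print(translate(prefix))  # MSQKPESLKD
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value that can be compared against the GC content field.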