Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_19571 |
Symbol | |
ID | 4777314 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1721601 |
End bp | 1723607 |
Gene Length | 2007 bp |
Protein Length | 668 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640087467 |
Product | hypothetical protein |
Protein accession | YP_001017964 |
Protein GI | 124023657 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.258176 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAATCG CTGCGCAGCT GGTGGACCTG CCAATTGATC ATTTCCGCTT GCTGGGTGTC AGTCCTTCGG CAGACAGTGA GGCGATTTTG CGGGCCTTGG AGTTGAGGTT GGATCGCTGC CCTGACCAAG GTTTCACCCA TGAGGTCTTA ATTCAGCGGG CAGAATTGTT GCGGCTTTCA GCAGATTTGC TGACCGATCC GCCACGGCGT CAGGCCTATG AGACTGCCTT GTTGGAGCTC AGTCGTGATC ACCCAGGCGA GACCGCCGGT CTTGATGTGT CACCTAGTAG AGAGGTGGCA GGGCTGATCT TGTTGTTTGA AGCGAACTCT CCTCATGAGG TCTTTCATCT CGCCTCTCAG GGATTGCAGC CGCCCCAGTC CCCGACGCTA GGTAGCGAAC GAGAAGCTGA CCTCGCTTTG TTGTTAGCGC TGGCCTGCCG GGCTGCAGCC GCTGAGGAAC AGGAACAACG GCGTTATGAA GCAGCAGCGT CTCTTTTGCA TGACGGGATC CAGTTGTTGC AGCGGATCGG CAAGCTCTCC GAAGAGTGCC TCAAGCTTGA GAAAGATTTA GATGTCCTTC TGCCCTATCG CATTCTCGAC TTATTGAGTC GGGATCTTGG TGATCAGGTC TCTCACCAGG AAGGACTGCG CCTACTTGAC AACTTTGTGA GCCAGAGAGG AGGCCTTGAG GGAACGGTCC CATCGCCTGC ACCTGGGGGT CTTGATCAAT CCGAATTTGA CAACTTCTTC AAGCAGATCA GAAAGTTTTT GACTGTTCAG GAACAGGTTG ATCTTTTCCT GCGCTGGCAG CAAGCCGGAT CAGCAGATGC GGGTTTCCTT GGTGGGTTGG CTCTTGCTGC GGTTGGATTT TCGCGTCGGA AGCCTGAACG GGTGCAGGAA GCTCGGCAGC ACTTAGAGAG TCTTGAACTG GATGGATTCG ATCCGTTGCC GATGCTGGGT TGCTTGGACC TCTTGCTCGG AGATGTAGGC CGCGCTCAGG AGCGCTTTCT GCGCAGTACT GATCCTCGAG TGAAGGATTG TCTTGACAGC CACCCTGGCG ATGAATTGGC TGCTTTTTGC GAATACTGCC GCTCTTGGCT GGGAGGGGAC GTGCTTCCCG GTTATAGGGA TGTGGATGCT GAGGCCGTTG ATCTGGAGGC TTGGTTTGCT GATCGGGATG TTCAGGCCTA TGTGGAACGC CTGGAACGCA GCGAAAATCG TGCTTCGTCT TTAGGTAAGG CCTTCGCAGG ATCGTCTTTG AAGCAACCCT TCCCTTGGGC TCCTCTTCAT CCCGATGGGA TTTTGCCTCT CTCTCTTGGT GGGCCTGAGG TTGGTCAACC TGCAGCTGAC CCAAGCTCTG ATGAGTTTGC CAGCGATGGC ATTGCATGGA TTGATCGTTT AGCAGATCTG CCACGTCAGA CGCGGCCGGT GCTGATCGGT TCTGTTGTCT TTGCGGCCCT GATTGCAGCC TTTGCAGGCT TCAGCTTGTT TGGCCAACGT CCTCGTCCTT CAGTCAGCAC GGCTGCTGAT CAGCCTCAAG TCACAGCACC TCCTACCGCC ACACTGCAAG AGGCGGTCCT CATGCCTCAA GTCCCTGTCA GCGCTGTGGT TGAGCCGCTT ACTTTGGAGC AGCCGAATCA GGCACAGCTC AAAGACCTGC TTCAGGCCTG GCTCAGCAAC AAGGCAGTCG TGCTTGCCGG TGGCAAGAGT GATGCTCTGC CTGAGGTCGC AAGAGATCCA TTAGTGCAGC GCGTGGCGCA AGAGCGTGCC AGGGATGCTG CTTTAGCTGA GACCCAGAAG GTTGTGGCCA GCATCAGCTC TGTAGAGGTG GTGAGTCGAA CGCCGCAGCG TATTGAGCTG AATGCAGTTG TGACCTATCG CGATCAACGC GTTGATGCTG CCGGCAAGGT TGTTGACCAA ACGCCGCAAA AGGATCTCTC GGTGACTTAC ATCCTTGGTC GTGATCCCGA TCGTTGGCGC CTGCATGAAT ACATCAGCGG CAAATAA
|
Protein sequence | MPIAAQLVDL PIDHFRLLGV SPSADSEAIL RALELRLDRC PDQGFTHEVL IQRAELLRLS ADLLTDPPRR QAYETALLEL SRDHPGETAG LDVSPSREVA GLILLFEANS PHEVFHLASQ GLQPPQSPTL GSEREADLAL LLALACRAAA AEEQEQRRYE AAASLLHDGI QLLQRIGKLS EECLKLEKDL DVLLPYRILD LLSRDLGDQV SHQEGLRLLD NFVSQRGGLE GTVPSPAPGG LDQSEFDNFF KQIRKFLTVQ EQVDLFLRWQ QAGSADAGFL GGLALAAVGF SRRKPERVQE ARQHLESLEL DGFDPLPMLG CLDLLLGDVG RAQERFLRST DPRVKDCLDS HPGDELAAFC EYCRSWLGGD VLPGYRDVDA EAVDLEAWFA DRDVQAYVER LERSENRASS LGKAFAGSSL KQPFPWAPLH PDGILPLSLG GPEVGQPAAD PSSDEFASDG IAWIDRLADL PRQTRPVLIG SVVFAALIAA FAGFSLFGQR PRPSVSTAAD QPQVTAPPTA TLQEAVLMPQ VPVSAVVEPL TLEQPNQAQL KDLLQAWLSN KAVVLAGGKS DALPEVARDP LVQRVAQERA RDAALAETQK VVASISSVEV VSRTPQRIEL NAVVTYRDQR VDAAGKVVDQ TPQKDLSVTY ILGRDPDRWR LHEYISGK
|
| |