Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_19231 |
Symbol | |
ID | 4718663 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 1666179 |
End bp | 1668035 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 640079658 |
Product | hypothetical protein |
Protein accession | YP_001010312 |
Protein GI | 123969455 |
COG category | [R] General function prediction only |
COG ID | [COG0661] Predicted unusual protein kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.494256 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGAAG ACTTTACTGA TTTTATTGAG GTATCTGGAC TCTTAAATTA TGATCCAGAT ACAATTTCTA AAATTTACAA AAAAAATCCT AAAAGACTTT TAAAAAGGCT TTGGCAAACA CTCCTACCTA TTTTTGCTTA CATCTTTTCC GTTGGATGGG ATAAATTAAC TGGAAGGCTG AAAAATAAAC AGCAAGCAAG ATTTAGAGCA AGAGAATTAA CAAATTTGTT AGTAGAACTT GGACCTGCAT TTGTTAAAGC AGGCCAAGCT TTATCAACAA GACCAGATAT AATCCCAGGC ATTCTTCTAG AAGAATTATC TGAATTGCAA GATCAGCTCC CAGGTTTTGA TAGCGATAAA GCTATGGAAT TAATAGAAGA AGATTTAGGA AACAAAATAG ATGAGATTTT TTTAGAAATT GATAAAGAGC CAATTTCTGC TGCTTCTTTA GGTCAAGTAC ATAAAGCTAA ATTAAAAAAC GAAGAGATCG TTGCAATAAA AGTACAAAGG CCAGGTTTAA GAGAACAAAT AACCTTAGAC CTTTACATTG TAAGAAATAT TGCTTATTGG CTAAAAAACA ATATCGGATT GATAAGAAGT GATCTAGTTG CTTTGATTGA TGAATTAGGC AAGAGGGTTT TTGAAGAAAT GGATTATTTA AACGAAGCTG CAAATGCAGA AAAATTTAGA GATATGCATA AACATAACAA GATGATTGCC GTACCAAAAA TTTATAAAGA AATAACTTCA AGAAGAGTTT TAGCAATGGA ATGGATAGAC GGTACAAAAT TAACAAATTT AGAGGATGTA AAAAAATTAG GAATTAATCC TGATGACATG ATTGATATAG GGGTGCAATG CAGTTTAGAA CAGCTTTTAG AACATGGTTT TTTTCATGCA GACCCGCATC CAGGTAATTT ATTAGCCTTA GAAGATGGAA GATTATGTTA TCTAGATTTT GGAATGATGA GCGAGGTTTC CAGAGAATCT AGGTCAGGAT TAATTCAAGC AGTAGTTCAC TTAGTAAATA AAAACTTCGA TAAATTGTCT CAAGATTTCG TAAAATTAGG ATTTTTATCA GAGGAAGTTA ATCTAGAACC TATTGTTCCA GCATTTCAAG ATGTTTTCAT TAACGCCGTT GAACAAGGAG TTTCGAAAAT GGATTTTAAG AGCGTTACTG ACGATATGTC TGGTGTTATG TATAAATTCC CTTTCAGACT ACCCCCGTAT TATGCTCTTA TAATTAGATC ATTACTTACA TTAGAGGGAA TAGCTTTAAG CGTAGATCCA AACTTCAAAA TATTAGGAGC AGCTTATCCA TATTTTGCAA GAAGATTGAT GGAAGACCCT GATCCACAAT TGAGGGAAAG CCTTAAAGAA ATGCTTTTTG ATAATAAAAA ATTTAAATGG GATCGTTTAG AAGATCTACT TTCTAACGCT GCAAAGCAAA CAAATCTCGA TTTAGAAAAA CTTTTAGACG AAGTTATAAA TCTTCTCTTT TCTCCAACTG GAGGATTTCT TAGAAATGAG ATAGTTGAAG GTTTAACAAA TCAGATAGAT TTACTTAGTC TAAAAATATT GAAAAGTTTA AATAATTATC TTCCACAATC AATTAAATTA AATACTACTA ACGAAAATAA TAACTTGAGT GACCTTATAA TGTATGTTGA GCCATTGAGA AACTTTTTAG AGATTTTACA AAAAGTACCG GGGTATTCAA TTGACATTTT TCTAAGAAGA GTGCCAAGAC TTATTAATGA GCCCTATACA AAAGAAATGG GTATAAAAAT AGCAAAAAAA GTAACTGAAA AAGGAGTAGT AAGACTTGTT AAGATTGCCG CTGGTGCAAA TATATAA
|
Protein sequence | MKEDFTDFIE VSGLLNYDPD TISKIYKKNP KRLLKRLWQT LLPIFAYIFS VGWDKLTGRL KNKQQARFRA RELTNLLVEL GPAFVKAGQA LSTRPDIIPG ILLEELSELQ DQLPGFDSDK AMELIEEDLG NKIDEIFLEI DKEPISAASL GQVHKAKLKN EEIVAIKVQR PGLREQITLD LYIVRNIAYW LKNNIGLIRS DLVALIDELG KRVFEEMDYL NEAANAEKFR DMHKHNKMIA VPKIYKEITS RRVLAMEWID GTKLTNLEDV KKLGINPDDM IDIGVQCSLE QLLEHGFFHA DPHPGNLLAL EDGRLCYLDF GMMSEVSRES RSGLIQAVVH LVNKNFDKLS QDFVKLGFLS EEVNLEPIVP AFQDVFINAV EQGVSKMDFK SVTDDMSGVM YKFPFRLPPY YALIIRSLLT LEGIALSVDP NFKILGAAYP YFARRLMEDP DPQLRESLKE MLFDNKKFKW DRLEDLLSNA AKQTNLDLEK LLDEVINLLF SPTGGFLRNE IVEGLTNQID LLSLKILKSL NNYLPQSIKL NTTNENNNLS DLIMYVEPLR NFLEILQKVP GYSIDIFLRR VPRLINEPYT KEMGIKIAKK VTEKGVVRLV KIAAGANI
|
| |