Gene Information |
Locus tag | P9301_19041 |
Symbol | |
ID | 4912451 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | + |
Start bp | 1638172 |
End bp | 1640028 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 640161510 |
Product | hypothetical protein |
Protein accession | YP_001092128 |
Protein GI | 126697242 |
COG category | [R] General function prediction only |
COG ID | [COG0661] Predicted unusual protein kinase |
TIGRFAM ID | |
|
|
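The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon; the function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length(1638172, 1640028)
print(length, protein_length(length))
```

With the values from this record, the result should agree with the Gene Length (1857 bp) and Protein Length (618 aa) fields.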
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGAAG ATTTTACTGA TTTTATTGAG GTATCTGGAC TCTTAAATTA CGATCCGGAT ACAATTTCTA AAATTTACAA AAAAAATCCT AAAAGACTTT TAAAAAGACT TTGGCAAACA CTCATACCTA TTTTTGCTTA CATTTTCTCC GTGGGATGGG ATAAATTCAC TGGAAGATTA AAAAATGAAC AGCAAGCAAG ATTTAGAGCA CGAGAATTAA CAAATTTATT AGTAGAACTT GGGCCTGCAT TTGTTAAGGC AGGCCAAGCT TTATCAACAA GACCAGATAT AATCCCAGGG ATTCTTCTAG AAGAATTATC TGAATTGCAA GATCAACTCC CCGGGTTTGA TGGCAATAAA GCTATGGAGT TAATAGAAGA AGATTTAGGA TACAAAATAA ATGAGATTTT TTTAGAAATT GATAAAGAAC CAATTTCCGC TGCTTCTTTA GGGCAAGTAC ATAAAGCGAA ATTAAAAAAC GAAGAGATCG TTGCAATAAA AGTTCAAAGG CCAGGATTGA GAGAACAAAT AACCTTAGAT CTCTATATAG TTAGAAATAT TGCTTATTGG CTAAAAAACA ATATCGGATT AATAAGAAGT GATCTAGTTG CTTTGATTGA TGAATTAGGC AAAAGAGTTT TTGAGGAGAT GGATTATCTA AACGAAGCTG CAAATGCAGA AAAATTTAGA GATATGCACA AACATAACAA AATGATTGCT GTACCAAAAA TTTATAAAGA AATAACATCA AGAAGAGTAT TAGCAATGGA ATGGATAGAC GGTACAAAAT TAACAAATTT AGAAGACGTA AAAAAATTAG GAATTAATCC TGATGAAATG ATTGATATAG GAGTGCAATG CAGTTTAGAA CAGCTTTTAG AACATGGTTT TTTTCATGCA GACCCACATC CAGGTAATTT ATTAGCCTTA GAAGATGGAA GATTATGTTA CTTAGATTTT GGAATGATGA GCGAGGTTTC TAGAGAATCA AGATCGGGAT TAATTCAAGC AGTAGTACAT TTAGTAAATA AAAACTTCGA TAAATTGTCT CAAGATTTCG TAAAATTGGG ATTTTTATCA GAGGAAGTTA ATCTAGAGCC CATTGTTCCA GCATTTCAAG ATGTTTTCAT TAACGCCGTT GAACAAGGAG TGTCGAAAAT GGATTTTAAA AGCGTTACAG ACGATATGTC TGGTGTTATG TATAAATTCC CTTTCAGACT ACCACCATAT TACGCGCTTA TAATTAGGTC ATTACTTACA TTAGAAGGAA TAGCTTTAAG CGTAGATCCA AACTTCAAGA TATTAGGCGC GGCTTATCCA TATTTTGCAA GAAGATTGAT GGAAGATCCT GATCCACAAT TAAGGGAAAG TCTTAAAGAA ATGCTTTTTG ATAATAAAAA ATTTAAATGG GACCGTTTAG AAGATCTACT TTCTAACGCT GCAAAGCAAA CAAATCTCGA TTTAGAAAAA CTTTTAGACG AAGTTATAAA TCTTCTCTTT TCTCCAAATG GAGGATTTCT TAGAAATGAG ATAGTTGAAG GTTTAACAAA TCAGATAGAT TTATTTAGTC TAAAAATATT GAAAAGTTTG AATAACTACC TTCCACAATC AATTAAATTA AATACTATCA ACGAGAATAA TAACTTAAAT GACCTTATAA TGTACGTAGA GCCATTGAGA AACTTCTTAG AGATTTTACA AAAAGTCCCC GGGTATTCAA TTGATATTTT TCTAAAAAGG GTTCCAAGAC TAATAAATGA ACCTTATACA AAAGAAATGG GTATAAAGAT TGCAAAAAAA 
GTAACTGAAA AAGGAGTAGT AAGACTTGTT AAGATTGCTG CTGGTGCAAA TATCTAA
|
Protein sequence | MKEDFTDFIE VSGLLNYDPD TISKIYKKNP KRLLKRLWQT LIPIFAYIFS VGWDKFTGRL KNEQQARFRA RELTNLLVEL GPAFVKAGQA LSTRPDIIPG ILLEELSELQ DQLPGFDGNK AMELIEEDLG YKINEIFLEI DKEPISAASL GQVHKAKLKN EEIVAIKVQR PGLREQITLD LYIVRNIAYW LKNNIGLIRS DLVALIDELG KRVFEEMDYL NEAANAEKFR DMHKHNKMIA VPKIYKEITS RRVLAMEWID GTKLTNLEDV KKLGINPDEM IDIGVQCSLE QLLEHGFFHA DPHPGNLLAL EDGRLCYLDF GMMSEVSRES RSGLIQAVVH LVNKNFDKLS QDFVKLGFLS EEVNLEPIVP AFQDVFINAV EQGVSKMDFK SVTDDMSGVM YKFPFRLPPY YALIIRSLLT LEGIALSVDP NFKILGAAYP YFARRLMEDP DPQLRESLKE MLFDNKKFKW DRLEDLLSNA AKQTNLDLEK LLDEVINLLF SPNGGFLRNE IVEGLTNQID LFSLKILKSL NNYLPQSIKL NTINENNNLN DLIMYVEPLR NFLEILQKVP GYSIDIFLKR VPRLINEPYT KEMGIKIAKK VTEKGVVRLV KIAAGANI
|
| |
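The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines. This is a minimal illustration, not the pipeline used to produce the record: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and the helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon -> amino acid mapping, bases ordered T, C, A, G (NCBI layout).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the 10-base grouping spaces and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)
```

For example, `translate("ATGAAAGAAG ATTTTACTGA TTTTATTGAG")` on the first 30 bases of the gene sequence gives `MKEDFTDFIE`, the first ten residues of the protein sequence. Passing the full gene sequence to `gc_content` gives a value that can be compared against the GC content field.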