Gene P9301_19041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_19041 
Symbol 
ID4912451 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp1638172 
End bp1640028 
Gene Length1857 bp 
Protein Length618 aa 
Translation table11 
GC content32% 
IMG OID640161510 
Producthypothetical protein 
Protein accessionYP_001092128 
Protein GI126697242 
COG category[R] General function prediction only 
COG ID[COG0661] Predicted unusual protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGAAG ATTTTACTGA TTTTATTGAG GTATCTGGAC TCTTAAATTA CGATCCGGAT 
ACAATTTCTA AAATTTACAA AAAAAATCCT AAAAGACTTT TAAAAAGACT TTGGCAAACA
CTCATACCTA TTTTTGCTTA CATTTTCTCC GTGGGATGGG ATAAATTCAC TGGAAGATTA
AAAAATGAAC AGCAAGCAAG ATTTAGAGCA CGAGAATTAA CAAATTTATT AGTAGAACTT
GGGCCTGCAT TTGTTAAGGC AGGCCAAGCT TTATCAACAA GACCAGATAT AATCCCAGGG
ATTCTTCTAG AAGAATTATC TGAATTGCAA GATCAACTCC CCGGGTTTGA TGGCAATAAA
GCTATGGAGT TAATAGAAGA AGATTTAGGA TACAAAATAA ATGAGATTTT TTTAGAAATT
GATAAAGAAC CAATTTCCGC TGCTTCTTTA GGGCAAGTAC ATAAAGCGAA ATTAAAAAAC
GAAGAGATCG TTGCAATAAA AGTTCAAAGG CCAGGATTGA GAGAACAAAT AACCTTAGAT
CTCTATATAG TTAGAAATAT TGCTTATTGG CTAAAAAACA ATATCGGATT AATAAGAAGT
GATCTAGTTG CTTTGATTGA TGAATTAGGC AAAAGAGTTT TTGAGGAGAT GGATTATCTA
AACGAAGCTG CAAATGCAGA AAAATTTAGA GATATGCACA AACATAACAA AATGATTGCT
GTACCAAAAA TTTATAAAGA AATAACATCA AGAAGAGTAT TAGCAATGGA ATGGATAGAC
GGTACAAAAT TAACAAATTT AGAAGACGTA AAAAAATTAG GAATTAATCC TGATGAAATG
ATTGATATAG GAGTGCAATG CAGTTTAGAA CAGCTTTTAG AACATGGTTT TTTTCATGCA
GACCCACATC CAGGTAATTT ATTAGCCTTA GAAGATGGAA GATTATGTTA CTTAGATTTT
GGAATGATGA GCGAGGTTTC TAGAGAATCA AGATCGGGAT TAATTCAAGC AGTAGTACAT
TTAGTAAATA AAAACTTCGA TAAATTGTCT CAAGATTTCG TAAAATTGGG ATTTTTATCA
GAGGAAGTTA ATCTAGAGCC CATTGTTCCA GCATTTCAAG ATGTTTTCAT TAACGCCGTT
GAACAAGGAG TGTCGAAAAT GGATTTTAAA AGCGTTACAG ACGATATGTC TGGTGTTATG
TATAAATTCC CTTTCAGACT ACCACCATAT TACGCGCTTA TAATTAGGTC ATTACTTACA
TTAGAAGGAA TAGCTTTAAG CGTAGATCCA AACTTCAAGA TATTAGGCGC GGCTTATCCA
TATTTTGCAA GAAGATTGAT GGAAGATCCT GATCCACAAT TAAGGGAAAG TCTTAAAGAA
ATGCTTTTTG ATAATAAAAA ATTTAAATGG GACCGTTTAG AAGATCTACT TTCTAACGCT
GCAAAGCAAA CAAATCTCGA TTTAGAAAAA CTTTTAGACG AAGTTATAAA TCTTCTCTTT
TCTCCAAATG GAGGATTTCT TAGAAATGAG ATAGTTGAAG GTTTAACAAA TCAGATAGAT
TTATTTAGTC TAAAAATATT GAAAAGTTTG AATAACTACC TTCCACAATC AATTAAATTA
AATACTATCA ACGAGAATAA TAACTTAAAT GACCTTATAA TGTACGTAGA GCCATTGAGA
AACTTCTTAG AGATTTTACA AAAAGTCCCC GGGTATTCAA TTGATATTTT TCTAAAAAGG
GTTCCAAGAC TAATAAATGA ACCTTATACA AAAGAAATGG GTATAAAGAT TGCAAAAAAA
GTAACTGAAA AAGGAGTAGT AAGACTTGTT AAGATTGCTG CTGGTGCAAA TATCTAA
 
Protein sequence
MKEDFTDFIE VSGLLNYDPD TISKIYKKNP KRLLKRLWQT LIPIFAYIFS VGWDKFTGRL 
KNEQQARFRA RELTNLLVEL GPAFVKAGQA LSTRPDIIPG ILLEELSELQ DQLPGFDGNK
AMELIEEDLG YKINEIFLEI DKEPISAASL GQVHKAKLKN EEIVAIKVQR PGLREQITLD
LYIVRNIAYW LKNNIGLIRS DLVALIDELG KRVFEEMDYL NEAANAEKFR DMHKHNKMIA
VPKIYKEITS RRVLAMEWID GTKLTNLEDV KKLGINPDEM IDIGVQCSLE QLLEHGFFHA
DPHPGNLLAL EDGRLCYLDF GMMSEVSRES RSGLIQAVVH LVNKNFDKLS QDFVKLGFLS
EEVNLEPIVP AFQDVFINAV EQGVSKMDFK SVTDDMSGVM YKFPFRLPPY YALIIRSLLT
LEGIALSVDP NFKILGAAYP YFARRLMEDP DPQLRESLKE MLFDNKKFKW DRLEDLLSNA
AKQTNLDLEK LLDEVINLLF SPNGGFLRNE IVEGLTNQID LFSLKILKSL NNYLPQSIKL
NTINENNNLN DLIMYVEPLR NFLEILQKVP GYSIDIFLKR VPRLINEPYT KEMGIKIAKK
VTEKGVVRLV KIAAGANI