Gene P9515_02381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9515_02381 
Symbol 
ID4720374 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9515 
KingdomBacteria 
Replicon accessionNC_008817 
Strand
Start bp220177 
End bp221844 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content32% 
IMG OID640079901 
Productputative kinase 
Protein accessionYP_001010554 
Protein GI123965473 
COG category[R] General function prediction only 
COG ID[COG0661] Predicted unusual protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.802769 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAATC ATAAATTAAA AAATAGAATA CAAAAAATCA AAAGAAGCTT TCTTATTTGG 
AAAACACTTA TTTTACTTTT AGTCAATTTA TGGATAGATA ATTTAAAAAC TAAAATTTTT
CAAACTAACA AAGATAAAAA TAAAAAAGTT CAAATAAAAA GAGCTAGGTG GTTTACTAAT
CAATTAATTG ATCTTGGTTC TGCCTTTATC AAAATTGGTC AGCTATTGTC GGCAAGACCT
GATTTAATTC CTAATACTTG GATACAAGAA TTATCAAAGT TACAAGATCA AGTTCCTCAG
TTTTCATATA CAAAAGTTGA GGAAATTATC AAAACCGAAC TCGGGCAGAA ATTCTCAGAA
ATTAACAAAA TTAATGATTT GCCAATTGGA TCAGCTTCTT TAGCTCAAGT TCATAGAGCG
ACTTTGAGAA ATGGGAAAGA AGTTGTATTT AAAGTTCAAA GACCCAATCT TAAGCAATTA
TTCATAATCG ATTTGAACAT AATGCAACAA ATTGCTTTTG TACTGCAAAA GAATAGGAAT
TGGAGTCGTG GAAGAAATTG GGTTGATATA GCAAAAGAGT GCAGAAAAGT TCTGATGAAA
GAACTAGATT TTAAATGTGA AGCACAGTAT GCCGCAAGAT TTAGACAACA ATTTCTTGAT
GATGATAATG TAGAAGTCCC TGAAGTTATT TGGGACTTGA GTAGTGACAA AGTCCTTTGT
CTTAGTTATT TAGAAGGTAT AAAAATAAGT GATATTGAAA AATTAAAGTC TAAGAACATT
GACTTACCTA AGATTGCCGA AATAGGTGCA ATTAGCTATC TGAAACAACT AGTAAATTAT
GGTTTTTTTC ATGCTGATCC TCATCCTGGG AATTTAGCCG TTTCAAATTC AGGCAAATTA
ATCTTTTATG ATTTTGGGAT GATGGGCAAT ATTTCAAATA ACCTACAAGT CAGATTGGGA
TCTATGGTTC AATCAGCAGC ATTGAGAGAT GCATCTTCAC TTGTCACTCA ACTACAACAA
GCAGGCTTGA TCTCCAAAGA TATAGATGTT GGACCAGTAA GAAGATTAGT TAGATTAATG
CTAAAAGAAG CTTTAACTCC TCCATTCAGC CCAAAAATTA TCGAAAAACT ATCTGGGGAT
TTATATGAAC TTGTTTATGA AACTCCTTTT CAACTACCAG TGGATTTAAT TTTTGTGATG
AGAGCACTAT CAACTTTTGA AGGAGTTGGT AGAATGTTAG ACCCAGGGTT TAACCTTGTA
TCAATTACTA AGCCTTATCT AATTGACCTC ATGACTTCTA ATAATCAATC ACCAAACGAT
TTAATTAACC AATTCGGGAG ACAAGTTGGT GAATTAGGTT CAAAGGCTGT AGGCATCCCA
AAAAGAATTG ATGAGAGTTT AGAGAGATTA GAGCAGGGTG ATTTACAACT TCAAATAAGG
ATGGGAGAAT CTGACAGGCA ATTTAAGAAA ATGTTTACTG CACAAAAATC ATTAGGACAT
TCAATTCTTA TGGGAAGCCT TACAATAGCA TCAGCTTTAC TTGTGACTAA GAACCAAAAT
AATTTAGCTA TTTTACCTTT ATTATTTGCA GCACCAATAA GTATTGATTG GATAAAATGT
CAACTAAACA TGAGAAAAGG TTCAAGAATA GACAAGCTCA AGAAGTAG
 
Protein sequence
MSNHKLKNRI QKIKRSFLIW KTLILLLVNL WIDNLKTKIF QTNKDKNKKV QIKRARWFTN 
QLIDLGSAFI KIGQLLSARP DLIPNTWIQE LSKLQDQVPQ FSYTKVEEII KTELGQKFSE
INKINDLPIG SASLAQVHRA TLRNGKEVVF KVQRPNLKQL FIIDLNIMQQ IAFVLQKNRN
WSRGRNWVDI AKECRKVLMK ELDFKCEAQY AARFRQQFLD DDNVEVPEVI WDLSSDKVLC
LSYLEGIKIS DIEKLKSKNI DLPKIAEIGA ISYLKQLVNY GFFHADPHPG NLAVSNSGKL
IFYDFGMMGN ISNNLQVRLG SMVQSAALRD ASSLVTQLQQ AGLISKDIDV GPVRRLVRLM
LKEALTPPFS PKIIEKLSGD LYELVYETPF QLPVDLIFVM RALSTFEGVG RMLDPGFNLV
SITKPYLIDL MTSNNQSPND LINQFGRQVG ELGSKAVGIP KRIDESLERL EQGDLQLQIR
MGESDRQFKK MFTAQKSLGH SILMGSLTIA SALLVTKNQN NLAILPLLFA APISIDWIKC
QLNMRKGSRI DKLKK