Gene P9301_02291 details

Gene Information

Locus tag: P9301_02291
Symbol:
ID: 4912844
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9301
Kingdom: Bacteria
Replicon accession: NC_009091
Strand:
Start bp: 213525
End bp: 215192
Gene Length: 1668 bp
Protein Length: 555 aa
Translation table: 11
GC content: 33%
IMG OID: 640159795
Product: putative kinase
Protein accession: YP_001090453
Protein GI: 126695567
COG category: [R] General function prediction only
COG ID: [COG0661] Predicted unusual protein kinase
TIGRFAM ID:
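
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive (GenBank-style) and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative only:

```python
# Values copied from the record above.
start_bp = 213525
end_bp = 215192

# Assumption: coordinates are 1-based and inclusive (GenBank-style),
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon, which is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1668 0 555
```

Under these assumptions the result agrees with the 1668 bp and 555 aa given in the record.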


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTTTTC AAATTCTTCA TTATCGTTTA AAAAAATTGA AGAGGGCTTT TCTTATTTGG 
AAAACTCTTA TTTCGCTTTT AATAAATTTG TGGATAGATA ATATTAGATT TACAATTTTC
CAAATTAAAA GCAATGAAAA AAGTAGGGTC CAAATTAAAA GAGCTAGGTG GTTTACTAAT
CAATTAATAA ATCTTGGTTC AGCATTTATT AAAATTGGAC AATTATTATC AGCGAGACCT
GATTTAATTC CTAATACCTG GATACAGGAA TTGTCTAAAT TGCAAGATCA AGTTCCTAAT
TTTTCATTTG AGCAAATTGA AGAAACTATA AGAGAAGAAC TAGGGTCCAA GTTTTATGAA
ATAGATCAAA TAATAGCTGA TCCGGTTGGA TCAGCATCAC TAGCTCAGGT TCATAGGGCG
ACTTTAAAAG ATGGCAAGAA AGTAGTATTC AAAGTTCAAA GACCCAATTT AAAAGAATTG
TTTATTATCG ATTTGGGCAT AATGCAGCAG ATAGCAGGAT TGTTGCAAAA AAACAAGAAC
TGGAGTCGAG GTAGAAACTG GGTTGAGATT GCTAAAGAGT GCAGAAAAGT TCTCATGAAG
GAGCTTGATT TTAATTGTGA AGCACAATAT GCAGCAAGAT TTAGACAGCA ATTTCTTGAT
GATGTAAATG TTGAAGTTCC TGAAGTGATT TGGGATATGA GCAGTGAAAA AGTGCTTTGC
TTAAGTTATC TTGAGGGAAC GAAAATAAGC GATTTAGAAA AATTAAAATT ACAAGAAATT
GATTTGCCTA AAATTGCAGA AATAGGGGCT ATAAGCTATC TAAAACAATT AGTAAATTAC
GGTTTTTTCC ATGCAGATCC TCATCCAGGG AATCTAGCAG TTTCAAAAAA AGGTAAATTG
ATTTTTTATG ATTTTGGAAT GATGGGCAAC ATTTCAAATA ATCTTCAAAC AAGATTAGGG
GGGATGGTTA AGGCTGCCGC ATTAAGAGAC GCCTCATCAC TTGTTAGCCA ATTACAGCAA
GCTGGGCTAA TTTCAAAAGA TATTGATGTT GGACCAGTCA GAAGATTAGT CAGACTGATG
CTTAAAGAGG CCTTAACTCC CCCATTTAGC CCAAATATTA TTGAAAAATT ATCTGGAGAT
TTATACGAAC TTGTTTATGA AACACCATTT CAACTACCAG TAGATTTAAT CTTTGTGATG
AGAGCTTTAT CAACTTTTGA AGGAGTTGGC AGAATGCTCG ATCCAGGGTT TAACCTTGTA
TCAGTTACCA AGCCTTATTT AATAGAACTT ATGACTTCAA ATAACCAAAC TCCCAACGAT
TTAATTAACC AATTTGGAAG GCAAGTAGGT GAACTAGGAT CTAAAGCTGT TGGAATTCCC
AAAAGAATAG ATGAAAGTTT AGAAAGATTA GAACAGGGCG ATTTACAATT GCAAATAAGA
ATGGGAGAGT CTGATAGGCA ATTCAAAAAA ATGTTTACCG CTCAAAAAAC TTTAGGCCAT
TCAATTCTTA TAGGCAGCTT ATCAATTGCA TCTGCATTAC TAGTAACCAA TAAACAAAAT
AATTTTGCAT TGTTGCCACT TCTTTTTGCA GTACCAATAA GTATTGATTG GATAAAATGC
CAATTAAGTA TGAGAAAAGG CTCACGTTTA GAAAAACTAA AGCAATAA
 
Protein sequence
MSFQILHYRL KKLKRAFLIW KTLISLLINL WIDNIRFTIF QIKSNEKSRV QIKRARWFTN 
QLINLGSAFI KIGQLLSARP DLIPNTWIQE LSKLQDQVPN FSFEQIEETI REELGSKFYE
IDQIIADPVG SASLAQVHRA TLKDGKKVVF KVQRPNLKEL FIIDLGIMQQ IAGLLQKNKN
WSRGRNWVEI AKECRKVLMK ELDFNCEAQY AARFRQQFLD DVNVEVPEVI WDMSSEKVLC
LSYLEGTKIS DLEKLKLQEI DLPKIAEIGA ISYLKQLVNY GFFHADPHPG NLAVSKKGKL
IFYDFGMMGN ISNNLQTRLG GMVKAAALRD ASSLVSQLQQ AGLISKDIDV GPVRRLVRLM
LKEALTPPFS PNIIEKLSGD LYELVYETPF QLPVDLIFVM RALSTFEGVG RMLDPGFNLV
SVTKPYLIEL MTSNNQTPND LINQFGRQVG ELGSKAVGIP KRIDESLERL EQGDLQLQIR
MGESDRQFKK MFTAQKTLGH SILIGSLSIA SALLVTNKQN NFALLPLLFA VPISIDWIKC
QLSMRKGSRL EKLKQ
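
The sequences above are printed in space-separated blocks of ten. A minimal sketch of how such a block could be cleaned, translated, and checked for GC content is given here. It uses only the first line of the gene sequence as input. The helper names (clean, gc_fraction, translate) are hypothetical, and the codon table is built by hand from the standard NCBI ordering, whose amino acid assignments translation table 11 shares with table 1.

```python
from itertools import product

# Amino acids for codons in NCBI order (bases T, C, A, G; third base varies fastest).
# Translation table 11 uses the same codon-to-amino-acid assignments as table 1.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used for display."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence shown above (60 bases).
excerpt = clean(
    "ATGAGTTTTC AAATTCTTCA TTATCGTTTA AAAAAATTGA AGAGGGCTTT TCTTATTTGG"
)
peptide = translate(excerpt)
gc = gc_fraction(excerpt)
print(peptide)  # prints: MSFQILHYRLKKLKRAFLIW
```

The translated excerpt matches the first twenty residues of the protein sequence above. Running the same helpers on the full gene sequence would be one way to check the protein length and GC content listed in the record.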