Gene A9601_02271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_02271 
Symbol 
ID4716911 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp212547 
End bp214214 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content33% 
IMG OID640077926 
Productputative kinase 
Protein accessionYP_001008622 
Protein GI123967764 
COG category[R] General function prediction only 
COG ID[COG0661] Predicted unusual protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTATC ACATTTTTCA TTATCGTTTA AAAAAATTGA AGAGGGCCTT TGTTATTTGG 
ATAACTCTGA TTTCGCTTTT AATAAATTTA TGGATAGATA ATATTAGATT TACAATTTTC
CAAACTAAGA ATAATGAAAA AAGTAGGGTT CAAATTAAGA GAGCTAGGTG GTTTACTAAT
CAATTAATAA AGCTTGGATC AGCATTTATT AAAATTGGAC AATTATTATC AGCGAGGCCT
GATTTAATTC CTAATACCTG GATACAGGAA TTGTCTAAAT TGCAGGATCA AGTTCCTAAT
TTTTCATTTA CGCAAGTTGA AGAGACTATT AGAAATGAAC TAGGGTCTAA GTTTAATGAA
ATAGATCAAA TAATATGTGA TCCGGTTGGA TCAGCATCAC TAGCTCAGGT TCATAGGGCG
ACTCTAAAAG ATGGTAAGAC AGTAGTTTTT AAAGTTCAAA GACCCAATTT AAAAGAATTA
TTTATTATCG ATTTGGGCAT AATGCAGCAA ATAGCAGGAT TATTGCAGAA AAATAAGAAT
TTGAGTAGAG GTAGAAACTG GGTTGAGATT GCTAAAGAGT GTAGGAAAGT TCTGATGAAA
GAACTTGATT TTAATTGCGA AGCGCAATAT GCAGCAAGAT TTAGGCAGCA ATTTCTTGAT
GATGAAAATG TTGAAGTTCC TGAAGTAATT TGGGATATGA GCAGTGAAAA AGTACTTTGT
TTAAGTTATG TAGAAGGGAC AAAAATAAGC GATTTAGAAA AATTAAAATC ACAAGAAATT
GATTTACCTA AAATTGCAGA GATAGGTGCA ATCAGCTACT TAAAACAATT AGTAAATTAC
GGTTTTTTTC ATGCAGATCC TCATCCAGGG AATTTAGCAG TTTCAAGTGA AGGTAAATTA
ATTTTTTATG ATTTTGGAAT GATGGGGAAC ATCTCAAATA ATCTTCAAAA AAGATTAGGG
GGGATGGTTA AGGCTGCTGC TCTTAGAGAC GCCTCATCAC TAGTTAGTCA ATTACAACAA
GCTGGGCTAA TTTCAAAAGA TATAGATGTA GGACCAGTCA GAAGATTAGT CAGATTGATG
CTTAAAGAAG CCTTAACTCC TCCATTTAGT CCTAATATTA TTGAAAAATT ATCTGGAGAT
TTATACGAAC TGGTTTATGA AACGCCATTT CAACTGCCAG TAGATTTAAT CTTTGTGATG
AGAGCTTTAT CAACTTTTGA AGGAGTTGGT AGAATGCTTG ATCCAGGGTT TAACCTTGTA
TCAGTTACCA AGCCTTATTT AATAGAACTT ATGACTTCAA ATAATCAAAG TCCCAACGAT
TTAATTAACC AATTTGGAAG GCAAGTAGGC GAACTAGGAT CGAAAGCTGT TGGAATTCCC
AAAAGAATAG ATGAAAGTTT AGAAAGATTA GAACAGGGAG ATTTACAATT GCAAATAAGA
ATGGGCGAGT CTGATAGGCA ATTCAAAAAA ATGTTTACGG CTCAAAAAAC TTTAGGCCAT
TCAATTCTAA TAGGAAGCTT ATCAATTGCA TCCGCTTTAC TTGTATCCAA TAAACAAAAT
AATTTTGCAT TGTTGCCACT CTTTTTTGCA CTACCAATAA GTATTGATTG GATAAAGTGC
CAATTAAGTA TGAGAAAAGG CTCACGTTTA GAAAAACTTA AGCGCTAA
 
Protein sequence
MSYHIFHYRL KKLKRAFVIW ITLISLLINL WIDNIRFTIF QTKNNEKSRV QIKRARWFTN 
QLIKLGSAFI KIGQLLSARP DLIPNTWIQE LSKLQDQVPN FSFTQVEETI RNELGSKFNE
IDQIICDPVG SASLAQVHRA TLKDGKTVVF KVQRPNLKEL FIIDLGIMQQ IAGLLQKNKN
LSRGRNWVEI AKECRKVLMK ELDFNCEAQY AARFRQQFLD DENVEVPEVI WDMSSEKVLC
LSYVEGTKIS DLEKLKSQEI DLPKIAEIGA ISYLKQLVNY GFFHADPHPG NLAVSSEGKL
IFYDFGMMGN ISNNLQKRLG GMVKAAALRD ASSLVSQLQQ AGLISKDIDV GPVRRLVRLM
LKEALTPPFS PNIIEKLSGD LYELVYETPF QLPVDLIFVM RALSTFEGVG RMLDPGFNLV
SVTKPYLIEL MTSNNQSPND LINQFGRQVG ELGSKAVGIP KRIDESLERL EQGDLQLQIR
MGESDRQFKK MFTAQKTLGH SILIGSLSIA SALLVSNKQN NFALLPLFFA LPISIDWIKC
QLSMRKGSRL EKLKR