Gene A9601_19231 details


Gene Information

Locus tag: A9601_19231
Symbol:
ID: 4718663
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 1666179
End bp: 1668035
Gene Length: 1857 bp
Protein Length: 618 aa
Translation table: 11
GC content: 32%
IMG OID: 640079658
Product: hypothetical protein
Protein accession: YP_001010312
Protein GI: 123969455
COG category: [R] General function prediction only
COG ID: [COG0661] Predicted unusual protein kinase
TIGRFAM ID:
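
The coordinate and length fields in this record agree with each other, which a little arithmetic confirms. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(1666179, 1668035)
print(length, protein_length(length))  # prints: 1857 618
```

Both values match the Gene Length (1857 bp) and Protein Length (618 aa) fields listed for this gene.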


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.494256
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGAAG ACTTTACTGA TTTTATTGAG GTATCTGGAC TCTTAAATTA TGATCCAGAT 
ACAATTTCTA AAATTTACAA AAAAAATCCT AAAAGACTTT TAAAAAGGCT TTGGCAAACA
CTCCTACCTA TTTTTGCTTA CATCTTTTCC GTTGGATGGG ATAAATTAAC TGGAAGGCTG
AAAAATAAAC AGCAAGCAAG ATTTAGAGCA AGAGAATTAA CAAATTTGTT AGTAGAACTT
GGACCTGCAT TTGTTAAAGC AGGCCAAGCT TTATCAACAA GACCAGATAT AATCCCAGGC
ATTCTTCTAG AAGAATTATC TGAATTGCAA GATCAGCTCC CAGGTTTTGA TAGCGATAAA
GCTATGGAAT TAATAGAAGA AGATTTAGGA AACAAAATAG ATGAGATTTT TTTAGAAATT
GATAAAGAGC CAATTTCTGC TGCTTCTTTA GGTCAAGTAC ATAAAGCTAA ATTAAAAAAC
GAAGAGATCG TTGCAATAAA AGTACAAAGG CCAGGTTTAA GAGAACAAAT AACCTTAGAC
CTTTACATTG TAAGAAATAT TGCTTATTGG CTAAAAAACA ATATCGGATT GATAAGAAGT
GATCTAGTTG CTTTGATTGA TGAATTAGGC AAGAGGGTTT TTGAAGAAAT GGATTATTTA
AACGAAGCTG CAAATGCAGA AAAATTTAGA GATATGCATA AACATAACAA GATGATTGCC
GTACCAAAAA TTTATAAAGA AATAACTTCA AGAAGAGTTT TAGCAATGGA ATGGATAGAC
GGTACAAAAT TAACAAATTT AGAGGATGTA AAAAAATTAG GAATTAATCC TGATGACATG
ATTGATATAG GGGTGCAATG CAGTTTAGAA CAGCTTTTAG AACATGGTTT TTTTCATGCA
GACCCGCATC CAGGTAATTT ATTAGCCTTA GAAGATGGAA GATTATGTTA TCTAGATTTT
GGAATGATGA GCGAGGTTTC CAGAGAATCT AGGTCAGGAT TAATTCAAGC AGTAGTTCAC
TTAGTAAATA AAAACTTCGA TAAATTGTCT CAAGATTTCG TAAAATTAGG ATTTTTATCA
GAGGAAGTTA ATCTAGAACC TATTGTTCCA GCATTTCAAG ATGTTTTCAT TAACGCCGTT
GAACAAGGAG TTTCGAAAAT GGATTTTAAG AGCGTTACTG ACGATATGTC TGGTGTTATG
TATAAATTCC CTTTCAGACT ACCCCCGTAT TATGCTCTTA TAATTAGATC ATTACTTACA
TTAGAGGGAA TAGCTTTAAG CGTAGATCCA AACTTCAAAA TATTAGGAGC AGCTTATCCA
TATTTTGCAA GAAGATTGAT GGAAGACCCT GATCCACAAT TGAGGGAAAG CCTTAAAGAA
ATGCTTTTTG ATAATAAAAA ATTTAAATGG GATCGTTTAG AAGATCTACT TTCTAACGCT
GCAAAGCAAA CAAATCTCGA TTTAGAAAAA CTTTTAGACG AAGTTATAAA TCTTCTCTTT
TCTCCAACTG GAGGATTTCT TAGAAATGAG ATAGTTGAAG GTTTAACAAA TCAGATAGAT
TTACTTAGTC TAAAAATATT GAAAAGTTTA AATAATTATC TTCCACAATC AATTAAATTA
AATACTACTA ACGAAAATAA TAACTTGAGT GACCTTATAA TGTATGTTGA GCCATTGAGA
AACTTTTTAG AGATTTTACA AAAAGTACCG GGGTATTCAA TTGACATTTT TCTAAGAAGA
GTGCCAAGAC TTATTAATGA GCCCTATACA AAAGAAATGG GTATAAAAAT AGCAAAAAAA
GTAACTGAAA AAGGAGTAGT AAGACTTGTT AAGATTGCCG CTGGTGCAAA TATATAA
 
Protein sequence
MKEDFTDFIE VSGLLNYDPD TISKIYKKNP KRLLKRLWQT LLPIFAYIFS VGWDKLTGRL 
KNKQQARFRA RELTNLLVEL GPAFVKAGQA LSTRPDIIPG ILLEELSELQ DQLPGFDSDK
AMELIEEDLG NKIDEIFLEI DKEPISAASL GQVHKAKLKN EEIVAIKVQR PGLREQITLD
LYIVRNIAYW LKNNIGLIRS DLVALIDELG KRVFEEMDYL NEAANAEKFR DMHKHNKMIA
VPKIYKEITS RRVLAMEWID GTKLTNLEDV KKLGINPDDM IDIGVQCSLE QLLEHGFFHA
DPHPGNLLAL EDGRLCYLDF GMMSEVSRES RSGLIQAVVH LVNKNFDKLS QDFVKLGFLS
EEVNLEPIVP AFQDVFINAV EQGVSKMDFK SVTDDMSGVM YKFPFRLPPY YALIIRSLLT
LEGIALSVDP NFKILGAAYP YFARRLMEDP DPQLRESLKE MLFDNKKFKW DRLEDLLSNA
AKQTNLDLEK LLDEVINLLF SPTGGFLRNE IVEGLTNQID LLSLKILKSL NNYLPQSIKL
NTTNENNNLS DLIMYVEPLR NFLEILQKVP GYSIDIFLRR VPRLINEPYT KEMGIKIAKK
VTEKGVVRLV KIAAGANI
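
The protein sequence can be derived from the gene sequence by removing the block spacing and translating codon by codon. The following is a minimal Python sketch of that step, not the database's own pipeline. It assumes the codon-to-amino-acid assignments of the standard genetic code, which NCBI translation table 11 shares (table 11 differs in its permitted start codons). The names `clean` and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Amino acids for all 64 codons in NCBI's TCAG ordering (standard code).
# Translation table 11 is assumed to use the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay the sequence out in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a cleaned coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
first_blocks = "ATGAAAGAAG ACTTTACTGA TTTTATTGAG"
print(translate(clean(first_blocks)))  # prints: MKEDFTDFIE
```

The output matches the first block of the protein sequence. Passing the full gene sequence through `clean` and `translate` in the same way should reproduce the complete protein, ending before the terminal TAA stop codon.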