Gene NATL1_02851 details

Gene Information

Locus tag: NATL1_02851
Symbol:
ID: 4779995
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 264676
End bp: 266301
Gene Length: 1626 bp
Protein Length: 541 aa
Translation table: 11
GC content: 34%
IMG OID: 640083550
Product: putative kinase
Protein accession: YP_001014114
Protein GI: 124024998
COG category: [R] General function prediction only
COG ID: [COG0661] Predicted unusual protein kinase
TIGRFAM ID:
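
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 264676
END_BP = 266301
GENE_LENGTH_BP = 1626
PROTEIN_LENGTH_AA = 541


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))   # 1626
    print(coding_codons(GENE_LENGTH_BP))   # 541
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.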


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATATTAT TAGGTTTACT CTTCTATTTA TGGTTTGATT CTAAAAAATG GACTTATTTA 
AAAGGATATA CCACTAAAAA AAGAGAAAAA AGAACTATTT CAAGAGCAAA ATGGATTACA
AAAGAATTAA TTAATTTAGG ATCAGCATTT ATAAAATTAG GCCAACTTTT ATCTGCTAGA
GCTGATGTTA TTCCATCTTC TTGGGTAAAT GAGCTTACCT CTCTACAAGA CAGAGTTCCA
CCTTTTGCAT TTGAAAAAGT CGATGAAATT CTAAGGAAAC AACTTGGAAA TTTATACAAT
AAAATCATAA GTATTGATAA AAATCCAATT GGCTCAGCTT CACTTGCTCA AGTTCATAAG
GCTAGATTAA ATAATGACAA AAATGTCATT TTTAAAGTAC AAAGACCAGA TATAGAAAAG
TTTTTCAGGC TTGACTTAGA TGTGATGAAT CAAGTTGCAA AAGTAGTGCA AAGAATAAAG
TCTTTAAGTA GAGGTAATGA CTGGATAGGT ATTGCAAAGG AGTCTAGGAG AGTATTACTC
AGAGAGCTAG ATTTTCGAAT TGAAGCTCAA TATGCTGCAA GATTTAAACA ACAATTTATA
GATGATTCAG ATATTTTAAT ACCAAGTGTA TTTTGGGATT TAAGCACTAG TAAAGTTTTA
TGCTTGGAGT ATTTACCAGG AATTAAGATC AATGACATTA AATCTCTAAA AAGTAATGAT
ATTGATACTT CTTCAATCGC AAAGATTGGA GCTACAAGTT ATTTAAAGCA ATTAGTTAAT
TACGGTTTTT TTCACGCTGA TCCTCATCCT GGTAACTTAG CTGTTTCTAA GAGTGGTTCT
CTCATTTACT ATGATTTTGG GATGATGGGT TTTGTCTCTG AGCGTATTAG AGGCAGGCTC
AATTCAATGA TAAAAGCGGC AGCGTTGAGA GACGTTAGTG GACTGGTCAA AGAACTTCAA
ATTGCTGGAC TAATAGAAGA CGATATTGAA ATAGGTCCAG TTAGAAGATT AATAAGAATA
ATGTTAAATG AAGCCCTAAC TCCTCCTTTT GATTCTCAAA TCATAGAAAA GCTCTCTGGG
GATATATCCG AACTGGCTTA TGGCAAACCC TTTAGAATAC CAATTGAGTT GATTTTTGTT
TTTAGAGCAT TATCTACTTT TGAAGGAGTT GGAAGATATT TAGACCCAGA ATTTAACTTA
ATAGCAATAG CAAAACCCTT CCTTCTCCCT CTCATGACTT CAAAGAATCC AGACTCTAAT
GATCTATTTA ATGAATTAGG AAGACAAGTT ACTGAAATAG GAAGTAAAGC AGTTGGGTTG
CCAAAACGCT TAGATGAAAA TCTAGAAAGA CTTGAACAAG GGGACCTGCA ACTTCAAGTA
CGAATGGGTG AATCAGACAG ACAACTGAGA AGAATGATTA ATGCACAACA AACGCTAGGT
AATTCCGTTC TCCTTGGCTC ACTTGCAATA TCGTCAGCTT TATTAGCCTC AAATAGCAAA
CCACATTTAT TTTTCATACC ATTAATTCCA GGGTTTCCAA TAGCCCTAAA TTGGATCACA
TTACAATTCA AAATGAGAAA CGAAAATCGA CTTGATAATT TTCAAGGAAG ACGAGGCTCT
CGCTAG
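
The gene sequence above is laid out in space-separated blocks of ten bases, so it has to be joined before any computation. The sketch below shows one way to do that and to compute a GC fraction that could be compared against the reported GC content of 34%. The helper names `clean_sequence` and `gc_fraction` are illustrative assumptions, and no result for the full gene is asserted here.

```python
def clean_sequence(raw: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence (0.0 for an empty input)."""
    seq = clean_sequence(seq)
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence, pasted as it appears in the record.
    first_line = (
        "ATGATATTAT TAGGTTTACT CTTCTATTTA "
        "TGGTTTGATT CTAAAAAATG GACTTATTTA"
    )
    print(len(clean_sequence(first_line)))
    print(round(100 * gc_fraction(first_line)))
```

To check the whole gene, pass all sequence lines to `gc_fraction` as a single string; whitespace and line breaks are ignored.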
 
Protein sequence
MILLGLLFYL WFDSKKWTYL KGYTTKKREK RTISRAKWIT KELINLGSAF IKLGQLLSAR 
ADVIPSSWVN ELTSLQDRVP PFAFEKVDEI LRKQLGNLYN KIISIDKNPI GSASLAQVHK
ARLNNDKNVI FKVQRPDIEK FFRLDLDVMN QVAKVVQRIK SLSRGNDWIG IAKESRRVLL
RELDFRIEAQ YAARFKQQFI DDSDILIPSV FWDLSTSKVL CLEYLPGIKI NDIKSLKSND
IDTSSIAKIG ATSYLKQLVN YGFFHADPHP GNLAVSKSGS LIYYDFGMMG FVSERIRGRL
NSMIKAAALR DVSGLVKELQ IAGLIEDDIE IGPVRRLIRI MLNEALTPPF DSQIIEKLSG
DISELAYGKP FRIPIELIFV FRALSTFEGV GRYLDPEFNL IAIAKPFLLP LMTSKNPDSN
DLFNELGRQV TEIGSKAVGL PKRLDENLER LEQGDLQLQV RMGESDRQLR RMINAQQTLG
NSVLLGSLAI SSALLASNSK PHLFFIPLIP GFPIALNWIT LQFKMRNENR LDNFQGRRGS
R
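
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The record specifies translation table 11, which assigns the same amino acids to codons as the standard genetic code and differs from it in the permitted start codons. The following is a minimal sketch that builds the standard codon table by hand rather than using a library; `translate` is an illustrative helper, and unknown codons are reported as `X` by assumption.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in TCAG order ('*' marks a stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
assert len(AMINO_ACIDS) == 64

CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a DNA string codon by codon, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Trailing bases that do not form a full codon are ignored.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for unknown codons
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence above.
    print(translate("ATGATATTAT TAGGTTTACT CTTCTATTTA"))  # MILLGLLFYL
```

The output matches the first block of the protein sequence. Passing the full gene sequence should yield the complete protein up to the terminal stop codon.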