Gene NATL1_14981 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_14981 
Symbol 
ID4780006 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1210011 
End bp1211657 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content33% 
IMG OID640084779 
Productkinase 
Protein accessionYP_001015320 
Protein GI124026204 
COG category[R] General function prediction only 
COG ID[COG0661] Predicted unusual protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATACG ATTTCCAAAG AGATTTAACT TGGTTGATTT CAAGACCATG GATATTGGTT 
CCTAGATTTA TTCAGATAGT TGGTGCAATA TCACTTCTAC TAGCAAGACT TATATTTCAA
GGTTCAAGCA CTGAAGACAA CACACAAAAG AAATTAGCTA AATTTCTACT TAAAACACTT
ATCGATCTGG GTCCCTGTTT TATAAAAGTA GGGCAAGCAC TTTCAACCAG GCCAGATCTA
ATAAGAAAAG ATTGGCTCGA AGAATTAACA AACCTACAAG ATAATTTACC CTCATTTTCT
CATGAAAAAG CACTTGAGAT AATTGAAAAT GAGCTAGGCA AACCAGCAAG TGAACTTTTT
GAAGAGTTCC CATCTAAACC AATAGCCTCA GCAAGTCTTG GGCAGGTTTA TAAAGCAAAA
TTATTTGGCG ATTATTGGGT AGCAGTAAAA GTTCAGAGGC CTAATCTGAT TTTCAATATT
AGGAGAGATA TCGTAATTAT AAAAATACTT GGTGTTCTTA GTGCGCCAAT ACTTCCATTA
AATCTTGGGT TCGGACTAGG TGAAATAATA GATGAATTTG GAAGAAGTTT ATTTGACGAA
GTTGATTATA AAAAAGAAGC TGATAATGCA GAAAAATTTT CTAACCTCTT CCACAATAAT
AATTCAGTAA CTATACCGAC AGTAGAAAGA CATTTATCAA CTATAAAAGT ATTAACTACA
AGCTGGATTG ATGGCACAAA ACTAAAAGAA AGAAATGAAT TAATTGAAAA TAATTTAAAT
CCTTCAAAAC TTATAAGAAC TGGTGTTATC AGCGGCATAC AACAATTATT GGAATTTGGA
TATTTTCATG CAGATCCTCA TCCAGGAAAT ATGTTTGCAC TGGAAGGTCA TAATGGTGAT
TTAGGAAATA TAGCCTATGT AGATTTCGGA ATGATGGACT CAATTTCAGA GGTAGATAGA
TTAACTCTTA CTGGTGCAAT TGTTCACCTG ATAAATAATG ATTTTCATTC TGTTGCAAAA
GATTTTCAAA AGCTAGGTTT TCTAAGTAAA GAACAGGACC TTAAGCCTCT AATACCTGTA
TTAAAGGAAG TGCTTGGAAG TGCAATTGGC AAAGATGTTG CAACTTTTAA TTTCAAAGAA
ATTACCAACA AATTCTCCGA ATTAATGTTT GATTATCCCT TTAGAGTCCC CGCAAGATTT
GCATTGATTA TTAGAGCAGT AATCAGCCAA GAAGGACTTG CCATCCGATT AGATCCTGAT
TTTAAAATTA TAGATTTAGC ATATCCATAT GTAGCCAAAA GATTACTCAC TTCAGATACT
AATGAAATGA TAAATATATT ATTAGACGTC ATATTTGATC AGAATGGACA TTTAAGAGTT
GAGCGAATAG AAAATTTATT TGATGTACTA GTCCAAGACT CGAGCACACC AGCTAAAGAA
CTAATACCAG TAGCAGGAGC AGGTTTAAAG CTTCTAACAA GCTCTAGAGG CTCATTAATA
AGGAAAAATT TATTAATGAG TATTATTAAA GACGAAAGGA TCGATACTAA AGATATGAAG
CAATTAATCA AATTAGTACG AAAAACCTTC AATCCTTTAA AAATGGTTTC AGCTCTTACA
AAACAAAACA ACACACAAGT TGCGTAG
 
Protein sequence
MKYDFQRDLT WLISRPWILV PRFIQIVGAI SLLLARLIFQ GSSTEDNTQK KLAKFLLKTL 
IDLGPCFIKV GQALSTRPDL IRKDWLEELT NLQDNLPSFS HEKALEIIEN ELGKPASELF
EEFPSKPIAS ASLGQVYKAK LFGDYWVAVK VQRPNLIFNI RRDIVIIKIL GVLSAPILPL
NLGFGLGEII DEFGRSLFDE VDYKKEADNA EKFSNLFHNN NSVTIPTVER HLSTIKVLTT
SWIDGTKLKE RNELIENNLN PSKLIRTGVI SGIQQLLEFG YFHADPHPGN MFALEGHNGD
LGNIAYVDFG MMDSISEVDR LTLTGAIVHL INNDFHSVAK DFQKLGFLSK EQDLKPLIPV
LKEVLGSAIG KDVATFNFKE ITNKFSELMF DYPFRVPARF ALIIRAVISQ EGLAIRLDPD
FKIIDLAYPY VAKRLLTSDT NEMINILLDV IFDQNGHLRV ERIENLFDVL VQDSSTPAKE
LIPVAGAGLK LLTSSRGSLI RKNLLMSIIK DERIDTKDMK QLIKLVRKTF NPLKMVSALT
KQNNTQVA