Gene NATL1_17691 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_17691 
SymbolkaiC 
ID4779577 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1451777 
End bp1453279 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content36% 
IMG OID640085057 
Productcircadian clock protein KaiC 
Protein accessionYP_001015589 
Protein GI124026474 
COG category[T] Signal transduction mechanisms 
COG ID[COG0467] RecA-superfamily ATPases implicated in signal transduction 
TIGRFAM ID[TIGR02655] circadian clock protein KaiC 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.157021 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATGTTC AAAAACTCCC CACTGGAATC GAGGGCTTTG ATGATATTTG TCATGGTGGA 
TTGCCTAATG CGCGAAGCAC TCTAGTCAGC GGAACATCAG GTACTGGTAA AACTGTTTTC
TCTCTGCAAT ATCTTCATCA TGGAATATGT AATTTTGATG AGCCTGGGAT TTTTGTTACT
TTTGAAGAAT CGCCATTAGA TATTATTCGA AATGCTGCAA GTTTTGGTTG GGATTTACAA
GGGTTAATTG ATCAAAATAA GCTTTTTATT TTAGATGCAT CTCCAGATCC AGATGGACAA
GATGTGGCTG GAAGTTTTGA TTTATCTGGG TTGATTGAAA GGATAAGTTA TGCGATTAGA
AAGTTCAAAG CTAAGAGAGT AGCTATAGAT TCAATGACAG CAGTTTTCCA ACAATATGAC
GCTATTTATG TTGTTAGAAG AGAGATTTTT CGACTAATAG CAAGACTTAA AGAAATAGGT
GTGACTACTG TTATGACCAC TGAAAGAATA GATGAATATG GTCCAATTGC TAGATATGGA
GTAGAAGAAT TCGTTTCTGA TAATGTTGTT ATTCTGAGAA ATGTTCTAGA GGCAGAGAAG
AGGAGAAGAA CTGTAGAAAT TTTGAAATTA AGAGGAACTA CACATATGAA GGGTGAATTC
CCTTTTACTA TGGGTTCCCA TGGAATAGTT GCATTCCCTT TGGGTGCAAT GAGGTTGACT
CAAAGATCAT CCAATATTCG AATAAGTTCT GGTGTTCCCG CTTTGGATGA AATGTGTGGA
GGAGGATATT TTCAAGATTC AATTATTCTT GCAACTGGTG CAACTGGTAC CGGAAAAACA
ATGCTAGTTT CAAAGTTTGT TGAAGATGCA TACTTAAATA ATGAAAGAAC AATCCTTTTT
GCGTATGAAG AATCAAGAGC ACAACTTCTT CGAAATGCAA CTAGCTGGGG GATTGATTTT
GAAAAAATGG AAGCTGATGG ATTATTGAAG ATTATTTGTG CTTATCCTGA ATCAACTGGA
TTGGAAGATC ATCTGCAAAT TATTAAAACT GAAATTAGCG AATATAAACC ATCCAGAATG
GCTATAGATT CTCTATCCGC ATTGGCTAGA GGTGTCAGCT TAAATGCATT TAGACAGTTT
GTTATTGGAG TAACTGGTTA TGCAAAACAA GAAGAAATAG CAGGATTCTT TACTAATACT
GCAGAAGAAT TCATGGGAAG CCATTCAATT ACTGATTCCC ATATTTCAAC GATAACTGAT
ACTATTTTGC TTTTGCAATA TGTAGAAATA AGAGGTGAAA TGGCAAGAGC AATAAATGTT
TTTAAGATGC GCGGATCTTG GCATGACAAA AGGATACGCG AATATATCAT TACTAATGAA
GGACCTGAGA TTAAGGATTC ATTCTCTAAC TTTGAACAGA TATTTAGTGG TGCCCCTCAT
CGAATTAATC CTGAAGAACA GATACCTGGC GTTTTTAAAA GCATTAATCC AAAAGAACGG
TAA
 
Protein sequence
MHVQKLPTGI EGFDDICHGG LPNARSTLVS GTSGTGKTVF SLQYLHHGIC NFDEPGIFVT 
FEESPLDIIR NAASFGWDLQ GLIDQNKLFI LDASPDPDGQ DVAGSFDLSG LIERISYAIR
KFKAKRVAID SMTAVFQQYD AIYVVRREIF RLIARLKEIG VTTVMTTERI DEYGPIARYG
VEEFVSDNVV ILRNVLEAEK RRRTVEILKL RGTTHMKGEF PFTMGSHGIV AFPLGAMRLT
QRSSNIRISS GVPALDEMCG GGYFQDSIIL ATGATGTGKT MLVSKFVEDA YLNNERTILF
AYEESRAQLL RNATSWGIDF EKMEADGLLK IICAYPESTG LEDHLQIIKT EISEYKPSRM
AIDSLSALAR GVSLNAFRQF VIGVTGYAKQ EEIAGFFTNT AEEFMGSHSI TDSHISTITD
TILLLQYVEI RGEMARAINV FKMRGSWHDK RIREYIITNE GPEIKDSFSN FEQIFSGAPH
RINPEEQIPG VFKSINPKER