Gene Cagg_1361 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1361 
Symbol 
ID7268653 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1685803 
End bp1687500 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content53% 
IMG OID643566204 
Productcircadian clock protein KaiC 
Protein accessionYP_002462704 
Protein GI219848271 
COG category[T] Signal transduction mechanisms 
COG ID[COG0467] RecA-superfamily ATPases implicated in signal transduction 
TIGRFAM ID[TIGR02655] circadian clock protein KaiC 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.438271 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000777387 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATCGTAG CACATATACC AAAAGCGCCA ACCGGAATTC GTGGTTTCGA TGAGATCACC 
GGTGGGGGAG TGCCGCGTGG CCGACCAACC CTGATTTGTG GTGGTCCCGG CTGTGGCAAA
ACACTCTTTG CATTTGAGAC GCTCGTTCAC GGTGCAGCTC AGCACGACGA ACCCGGCTTG
TTTGTTTCGT TTGAAGAAAG CCCGAACGAT CTCCGCACCA ATTTTGCCAG TTTTGGCTTC
AATATCACCG ATTTAGAACA ACGTGGTGCG CTCCTGATCG AGCAAATTCC TCTCGATCAG
AGTGATTGGG CTGAAGTCGG TGAATATGAT CTCGAGGGGT TATTTATTCG GCTTGGGCTA
TTGATCGACC GTATTGGCGC CAAGCGTATC GCCCTCGATA CGATCGAAGC GCTGTTTAGC
GCCTTCAGAA ATGAATCGTT GCTACGGAGT GAATTGCAAC GTCTTTTTCG TTGGTTACGT
CAACGTGGTT TAACTGCGCT AATCACTGCC GAAAGTAACA ATACTAACCT GAGTCGGTTC
GGCATCGAGG AGTTTGTCGC CGATTGTGTC ATTCTCCTCG ACTACCGCAT CAGCGAACAC
GTCCTTACCC GCTACGCACG TATCGTGAAG TATCGCGGTT CGGCGCACGG CGTGAATGAA
TACCCTTTCA CGATCGGGCC GCATGGGATC ACCATTCTGC CGATTACCTC ATGCACCCTC
GATTATCCTG CTTCGACCGA GCACATTTCG AGCGGCTTTC CCGATCTCGA CAGTCTGTTG
GATGGTGGCA GCTATTATCG TGGATCAACG ATTTTGATCA GTGGTCCGAC CGGTTCGGGA
AAAACCAGCC TTGGCGCTTT GTTTCTCAAC GCTGCTTGTG CGCGAGGTGA GCCGGCTATC
CTTTTTGGGT TTGAAGAATC GGCCGATCAG ATTATGCGAA ACATGCGCTC CATCGGGGTT
GATCTGCACC AATGGTATGA ACGCGGCTTG CTTCAGATCG TGAGTAGTCG CCCGACCCTG
ACCGGTCTTG AAACACACTT GATCACAATC CTTTCCACCG TTGAGCAATC TGGTGCTCGT
GTGGTTGTCA TCGACCCGAT CACGGGGTTT CACACAATTA GCCGACCGGC CGACATTACT
GCAATGCTTT TCCGCCTGTT CGACGGGCTA AAATCACTGG GAGCTACGAC ACTTGCCACA
AGCCTTACCA CCGCCGGGGC CGATCCGACC CAAAGCGAAG TCAATATCTC GTCACTCGTT
GATACGTGGA TCGTTTTACG CCATCATGAA GCCAACGGTG AACGTAACCG TAGTCTGCTC
GTGCTCAAAT CGCGTGGTAT GCGTCACTCC AATCAGGTGC GTGAGTTGGT GATGGATCGG
CAGGGTCTCA AACTCGTTGA GATTCTTACC GCGGGGGATA CCGTACTGGT CGGTGCCGCG
CGCATCGCCG AACAAGCGCG TCTCCGCTAC GAACAAGAAC TCTACCAACA GGAATCACTT
CGGCGTCAAC AACGTTACGA GCAGCAACGC CGCCTCCTTG CCCTCCAGAT CGAAGCGCTG
CAAGTCGAAC TAGCAGCCCT TGACGAGGAA ATGAAGAGCG AGACGGCTAT CGCGACTGCT
CGCATGCAAG CCCAAGCACA GAGCCAATTA GCGACTACTC GATACCGCAA CGCGCGCAAC
GAGAGCAGCA ATGAATGA
 
Protein sequence
MIVAHIPKAP TGIRGFDEIT GGGVPRGRPT LICGGPGCGK TLFAFETLVH GAAQHDEPGL 
FVSFEESPND LRTNFASFGF NITDLEQRGA LLIEQIPLDQ SDWAEVGEYD LEGLFIRLGL
LIDRIGAKRI ALDTIEALFS AFRNESLLRS ELQRLFRWLR QRGLTALITA ESNNTNLSRF
GIEEFVADCV ILLDYRISEH VLTRYARIVK YRGSAHGVNE YPFTIGPHGI TILPITSCTL
DYPASTEHIS SGFPDLDSLL DGGSYYRGST ILISGPTGSG KTSLGALFLN AACARGEPAI
LFGFEESADQ IMRNMRSIGV DLHQWYERGL LQIVSSRPTL TGLETHLITI LSTVEQSGAR
VVVIDPITGF HTISRPADIT AMLFRLFDGL KSLGATTLAT SLTTAGADPT QSEVNISSLV
DTWIVLRHHE ANGERNRSLL VLKSRGMRHS NQVRELVMDR QGLKLVEILT AGDTVLVGAA
RIAEQARLRY EQELYQQESL RRQQRYEQQR RLLALQIEAL QVELAALDEE MKSETAIATA
RMQAQAQSQL ATTRYRNARN ESSNE