Gene Rcas_4148 details

Gene Information

Locus tag: Rcas_4148
Symbol:
ID: 5541659
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5370042
End bp: 5371574
Gene Length: 1533 bp
Protein Length: 510 aa
Translation table: 11
GC content: 58%
IMG OID: 640896259
Product: circadian clock protein KaiC
Protein accession: YP_001434197
Protein GI: 156744068
COG category: [T] Signal transduction mechanisms
COG ID: [COG0467] RecA-superfamily ATPases implicated in signal transduction
TIGRFAM ID: [TIGR02655] circadian clock protein KaiC
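
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it reproduces the stated 1533 bp) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_len = feature_length(5370042, 5371574)
print(gene_len)                  # 1533
print(protein_length(gene_len))  # 510
```

Both results match the Gene Length and Protein Length fields of this record.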


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0860299
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACTCAGA ATCCGCGTCA ACCAGCTATC GAGAAACTGC CAACCGGGAT CAGCGGCTTC 
GATCTGATTG CCAATGGCGG CATTCCAGCC GGTCGCACCA CGCTGGTTTC CGGCACTGCC
GGAAGCGCTA AGACGGTCTT CGCGGCACAG TTTCTCGCTG AAGGAATTCG CAAATATGGC
GAGCCTGGGG TGTTCGTCAC CTTTGAAGAG CCGCCGTCTG ATATTCGTCG CAATATGGAG
TCGCTCGGAT GGGATATTCC AGCCTGGGAG AGCGAAGGGC GCTGGGCATT CGTCGATGCG
TCGCCACAAC CGGGTGAGGA AGTCCTGTTT GCGGGCAATT ACGATCTTGG CGCCCTCCTG
GCGCGCATCG AAAATGCGGT GCGTCGGATT GGCGCCAAAC GCCTGTCGAT GGACTCGCTC
GGCGCGATCT TTACGCAGTT CGCCGACAGT GCAGTGGTGC GCCGCGAGTT GTTTCGCGTC
GTCTCGGCGC TGAAACGGAT GGGTGTCACG GCGATCATGA CCGGCGAACG CACGCAGGAG
TATGGCGACA TTGCGCGCTA CGGCGTCGAA GAGTTCGTCG CCGATAATGT GATCATCCTG
CGCAATGTTC TGGAAGATGA GAAACGTCGG CGCACCATCG AGATTCTGAA GTTCCGCGGT
ACATCCCACC AGAAAGGCGA GTTCCCCTTC ACGATCATGT CGGGTGAGGG CATGGTCGTT
ATCCCGCTCT CAGCGATCGA ACTTAAGCAG CGCTCGTCTG ATATTCGTGT CACCTCGGGT
AACCCGGAAC TCGACCGTAT GTGCGGCGGC GGCTTCTTCC GCGACTCGAT CCTGCTGGTG
TCGGGTGCGA CAGGAACCGG CAAGACCCTC ATGACCACCG AGTTCATGGC GGGTGGGGTC
AATCGTGGCG AACGGTGTCT GCTCTTTGCG TTCGAGGAAA GCCGTGAACA ACTGTTCCGT
AATGCAACCG GCTGGGGCGT TGATTTCGAG CAGATGGAGC GTGATGGGTT GCTGCGCGTC
GTCTGTGAGT ATCCAGAGAC GGCAGGGCTT GAGGATCACC TGATCGCTAT GAAGAAGTTG
ATCCAGGAGT TCCGCCCCAA TCGTGTGGCG GTCGATAGCC TCTCCGCTCT CGAGCGCGTC
TCGACGATCC GCGGCTTTCG CGAGTTTGTG ATCAGCCTGA CGTCGTTTAT CAAGCATCAG
GAGATCGCCG GTCTGTTTAC CGCGACGACG CCAAGCCTGA TGGGGGGCGA GTCGGTGACC
GAAACCCATA TTTCGACCCT GACCGACTCG ATCATTCTGC TCCGATATGT TGAGATGTTC
GGCGAAATGC GCCGCGGGTT GACGGTGTTG AAGATGCGTG GCTCGATGCA CGATAAGGAC
ATTCGTGAGT TTACAATCGA TAATCGCGGG ATGCATATTG GTCGTCCGTT TCGCCAGGTG
ACCGGCATTC TTTCGGGGAA TCCGATCCAT ATTGCGCCCG GTGAGATTGA TCGCATGAAT
CAGTTGTTCA ATGACGATGA ACAACAGTCG TGA
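
The GC content field of this record can be recomputed from the gene sequence. The following is a minimal sketch; `gc_percent` is a hypothetical helper name, and it assumes the sequence contains only A, C, G, and T plus the whitespace used to group bases into blocks of ten.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    dna = "".join(seq.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    gc = dna.count("G") + dna.count("C")
    return 100.0 * gc / len(dna)


# First ten-base block of the gene sequence above: 3 G + 2 C out of 10 bases.
print(gc_percent("GTGACTCAGA"))  # 50.0
```

Passing the full gene sequence to the same function gives a whole-gene value that can be compared with the reported GC content.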
 
Protein sequence
MTQNPRQPAI EKLPTGISGF DLIANGGIPA GRTTLVSGTA GSAKTVFAAQ FLAEGIRKYG 
EPGVFVTFEE PPSDIRRNME SLGWDIPAWE SEGRWAFVDA SPQPGEEVLF AGNYDLGALL
ARIENAVRRI GAKRLSMDSL GAIFTQFADS AVVRRELFRV VSALKRMGVT AIMTGERTQE
YGDIARYGVE EFVADNVIIL RNVLEDEKRR RTIEILKFRG TSHQKGEFPF TIMSGEGMVV
IPLSAIELKQ RSSDIRVTSG NPELDRMCGG GFFRDSILLV SGATGTGKTL MTTEFMAGGV
NRGERCLLFA FEESREQLFR NATGWGVDFE QMERDGLLRV VCEYPETAGL EDHLIAMKKL
IQEFRPNRVA VDSLSALERV STIRGFREFV ISLTSFIKHQ EIAGLFTATT PSLMGGESVT
ETHISTLTDS IILLRYVEMF GEMRRGLTVL KMRGSMHDKD IREFTIDNRG MHIGRPFRQV
TGILSGNPIH IAPGEIDRMN QLFNDDEQQS
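
The gene sequence begins with GTG, yet the protein begins with M. This is expected under translation table 11 (the bacterial, archaeal, and plant plastid code), where GTG is an accepted start codon and an initiating codon is translated as methionine. The sketch below illustrates this under a few assumptions: the codon table is built by hand from the standard amino acid assignments (which table 11 shares with table 1), the set of table 11 start codons is hard-coded, and the input contains only unambiguous bases (an `N` would raise `KeyError`). `translate_cds` is an illustrative name, not a library function.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons listed for NCBI translation table 11.
START_CODONS_11 = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq: str) -> str:
    """Translate a CDS; a valid first codon becomes M, and a stop ends translation."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS_11:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGACTCAGA ATCCGCGTCA ACCAGCTATC"))  # MTQNPRQPAI
```

The output matches the first ten residues of the protein sequence above. Translating the final bases of the gene (`CAGTTGTTCA ATGACGATGA ACAACAGTCG TGA`) in the same way yields `QLFNDDEQQS`, and translation halts at the terminal TGA stop codon.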