Gene Acid345_3088 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3088 
Symbol 
ID4072652 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3668401 
End bp3671232 
Gene Length2832 bp 
Protein Length943 aa 
Translation table11 
GC content60% 
IMG OID637985107 
ProductTPR repeat-containing serine/threonin protein kinase 
Protein accessionYP_592163 
Protein GI94970115 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.258254 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGGGC TCTCCATAGC GCATTACCAG GTCCTGGAAG AGATTGGGTC TGGTGGGATG 
GGCGTTGTCT ACAAGGCGCA GGACACGCGG CTTGGACGTT TCGTCGCGCT GAAATTTCTT
CCCGAAGAGT TTGCCAACAA CCCTGAGGTG CTCGCCCGGT TCCGGCGGGA AGCGCAGGCT
TCCTCGGCAC TGAACGACCC GAACATCTGC ACCGTCCACG ACATTGTCGA TTACGAAGGC
CGCACCTTCA TTGTGATGGA GTACCTGGAA GGGGCGAACG TCCGCGAACG GATCAAAGAG
CGCGGCCCTT TCGCGATCGA AGAGTTCTTT CGGATCGCAA TCTCAATTAC CGAGGGGCTG
GCAGACGCGC ATCGACACGG CATTTTGCAC CGTGACATTA AGCCCGCGAA CATATTCATC
ACCGACCGTG GCCGGGTGAA GATTCTCGAC TTTGGCCTGG CGAAGATGGG CATCCAGCAG
CTCGGCACAA ATACCGGCGA TGACGATGAC GCGACCAAGA CGCGCGGGTG GGCTTTTGGA
ACGGTCGCCT ACATGTCTCC GGAACAGGCG CTCGGCAAGC CGCTGGACCA ACGCTCGGAT
ATTTTTTCGC TGGGCACGGT TTTTTTCGAG ATGCTCGCCG GTATAACGCC GTTCGAGGGC
GAGACCACCG GCACCGTGTT CCTCGCGGTG GTGCAGAACA CGCCCGTCAT TCCGGTGCAG
GAGATCCCCA ATACTCCTGC CGGGCTGAAA CGAATCGTTG GGAAGTGTCT CGAAAAAGAC
CGCGAAAAGC GCTACCAAAG CATGGCGGAG CTTCGCGACG ACCTGGTTCG GGTGCAACAG
GATCCGGAAG CAGAGCCTGT TGTGGCGCCG GATCCGCACG CCAGAGTTGC GAACAAATCC
GGTAGTTTTA CGCTGACGCT CGTCGTCGCG ATGATTGCGG CCGTTCTGAT CGTGGCGGTG
CTCTTCCTGC GTTATCGGAG CAGCAGCGGG CTGCACTCGC AGGACAAGAT CGTAATCGGC
GACCTCGCGA ACCTTACCGG CGACGCGACC TTCGATCGCA CGCTGCGACC AGCATTAGTC
GTGAGCCTGG CGCAATCGCC ATTCTTCAAG ATCGCAACGG ATCGCGAAAT GACGGACACC
TTGAAGAGGA TGGAGAAACC ACCCGACACG GCGTATTCGC GCGACGTGAC CCGCGAGATA
TGCATTCGGA ATGGTGGCAA AGCGTTCCTC TCCGGCTCCA TTCGCCAGGA CAGAGAACGC
TACGCAGTGA TGCTGGAAGG CGTTGGCTGC AGCGACAGCA AAGTGATCGC CACTGCGACG
ACGGCGGCGA AGGACAGCAA CGGAGTGATC GCTGCGCTGG GTAGTGCCGC GGAACAGCTT
CGCAAGAAGA TGGGTGAGTC GTTGCCATCG TTGCAACAGT TCGATAAGGC GCTGCCGGAT
GCGACGACAG CGTCGTTGCC GGCCCTGCAG GCATTGGCTG CGGGGGCGCT CGCAACCAGG
CAAACCAGCA GCGTGGCTGC AATTCCCTAC CTGCAGCGTG CGGTGGAACT GGACCCGAAT
TTTGCCAGTG CTTACTTCAG CCTGGGATCG GCGTACTACA ACTCGCGACA ACAGGATTTG
GCGAGCGCGG CAATGACCAA GGCGTTCGAA CTGCGCAATC GAGTGACGGA GCGCGAACGC
TTCCAGATTG AAGCGGCTTT TTATCGCAAT GTGACGGGTG AGCTTTCCAA ACAGATTGCG
ACGTGCGAGC AGGCGATCCA ATCGTATCCG GACGACCCTG TGTTTTACAC CTTTCTCGGC
CTTGCGTACC TGCGATCGGG CAATCACCAG GAGGCGGCAC GCAGTCACGA GGCGGCGCGG
CGTCTGGCTC CGGATAAGTT TTATCCGTAC TCAAATTTGA TGGCAACGTA TTTGTACTTG
AGCAAATGGG ACGAGGCCAA ACTGGCCTAT GACGAGGCGC GTAAACGCAA CCTGGACAAC
GAGGCGATGC GCGAGAACCG CTACCTGATT GCGTTCTTCG AGAACGATGA AGGCGGGATG
CGCGAGCAAC TGGATGCGGT AAAAGGGCGC GCGAATTACG AAGACCGCGT GGTGCGATTG
GCGGCAGACA CCGAGAGCTA CCACGGGCGC TACAAGAACG CGCGGGAGAT GGACCGTCAG
GCGCGCGGAG CGGCGGGGAA AGATAACGCG AAAGACCGGG TGGCGGAATA CCTGGCGGTG
CCAGCGTGGC GTGAGGCCGA AATCGGAAAC CGGAAAGATG CGATTCGCCT GGCGACGGAG
GCGTTGACGG ACACCGATGA TCCGCACGTC GAGGCCATCG CGGCGATTGC GTTAGCGCGC
GCCGGAGATT CAGCGGCGGC CGCGGCAGTA GCTGATCGCC TGGCGAAAGA GCATCCGCTC
GATACGAGCA TCCAAAGCTC CAACATTCCG ACGATTCGCG GCATCATCGC GTACAACCAA
GGCAAGTACG AGGACGTGAT CGCCACGCTA CCCGTCTCTA CGCTGGAAAT CGGTTCCGTC
TTCCCGAGTG GAACTGAAGC GACCTACATT CGAGGGCTTG CGTATTTGAA GCTGCAGAAA
GCGGACGAGG CCGCGCTGGA GTTTCAGAAG ATGATTGATC ATCCGGCCGC AGTGGGGAAT
TTCGTGAACC TCGCGTTGGC GCATTTGCAG CTGGCACGCG CCGAACGAAT GCGTGGCAAG
ACCGAAGAAG CGCGGCGTGC GTACCAGGAT TTTCTGGCGC TCTGGAAGGA TGCCGATGCG
GACCTGGCGC CTTTTCGGGA GGCGAAGGCA GAGTATGCCG CGGTGAGTGA ACCGATTCCG
GGAAAGCCGT GA
 
Protein sequence
MEGLSIAHYQ VLEEIGSGGM GVVYKAQDTR LGRFVALKFL PEEFANNPEV LARFRREAQA 
SSALNDPNIC TVHDIVDYEG RTFIVMEYLE GANVRERIKE RGPFAIEEFF RIAISITEGL
ADAHRHGILH RDIKPANIFI TDRGRVKILD FGLAKMGIQQ LGTNTGDDDD ATKTRGWAFG
TVAYMSPEQA LGKPLDQRSD IFSLGTVFFE MLAGITPFEG ETTGTVFLAV VQNTPVIPVQ
EIPNTPAGLK RIVGKCLEKD REKRYQSMAE LRDDLVRVQQ DPEAEPVVAP DPHARVANKS
GSFTLTLVVA MIAAVLIVAV LFLRYRSSSG LHSQDKIVIG DLANLTGDAT FDRTLRPALV
VSLAQSPFFK IATDREMTDT LKRMEKPPDT AYSRDVTREI CIRNGGKAFL SGSIRQDRER
YAVMLEGVGC SDSKVIATAT TAAKDSNGVI AALGSAAEQL RKKMGESLPS LQQFDKALPD
ATTASLPALQ ALAAGALATR QTSSVAAIPY LQRAVELDPN FASAYFSLGS AYYNSRQQDL
ASAAMTKAFE LRNRVTERER FQIEAAFYRN VTGELSKQIA TCEQAIQSYP DDPVFYTFLG
LAYLRSGNHQ EAARSHEAAR RLAPDKFYPY SNLMATYLYL SKWDEAKLAY DEARKRNLDN
EAMRENRYLI AFFENDEGGM REQLDAVKGR ANYEDRVVRL AADTESYHGR YKNAREMDRQ
ARGAAGKDNA KDRVAEYLAV PAWREAEIGN RKDAIRLATE ALTDTDDPHV EAIAAIALAR
AGDSAAAAAV ADRLAKEHPL DTSIQSSNIP TIRGIIAYNQ GKYEDVIATL PVSTLEIGSV
FPSGTEATYI RGLAYLKLQK ADEAALEFQK MIDHPAAVGN FVNLALAHLQ LARAERMRGK
TEEARRAYQD FLALWKDADA DLAPFREAKA EYAAVSEPIP GKP