Gene Acid345_4087 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4087 
Symbol 
ID4072509 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4841817 
End bp4845149 
Gene Length3333 bp 
Protein Length1110 aa 
Translation table11 
GC content62% 
IMG OID637986118 
ProductTPR repeat-containing serine/threonin protein kinase 
Protein accessionYP_593161 
Protein GI94971113 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTTCG GGGATCCGGA AGACGATAAG CGCGATATCG GCAGCGACGA CTCGCTTCTG 
CCCACTGCCG GTCAGGCCGA ACTAAGTGCG CCCGAAACCG CCGACCACGC CCTCGTCTCC
ATCCCCAATA GCACTGCGGA ACACTTACCG GGACGCACAC CCGGCCGTCT CGAGTCCGGT
GATCGCCTCG CCAATCGCTT CCGCATCGTT CGCTTTATCG CCAAGGGCGG CATGGGCGAA
GTCTACGAAG CCGAGGACGA AGTCGTCCGC GTTCGCCTCG CGCTCAAAAC CATTCTTCCC
GCTATCGCCG CCAATCCCGA CATGGTCGAG CAACTCAAGC TCGAAATGCG CTCGGCGCGC
CGCGTCACTC ATCCCAACGT CTGCCGCCTC ATCGAAGTCT TCGAGGATCG TGATCGCCAG
CCCGCCGTGA TGTTCCTCAC CATGGAACTT GTCGAAGGCG AATCGCTAAG CGACGTCATT
CACCGCAGCG GTCCGTTGCC GTTGCCGCAG TTCTACCGCA TCGCCGCCCA AATCGCTGCC
GGGCTCGACG CCGTCCATCG CGCCGAAATC GTCCACCAGG ATTTCAAAAC CTCAAATGTC
TTGCTCGTTG GATCCGGCGA GAGTACGCGC GCCGTCGTCA CCGATTTCGG TCTCGCCGTG
AATCTCCAAG CCACCGGACG CGACCGCGGC GTCGCCGGCG GCACGCCCGC GTACATGGCG
CCGGAACAGG TCGATGGCAA AACTGAGATC ACCCCCGCCG CCGACATTTA CGCGTTCGGC
GTCGTTCTGT ACGAACTCCT CACCGCTCGT TTCCCGATAG CTGCGACCAC CCGTCGCGAG
GCCCTGGATC GCAAGCTCAC CGAGCGTCCG GTTTCGCCCT CGCACTATCG CCCCGACATT
CCGAAGTACA CCGAGCGCGC GATCCTCAAA TGCCTGGAGC GCAATCCACA AGATCGCTTC
GCCAGCGTCA TGGACGCACT CGCCGCCGTC GAGGGCCGCG CCGAGAAGCG CCGCAAGAAG
CTCGCTGCGT TCGCCACGAC CGTCGCTGTT CTCCTCGCAG TCGCCAGCTT CGGCGGCTAT
GCCGCACGCC GATGGTTCCT CTCCCATCGC ACGCCCACCG TCGCGGTGCT ATCGCTGCAC
GATGCCTCCG CCCGCACCGA GTCCACGTCC ACCGGCACCG AACTCACCGA ACTCCTCACC
AGCAATCTCG GCCTGTCAAA GGGGTTAACC ACCGTCCCTG CCGAAGACGT CTCACTCGCA
CAAGGCGAGT TTCCGGTCAC AGCCAGTGCG AGTTTCGAGC AGGAAGATCT CTCCGCCTTC
CGCCGCGCCA TCGGCGCCGA CTATCTCGTT ATCGGCAAAT ACACGAACGC ATCCGATGGC
AAGCTCGCCT TTGACCTGAA GCTCGAGCGC CCCGACGGCG GCACTCTCGA TTCCATCCAC
GAAGAAGGCA CCGAGCAAAA TGCTGGCGCG CTCATCGCTG ACGCCGCCTC CAAAATTCGC
CAGCGCCTAG GCACCCAGCT TCTCTCCGAC AGCGAAACCG AAGAAGCCGA AAACATCTAT
CCGCGGACCG ACGAAGGACG CAAGCTCTAC TTCCAGGCAC TTGCGCAGTT ACGCGCGCTC
AACAGCGTCG AGGCCGCGTC GCTCTCCAAG AAGGCCGCCA ATGCCGAACC CGACAATCCC
TCCATCCACG CCACATACGC GGAGGCTCTT AATCTCCTGA AAAACATGCC GGCTGCGCAG
CAGGAAGCGA AACGTGCCGC CGAACTCGCG CAAGGGGGCA AGCTTCCGCC GGAATTCGTC
ACCTTGTTAG AAGCCCGCTC CGCTGAACTC AACAACGACT GGAAGACCGC GATCCAGAAG
CTCGACGCCC TCTTCACCTT CACTCGCGAC AACCTGCAAT ACGGCCTCAT GCTCTCGAAC
GCGCAAACCC TCGGCGCGCA ACCTTCCGAC GCCCTCAAGA CCATCGCCCG TCTCTCCAAG
CTAAAAGCGC CTGCTGGCAC CGATCCCCGT ATCCAAATCG CCGCCGCCGA AACCTACGCT
GCCATGGGAA ACTACACCGC AGAGATCCAG TCCGCCGAAC GCGCGGTTCG TGATGCGCAA
GCCCGTTCCT GGCGCATGAT GCAGGCCAAA GCCTCTTTGC AACTCTGCTG GGCTTATCAA
CGCAACGGTG ACTCCGCGAA AGCTCTCGCC AGTTGCGACA CCGCGCGAAC AGTCTTCGCC
GACTTCGGCG ATGGCGTGAG TGGCGCCGTT GCGCTCAATC GGATCGCCAA CGAGCTCGTC
ACCCGCGGCC AGTACCAGGA AGCCAAGAAT GCCTATGATC GCGTTCTCGC CATCGTCACC
AAAGCTCAAT CGCAGCACGA CATGGCGGGT GCACATCTCA ACCTCGCGCT GACCTTGTTG
AACCTTGGCG ACCAGAAAGC TGCGCAGCAA CACGCCGGCC AGGCCATCGA GATCGCAGCG
CACAGTGGCG ATCGCTACGA CGAAGCCCGC GCCCGCCTCA TCTCTGCCGA CCTCCTGCGC
GCGAGCGATG ATCTTCCCGC CGCCATTCAG CAGGCCCGCC TTGCGCAACA GGTCGCGCAC
GACGCCCAGG ACCGCGACGC CGAAGGCTAC GCCCTCAACA ATCTCGCGCT CTATCTCCAG
GAATCAGGCG ACAGTGAAGC CGCGTTCAAC GCCGCGCATC AGGCTCTCGA TATTCGCAAA
CAACTCGGCG TCCCCACCAG CATCTCCGTT ACTCAAGCCC TACTCGGAGA TCTCTACCTC
GAGCGCGGCG ATCTTCCTCA CGCCCGCAGC AGTTACGCAG CTGCTCTCCA ATTACAAGAG
CCCACCGCGA AGGCCCAAAT CGCACAGCTC CAACTCGCCG CGGCGCAGGC AGATTTTCAA
TCCGATGAAT TCGACGCTGC CCTGAAAAAT GCCCAAACCG CCGTCGCGGA ATTCCAGCGG
GAGAAGGATT CCGAAGAGAC CATCGAGGCC AACACCCTCA TCCTGCGCAT TCTCACTCGC
AAGAAAGATC TGGCTGCGGC TCGGCCTTAC TACGAACAGC TCGCGCAGCA GCCTTCTCAG
GACCACGACA TCGCCCTCGC CGCTGCGGTC GCGCGCGCTG AGTTTTTGAT CGCTTCAGCT
CAGCCAGTCG ACGCCGCGGC ACTCCTCCGC CCATTGCTCG GGCTCTCTGA AAAGCCGAAC
TACCTGAATC TCGAAGCTCG CCTGGTGCTG GCCCGCGCCC AACAATTTTC CTCGAAACCC
CAGTCCGTCA CCGATCTTCG CGACATTGCC TCTCAAGCCG AAAAACTGGG CTTTCATCAC
CTCGCCGCCG AAGCGCGCCA ATCTCTCCAC TAG
 
Protein sequence
MSFGDPEDDK RDIGSDDSLL PTAGQAELSA PETADHALVS IPNSTAEHLP GRTPGRLESG 
DRLANRFRIV RFIAKGGMGE VYEAEDEVVR VRLALKTILP AIAANPDMVE QLKLEMRSAR
RVTHPNVCRL IEVFEDRDRQ PAVMFLTMEL VEGESLSDVI HRSGPLPLPQ FYRIAAQIAA
GLDAVHRAEI VHQDFKTSNV LLVGSGESTR AVVTDFGLAV NLQATGRDRG VAGGTPAYMA
PEQVDGKTEI TPAADIYAFG VVLYELLTAR FPIAATTRRE ALDRKLTERP VSPSHYRPDI
PKYTERAILK CLERNPQDRF ASVMDALAAV EGRAEKRRKK LAAFATTVAV LLAVASFGGY
AARRWFLSHR TPTVAVLSLH DASARTESTS TGTELTELLT SNLGLSKGLT TVPAEDVSLA
QGEFPVTASA SFEQEDLSAF RRAIGADYLV IGKYTNASDG KLAFDLKLER PDGGTLDSIH
EEGTEQNAGA LIADAASKIR QRLGTQLLSD SETEEAENIY PRTDEGRKLY FQALAQLRAL
NSVEAASLSK KAANAEPDNP SIHATYAEAL NLLKNMPAAQ QEAKRAAELA QGGKLPPEFV
TLLEARSAEL NNDWKTAIQK LDALFTFTRD NLQYGLMLSN AQTLGAQPSD ALKTIARLSK
LKAPAGTDPR IQIAAAETYA AMGNYTAEIQ SAERAVRDAQ ARSWRMMQAK ASLQLCWAYQ
RNGDSAKALA SCDTARTVFA DFGDGVSGAV ALNRIANELV TRGQYQEAKN AYDRVLAIVT
KAQSQHDMAG AHLNLALTLL NLGDQKAAQQ HAGQAIEIAA HSGDRYDEAR ARLISADLLR
ASDDLPAAIQ QARLAQQVAH DAQDRDAEGY ALNNLALYLQ ESGDSEAAFN AAHQALDIRK
QLGVPTSISV TQALLGDLYL ERGDLPHARS SYAAALQLQE PTAKAQIAQL QLAAAQADFQ
SDEFDAALKN AQTAVAEFQR EKDSEETIEA NTLILRILTR KKDLAAARPY YEQLAQQPSQ
DHDIALAAAV ARAEFLIASA QPVDAAALLR PLLGLSEKPN YLNLEARLVL ARAQQFSSKP
QSVTDLRDIA SQAEKLGFHH LAAEARQSLH