Gene Acid345_3116 details


Gene Information

Locus tag: Acid345_3116
Symbol:
ID: 4070230
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3702834
End bp: 3705470
Gene Length: 2637 bp
Protein Length: 878 aa
Translation table: 11
GC content: 60%
IMG OID: 637985135
Product: TPR repeat-containing serine/threonine protein kinase
Protein accession: YP_592191
Protein GI: 94970143
COG category: [K] Transcription
    [L] Replication, recombination and repair
    [N] Cell motility
    [R] General function prediction only
    [T] Signal transduction mechanisms
    [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0515] Serine/threonine protein kinase
    [COG3063] Tfp pilus assembly protein PilF
TIGRFAM ID:
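
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS that ends with one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(3702834, 3705470)
print(length)                  # 2637
print(protein_length(length))  # 878
```

Both results agree with the Gene Length and Protein Length fields of the record.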


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTCCC CAGCCCCGGC CACAGGCGAA AACATTGGTC ATTACCGCGT GCTCGGAAAG 
CTCGGCGCCG GTGGGATGGG TGTGGTCTAC AAAGCGCTCG ACACCAAGCT CCAGCGCACC
GTCTGCCTGA AGTTTCTCCC CTCCGATACC GCCCTCAGTG ATCGTGATCG CCGCAATCTA
TTGCAGGAAG CTCGCGCGGC CTCCACGCTC GATCACCCGA ACATCGGCGC GATTTACGGG
ATCGAGGAAA CGCCGGATCA GCATCCGTTC ATCGTGATGG CGTATTACGA AGGCCAGACC
CTCGCGCAGG CCATGGACAG CGGTGCCGCC GGTGCGCATC CCCTGGATAT CGTTTGCCAG
GTTGCTCGCG GATTGGCAGC CGCCCACGCC CGCAATATCG TCCATCGCGA CGTCAAGCCC
TCCAACATCA TCCTTACCAC CGACAATGTC GCCAAGATCG TGGACTTCGG TCTTGCTCGC
GTCGTCAGCA GCTCCGCCAT GACGCAGAGC ATGCACACCT CAGGCACTCT GCCGTACATG
GCTCCTGAAC AGGTGCTTGG CGAAGCGATC ACGCCGGCCT GCGACGTGTG GGCCCTTGGC
ATCATCCTCG TGCAACTGCT CACCGGCTCC TATCCTTTCA TTCGGGAGAA CACCACCGCG
ATTGCGTTCG CGATCGTGAA TCTACCGCCC GCAGGCATGG AACTTCTGCC GCAAGCGCTG
GCTCCTGTCG CCTACCGCGC CTTGGCAAAG CAGCCGGAAC ACCGCTATCC GACGGCGAAG
GAATTCCTCG CAGCCCTCGT GGCTGCGAAC GACGAACTCG CAGCATCCGC GCGGGGTGCA
GACACCGACA GTCCAACCCG CAGCAATGCC ATCAGCGCAA AAGAGCTCAA ACAATTGGCC
GAGCATGCAT CGACAGCGCG CTGGAACACG CAGCAGCGCC AGACATCGAA GCGTGTTCTG
TATGCGGCGA TCGTCCTACT CGTTGCGGTC GTCGTCGCTT TGCTGCTGCC ATCGGTTCGC
ACCCGGCTCA ACACCGTGAT TCCCTCCAAT ACGGTGGAGC ACATCGCCGT GCTCCCTTTC
GACAACGCCT CCGGCGATCC CAGCAACGAG GCCATTGCTG CCGGCTTAAT GGATTCCTTG
ACCGGTGAGC TTTCCAATCT CAGCGCCGGC AAGCAGACGC TCTGGGTCGT CCCGGCCAGC
GTAGTTCGCG CCCACAAGAT CTCCGACCCC ACCGCCGCCG CCAAAGTCCT CGGTGCGAGT
CTCGTAGTGA AGGGAAGCAT CCAGCACAAC GGTGATGACG TTCGCCTCCG CGTGGATCTC
ATTGACGCTC GCAACCTGCG GCAGATCGGC TCAGCGACTC TCGAAGACCG AACTGGCGAC
ATTGGCGCGC TTCAGGATGA GGCTGTCGTC CGCCTCGCCG GATTGATGAA CATCAAGCTC
TCAACGGAGA TGCTGCGAGC CACTGGCGGC CGCGGTTCGC CTGCCTCGTA CGAGCTCTAC
CTGAGAGCGC TCGGCTACAT GCAGCGTTAC GACAAAGCCG GCAATCTCGA CCAGGCCATT
GAAGACCTCA ACCAGTCCAT ACACCTCGAT CCGCAATTTG CGCTGGGCTT TGCTTCTCTC
GGCGAGTGCT ATCGCCTGAA GAACGTCGTC GATCCCAAGC AGAAATGGGT GGACCAGGCG
CTTGCGAACC TGCAGCACGC GATGCAATTG AACGACCGCG TTGCCGCACC GCACGTTTCG
TTGGCGTGGT TACAGTCTGC GCTTGGTCAA CATGACCTCG CCTTGCAGGA GTATCAGAAA
GCGCTCGCGA TCAATCCACG CGATCCAGAA GCCGTAAAGG GCCTGTCGCG CGAGTACGAG
CGCGCCGGAC GCACCGCCGA CGCGGAAGCC GGCTTTAAAC AGGCAATCCT TCTGCGGCCT
GATTATTGGG ACAGCTATAA CGCGCTCGGT TCTTTCTACG TTCGCCAGCA ACGCTATCCG
GAAGCGATTG CGCAGTTCCG GCGCGTGCTC GATCTCACGC CCGACAATTC CGCTGCTTAT
AGCAATGTCG CTGGAGTGTT GTTGCTCATC GGCGATCCCG CCTCGCAAAA AGAGGCCGAA
ACGGCTTTGC GCCGCTCCCT TGACCTCTCG CCGTCTTATG CTGCCTACGC CAATCTCGGT
CGTCTTTATA TGAGCCAGAA GCGTTATGCC GAAGGCGTTG ACATCACGCG CAAAGCGCTC
TCGATGAACG ATCAGAATTA TGAAGTGTGG GCGAACCTCA CCGTCATGTT GCAGTGGATG
CATGATGACG TCGGTGCCGC CGACAGCCGC GCCCATACCT TCGCACTTCT TAAACCCTAC
GTCGTCGCGC ATCCCGAGGA TGCCAACGCC CATTCCTCAC TTGCCACGCA TTACGCCAAG
GCCGGCGATC GAACCAATGC AATGCGCGAG ATCGATGCCG CCCTCGGGCT CCAGCCCCAT
GATTCCACCG TGCTCGCCGA CGCCGCTGAA GTGTACGAAG ATTTCGGCGA CCGCAAGCGC
GCCATCGATT TCGCACAGAA AAGCCTGAAA AACGGCAACA GTCTCGATGA TTTGCAAGTC
CGGCCCGAAT TGCAGCAGCT TCTCAAGGAC CCCGGGTTCC GAAGTAATCC GAAATGA
 
Protein sequence
MTSPAPATGE NIGHYRVLGK LGAGGMGVVY KALDTKLQRT VCLKFLPSDT ALSDRDRRNL 
LQEARAASTL DHPNIGAIYG IEETPDQHPF IVMAYYEGQT LAQAMDSGAA GAHPLDIVCQ
VARGLAAAHA RNIVHRDVKP SNIILTTDNV AKIVDFGLAR VVSSSAMTQS MHTSGTLPYM
APEQVLGEAI TPACDVWALG IILVQLLTGS YPFIRENTTA IAFAIVNLPP AGMELLPQAL
APVAYRALAK QPEHRYPTAK EFLAALVAAN DELAASARGA DTDSPTRSNA ISAKELKQLA
EHASTARWNT QQRQTSKRVL YAAIVLLVAV VVALLLPSVR TRLNTVIPSN TVEHIAVLPF
DNASGDPSNE AIAAGLMDSL TGELSNLSAG KQTLWVVPAS VVRAHKISDP TAAAKVLGAS
LVVKGSIQHN GDDVRLRVDL IDARNLRQIG SATLEDRTGD IGALQDEAVV RLAGLMNIKL
STEMLRATGG RGSPASYELY LRALGYMQRY DKAGNLDQAI EDLNQSIHLD PQFALGFASL
GECYRLKNVV DPKQKWVDQA LANLQHAMQL NDRVAAPHVS LAWLQSALGQ HDLALQEYQK
ALAINPRDPE AVKGLSREYE RAGRTADAEA GFKQAILLRP DYWDSYNALG SFYVRQQRYP
EAIAQFRRVL DLTPDNSAAY SNVAGVLLLI GDPASQKEAE TALRRSLDLS PSYAAYANLG
RLYMSQKRYA EGVDITRKAL SMNDQNYEVW ANLTVMLQWM HDDVGAADSR AHTFALLKPY
VVAHPEDANA HSSLATHYAK AGDRTNAMRE IDAALGLQPH DSTVLADAAE VYEDFGDRKR
AIDFAQKSLK NGNSLDDLQV RPELQQLLKD PGFRSNPK
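
The two sequence listings can be checked against each other and against the reported GC content. The sketch below assumes the sequences are pasted in as plain strings with the 10-character block spacing shown above. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard code (the tables differ only in permitted start codons). All helper names are hypothetical.

```python
from itertools import product

# Codon table in the usual T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record above.
first_line = clean_sequence(
    "ATGACCTCCC CAGCCCCGGC CACAGGCGAA AACATTGGTC ATTACCGCGT GCTCGGAAAG"
)
print(translate(first_line))  # MTSPAPATGENIGHYRVLGK
```

The printed fragment matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein listing and the reported GC content.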