Gene Acid345_4393 details

Gene Information

Locus tag: Acid345_4393
Symbol:
ID: 4073299
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 5214627
End bp: 5217686
Gene Length: 3060 bp
Protein Length: 1019 aa
Translation table: 11
GC content: 61%
IMG OID: 637986426
Product: TPR repeat-containing serine/threonine protein kinase
Protein accession: YP_593467
Protein GI: 94971419
COG category:
    [K] Transcription
    [L] Replication, recombination and repair
    [R] General function prediction only
    [T] Signal transduction mechanisms
COG ID: [COG0515] Serine/threonine protein kinase
TIGRFAM ID:
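
The coordinates and lengths listed above are mutually consistent, which a short sketch can check. It assumes 1-based, inclusive coordinates and that the gene length includes the terminal stop codon while the protein length does not. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one stop codon at the end."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5214627, 5217686)
print(length)                  # 3060
print(protein_length(length))  # 1019
```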


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.618951
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATAGGGC AGACCTTTTC GCATTACCGC ATAACCGAGA AACTGGGTGG CGGCGGCATG 
GGCGTGGTGT ACAAGGCAGA AGACACGCGC CTGCATCGCT TCGTCGCGTT AAAGTTTCTT
CCGCCCGAAC TGGCGCGCGA TCCCCAGGCG CTGGCGCGTT TCCAGCGCGA GGCGCAGGCC
GCATCCGCCC TCAACCATTC CAACATCTGC ACCATCTACG ACATCGGCGA AGACAACGGG
CAGGCGTTCA TCGCCATGGA GTACCTGGAT GGTGTAACGC TCAAGCATAG GATCGAGGGC
CGGGCGCTCG ATCTCGAAGT TCTGCTGCCG ATCGCGATCG AGATTGCCGA TGCGCTCGAT
GCGGCCCACG CCGCGGGCAT TGTGCATCGC GACATTAAGC CGGCGAACAT TTTCATCACC
AAGCGGGAGC ACGCCAAGAT CCTTGATTTC GGGCTGGCGA AGGTCGAAGT CCTGGCGAGC
ACTTCCGCGG CGACCATGAC CGCCGGCGTT GATGAGAAGC AGCTGACGAG TCCCGGGTCC
ACCCTCGGTA CAGTGGCGTA CATGTCGCCG GAGCAAGCGC GCGCGAAGGA ACTGGATGCG
CGCACTGACT TGTTCTCGTT CGGCGCGGTG CTCTACGAGA TGGCGACGGG GCAACTGGCC
TTTCGCGGCG ACAGCACGGC GACCGTGTTT GAGGCGATCC TGAATCGTGC GCCGGTGCCC
CCGGTGCGGC TGAATCCGGA CCTGCCGCCG AAGCTGGAAG ACATCATTAA TAAGGCGCTG
GAGAAAGACC GCAACCTGCG CTACCAGCAC GCGGCCGACA TCCGGGCAGA CTTGCAGAGG
TTGAAACGTG ATACCGAGAC GGGTCGCAGC GCCGCCGTGG TGGCGATGCC CGACTTCATG
GAGCAGACCG TCACAGTCCC CCCGCCTTCG TCAAAAAAAT TGAGCGCGGC GACAACTCCG
GCAGCGGCAT CCACGCCAAG TGCACCGGCC GCGGCCGGTT CGTCGGCAAT CGTGCAGGCT
GCAGGCGGCC GCAATGGTCT GATCATCGGG ATCGCAGGAA TTTTGATCGC GATTGCGGTC
GCGGCATTCT TCCTGTTGCG CGGTGGCAAA CCTTCTACGA CCGGAAACAG CCGCGCAGCC
CACAAGGCAA TCGCCGTTCT CTACTTCAAC AACCTCACCC AAGACCAATC TTTGAACTGG
CTCGACAACG GGCTCACCGA CATGCTCACC ACCAACCTTG CGCAGGTGAA GGGCCTCGAC
GTGCTCTCCA CCGACCGCAT CATGACGGCG GTGCGCGGCG TGAGCAAAGA CGGCAAAGGG
CTCGATCCGT CGCAGGCTCA AAAAGTTGCG CGCGATGCCG GCGCCGATGC GTTCATCACC
GGAGCGTTGT TGAAAGTCGG GCCAACGCAA TTACGGCTTG ACGTCCGCGC GCAGGACACC
AACTCCGGGC AGATCCTGTT TTCCGACAAA GTGGAAGGCG CGGACGTGCA GAGCATTTTC
GGGATGGTCG ACCGCCTGAC TGCGAACCTC GCAGGGACGT TCCTGCCGGA ATCGGACCGT
CCGCAAAAAG GGCCGGAGAT CGAGCAGGCT TCCACCTCGA ACCTGGAGGC GTACAAGCAC
TACCAGCTGG GCAGGGATTA CTCGGAGCGT TTCCTCACGG AGGACGCGGT TCGCGAGTAC
GAAGAAGCGG TGCGGCTCGA CCCGCAATTC GCGATGGCGA TGATCCGCTT GGAGGGAGAG
TTTGCGATTT CAGGCGACCT CAAGTCGGGC TTCGAGTGGG CCGCCAGAGC GCAGCAATTG
CAGGCACGGC TGCCACGCTA TGAACAACTT CAACTGAATT TGTTGCTCTC TTCCCGGGGC
GGCGATTACG ACGCGCAAAT CGCTCTCCTG CGTCAAAACG GTGAAGAGTT CCCGCGCGAC
ATGGCAAACC GGGCGCTGAT TGGGCATCAC CTGTCGGTCT ACCAGGGGAA GGCCAGCGAG
GCGGTTGCGT TCTTAAAAGG CCTGCAGGCC GAGTATCCCC ACGACGAAAA CGTGCTGAAC
TTCCTGACCT ATGCTTCGGC GGAAGCCGGC GACCTGAACG GCGCAATGGC GGCAAATGAT
GCGTACATCC AGGTGCGGCC CGGAGATCCG AATCCGTTCG ACACGCGCGG CGACGCCCTT
TTCTTCGCCG GCCACAATGA CGAAGCGGTG ACGTCTTATC GCAAGGCGAT GGAACTGGGC
TTCAACGAAC ACGACAAACT GGCGATCGTC TACGCCGAGC AGAACAAAAC CGACATGGCC
CGGGCGTCGC TTGCACAATT CAAGGAAAAA GCATCGCCGC TCAACCAACT GTACGCGTCG
GTCTTCGAAG CGCAATTCGC CCAAAGCCGC GGCGACGTGG AAGGCGCGAT TTCCGCCTAT
CGCACTGCGA TCAAGGGCCT GGAGTCCGCG AAGCAGAATG ACGTGGCCGG AGACATCCTG
CTGCGCGCCT CGGAGTTGTC GGTGTTATTA GGACAAACGG CGCCGGCGCT CGCCTATGCG
CAGCAACAAA AGCTGGGCGG AGCGGAAGGC GCTGCGGTCG CCTTCCTCGA AACCATGCAG
GGCAACGAGG CGGCTGTCGA TCCGGCCCTG CAAAAATTCT TAACCGCCAA GCCGTGGCTC
TCGCCATTCC AGCAGAACCT GGTGCGCAGT AGGAATGCCC TTTGGCTCGC GATTGCGCAC
AACGACGCTC CGGCAGCAGC CGCGCAAGTC GCGCGATTGC CGAATGCGCA GATGCCGTAC
CTGCTCTTCC TGAAAGGCCG GGCCCACTTG CTCGCCAACG ACCTGGCGGC AGCCGAGTCG
GACTTCAAGG CCACGCGGCA ATGGGACCGT AACCTGGAGA ACTTCCGCGT GATGAGCTTA
CGCACACCCA TTTGGAGCGT GCTGTCGTCG TTCTATCTCG GCCAGGTCTA CGAACGTTCC
GGAAAGCGCG ACGAAGCCAT CAACTCTTAC CAGCAGTTCC TGTCGCACTT CGAAACATCG
CACACGAAAT TGCCACAGGT GGCCGAAGCG CGTACCGCGC TCAACCACCT GATGAAATAA
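
The GC content reported for this gene can in principle be recomputed from the sequence block above. The following is a minimal sketch, assuming the sequence is pasted as text with the 10-base spacing shown here. `gc_content` is a hypothetical helper name, and the result for the full gene may differ slightly from the listed figure depending on rounding.

```python
def gc_content(seq_block: str) -> float:
    """Fraction of G and C bases in a whitespace-formatted nucleotide block."""
    seq = "".join(seq_block.split()).upper()  # drop spaces and line breaks
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence, copied as displayed.
first_row = "ATGATAGGGC AGACCTTTTC GCATTACCGC ATAACCGAGA AACTGGGTGG CGGCGGCATG"
print(f"{gc_content(first_row):.1%}")
```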
 
Protein sequence
MIGQTFSHYR ITEKLGGGGM GVVYKAEDTR LHRFVALKFL PPELARDPQA LARFQREAQA 
ASALNHSNIC TIYDIGEDNG QAFIAMEYLD GVTLKHRIEG RALDLEVLLP IAIEIADALD
AAHAAGIVHR DIKPANIFIT KREHAKILDF GLAKVEVLAS TSAATMTAGV DEKQLTSPGS
TLGTVAYMSP EQARAKELDA RTDLFSFGAV LYEMATGQLA FRGDSTATVF EAILNRAPVP
PVRLNPDLPP KLEDIINKAL EKDRNLRYQH AADIRADLQR LKRDTETGRS AAVVAMPDFM
EQTVTVPPPS SKKLSAATTP AAASTPSAPA AAGSSAIVQA AGGRNGLIIG IAGILIAIAV
AAFFLLRGGK PSTTGNSRAA HKAIAVLYFN NLTQDQSLNW LDNGLTDMLT TNLAQVKGLD
VLSTDRIMTA VRGVSKDGKG LDPSQAQKVA RDAGADAFIT GALLKVGPTQ LRLDVRAQDT
NSGQILFSDK VEGADVQSIF GMVDRLTANL AGTFLPESDR PQKGPEIEQA STSNLEAYKH
YQLGRDYSER FLTEDAVREY EEAVRLDPQF AMAMIRLEGE FAISGDLKSG FEWAARAQQL
QARLPRYEQL QLNLLLSSRG GDYDAQIALL RQNGEEFPRD MANRALIGHH LSVYQGKASE
AVAFLKGLQA EYPHDENVLN FLTYASAEAG DLNGAMAAND AYIQVRPGDP NPFDTRGDAL
FFAGHNDEAV TSYRKAMELG FNEHDKLAIV YAEQNKTDMA RASLAQFKEK ASPLNQLYAS
VFEAQFAQSR GDVEGAISAY RTAIKGLESA KQNDVAGDIL LRASELSVLL GQTAPALAYA
QQQKLGGAEG AAVAFLETMQ GNEAAVDPAL QKFLTAKPWL SPFQQNLVRS RNALWLAIAH
NDAPAAAAQV ARLPNAQMPY LLFLKGRAHL LANDLAAAES DFKATRQWDR NLENFRVMSL
RTPIWSVLSS FYLGQVYERS GKRDEAINSY QQFLSHFETS HTKLPQVAEA RTALNHLMK
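
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below illustrates that relationship under stated assumptions: it uses the codon-to-amino-acid assignments that table 11 shares with the standard code, reads from the first base in frame, and stops at the first stop codon. It does not handle the alternative start codons that table 11 permits, which is sufficient here because this gene starts with ATG. All names are illustrative.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq_block: str) -> str:
    """Translate a whitespace-formatted DNA block up to the first stop codon."""
    seq = "".join(seq_block.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence, copied as displayed.
first_row = "ATGATAGGGC AGACCTTTTC GCATTACCGC ATAACCGAGA AACTGGGTGG CGGCGGCATG"
print(translate(first_row))  # MIGQTFSHYRITEKLGGGGM
```

The output matches the first 20 residues of the protein sequence listed above.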