Gene Acid345_4390 details


Gene Information

Locus tag: Acid345_4390
Symbol:
ID: 4073296
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 5206173
End bp: 5208989
Gene Length: 2817 bp
Protein Length: 938 aa
Translation table: 11
GC content: 62%
IMG OID: 637986423
Product: serine/threonine protein kinase
Protein accession: YP_593464
Protein GI: 94971416
COG category: [K] Transcription
              [L] Replication, recombination and repair
              [R] General function prediction only
              [T] Signal transduction mechanisms
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0515] Serine/threonine protein kinase
        [COG0823] Periplasmic component of the Tol biopolymer transport system
TIGRFAM ID:
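
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon (the gene sequence listed on this page ends in TAG).

```python
def cds_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_length_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 5206173, 5208989

gene_len = cds_length_bp(start_bp, end_bp)
print(gene_len)                     # 2817, matching "Gene Length"
print(protein_length_aa(gene_len))  # 938, matching "Protein Length"
```

Under these assumptions the computed values agree with the Gene Length (2817 bp) and Protein Length (938 aa) fields.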


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.139237
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGGGC AGACCTTTTC GCATTACCGG ATCCTCGAAA AACTGGGCGG CGGCGGCATG 
GGCGTGGTGT ACAAGGCCGA AGACACGCGC TTGCATCGCT TCGTCGCATT GAAATTTCTT
CCACCCGAAC TCGCCCGCGA TCCGCAGGCC CTGGCTCGTT TCCAGCGTGA GGCGCAAGCC
GCCTCGGCGC TCAACCATCC CAACATCTGC ACCATCTACG ACATTGGCGA CGACAACGGG
CAAGGCTTCA TCGCCATGGA GTTCCTCGAG GGCATGACCC TCAAGCACCG CATCAACAGC
CAGCCCGTGG ACCTCGAGAC GGTGCTGACT TTAGCGATCG ACATCGCCGA TGCGCTCGAC
GCTGCGCACT CCAAGGGCAT CGTCCACCGC GACATCAAGC CTGCGAACAT CTTCGTCATC
GAACGCGGCC ATGCCAAGGT CCTCGACTTC GGCCTGGCGA AGGTGACGCC GCGCAGCGCG
TCCAGCGTGC CGTCCGCGAA CACCATGACC GCCACCGAAC TCGCGGTGGA AGACGAGCAC
CTCACCAGTC CCGGATCCAC GCTCGGCACC ATCGCGTATA TGTCGCCGGA GCAGGCGAAG
GGGAAGGAAC TCGACGCGCG CAGCGATCTC TTCTCCTTCG GCTCCGTGCT CTACGAAATG
GTCACCGGTG CATTGCCGTT CCAGGGCGAG ACGTCGGCGC TGATGTTCGA CGCCATCCTC
AATCGCGATC CGCTGCCGCC GCTGCGCTTC AATCCGAAGG TGCCGGCGAA GCTGGAAGAG
ATCATTCAAA AGGCGTTGGA GAAAGACCGC GATTTGCGCT ATCAGCACGC GTCCGAAATG
CGCAGCGACC TCAAGCGCCT GCAGCGCGAC AGCAGTTCCG CGCCGCGCCA GGCCTCGGCC
CCAACTACCG AAGCAAGTTC CGCGCGTGTC AGTGCCGCTT ATGTTCCGCC TTCGTCTTCC
GCGAGCGCCG CGCCCGTGGC TCAAGCTTCT GCGCCCTCGC ATTCCACGGG TAGTTCGAGC
GTGATCGCCG TGGCCCGCGA GCATAAGTTC GGACTCGTAG GAATAGCCGT AGTCGCGCTG
GGGCTGCTCG GTGCGGGCGG CTTTGGGATC TACGAGTTCC TCTCACGCAG CGCGCAGGCG
CCGTTCCAGA ATTTCACAGT TGCGCAACTG ACGAACACCG GCAAAGCGCG CCAGTCGGCG
ATCTCGCCCG ACGGCAAGTA CGTCGTCAGC GTGCAGGACG ACAACGGCGT CCGCAGTTTG
TGGCTGCGCA ACGTCCCCAC CGGCAGTGAT ACGCAGGTCC TGCCGCCGTC TCCCGTCATT
TACGCCACGC TCACGTTCTC GCCCGACGGC AACTACATCT ATTACCGCAA GGCTTCCAGC
GCAGCGCAGA GCGAGTGGGA CATCTTCCGC ATCCCCGTGC TCGGCGGCTC GCCGCAAGTG
CTCGCCAAGG ATGTGGACAG CAACGTCACG TTCTCGCCCG ACGGTACGAA GATGGCCTAC
TTCCGCGGCA ACGATCCCGA AGTCGGCAAG TTGTACTTCC TCATCGCCAA CCTCGACGGC
AGCAACGAGG AGACGGTGTA TATCGGGTCG GCGGTGGACC TCCCACGAAG CATTGCGTGG
TCAGCGGACG GCAAGCGCCT CTTCTACAAC ATTTTCACGC TTAAAACTGC GCTGAGTGAA
ATCCGGCAAA TCGAAATTGC GAGCAAGAAA GCCAGCCTTC TCTCTGCTCA AACCTCTACG
CTTGTGCAAG AGCTCGAGCG TTTGCCGGGG AGTTCATCGC TGCTCATCCG AGGTCTCGAC
AAATCCAACT ACACGAAATC GCAGATCGGC GTGCTCTCGG ACTCAGGCCA GGTCGTGCCC
ATCACCCGCG ACACCAACTC CTATGAAGGC ATGTCGCTTT CAGGCGACGG CAAGACGATC
GCCGCCGTGC AACAGCGCGT CACCCGCACT TTTTGGACAG CAGCGATGTC CAACGACGCG
CTCGCCGCGC CACCGCAATC GGTCGCCGGA GTTGAGAACG CGCACACTTT CGCGTTCAAC
CCGGATGGCA GCCTGCTCGT GAGTGACGAC CAGACGCTGC GTCGCACTGA CCTTGCGGGC
GCCGCGACCA CGACGGTGCT TGGAGATGGC AACGCTTACA TCGTGGAACT CGCACCCTGC
GGCGACCGAT ACTTCGTGCT GCAATGGGCC TTTCATGGCG GCGGCTACAG CTATCCTGTG
TGGCGGGTGA ATCTCGACGG CTCCAATCCC CTGCGGCTCA CGGATGGCAG TTACGACGGC
CGCCCCAACT GTTCACCCGA TGGCAAAACC GTTTATTTCG AACCGACCAC GAGCAAAGCC
ACCGTCGCCA GGGTTCCCAT CGACGGCGGA AAACAGGAAA CCGTGCCCGG CAGTGAGGTA
GAAAGAGGGT TCGGCATCGG CGTGGGCCAC GCTGTTTCGC CCGACGGCAA ACTGCTAGCT
TTCAACGCTG AGATCAGCAC CGATGCCCAG GGACACGTGG GAGAGAAGAT TGTCTTCGTT
ACGCTGGACG GCTCCGGTGC TGCTCAGCGA ATCATTAACG CCGATCCGCG CATCTCCAGT
GGACGCCTGG CGAACACGCT CACCTTCACT CCCGATGGTA AATCGGTGGC CTACGTCGTC
CGCGAGCATG GCGTCGAAAA CGTCTTCATC CAGCCCATCG ACGGCACGCC CGGCCACCAA
CTCACGAACT TCACCTCGCA ACTCATCTCC AACTTCCACT GGTCGCCCGA TGGCAAGACC
CTCGCCATCG CCCGCGCCGA CGCCAGTTCC GATGTGGTGC TGCTGAAAGA GAAGTAG
 
Protein sequence
MIGQTFSHYR ILEKLGGGGM GVVYKAEDTR LHRFVALKFL PPELARDPQA LARFQREAQA 
ASALNHPNIC TIYDIGDDNG QGFIAMEFLE GMTLKHRINS QPVDLETVLT LAIDIADALD
AAHSKGIVHR DIKPANIFVI ERGHAKVLDF GLAKVTPRSA SSVPSANTMT ATELAVEDEH
LTSPGSTLGT IAYMSPEQAK GKELDARSDL FSFGSVLYEM VTGALPFQGE TSALMFDAIL
NRDPLPPLRF NPKVPAKLEE IIQKALEKDR DLRYQHASEM RSDLKRLQRD SSSAPRQASA
PTTEASSARV SAAYVPPSSS ASAAPVAQAS APSHSTGSSS VIAVAREHKF GLVGIAVVAL
GLLGAGGFGI YEFLSRSAQA PFQNFTVAQL TNTGKARQSA ISPDGKYVVS VQDDNGVRSL
WLRNVPTGSD TQVLPPSPVI YATLTFSPDG NYIYYRKASS AAQSEWDIFR IPVLGGSPQV
LAKDVDSNVT FSPDGTKMAY FRGNDPEVGK LYFLIANLDG SNEETVYIGS AVDLPRSIAW
SADGKRLFYN IFTLKTALSE IRQIEIASKK ASLLSAQTST LVQELERLPG SSSLLIRGLD
KSNYTKSQIG VLSDSGQVVP ITRDTNSYEG MSLSGDGKTI AAVQQRVTRT FWTAAMSNDA
LAAPPQSVAG VENAHTFAFN PDGSLLVSDD QTLRRTDLAG AATTTVLGDG NAYIVELAPC
GDRYFVLQWA FHGGGYSYPV WRVNLDGSNP LRLTDGSYDG RPNCSPDGKT VYFEPTTSKA
TVARVPIDGG KQETVPGSEV ERGFGIGVGH AVSPDGKLLA FNAEISTDAQ GHVGEKIVFV
TLDGSGAAQR IINADPRISS GRLANTLTFT PDGKSVAYVV REHGVENVFI QPIDGTPGHQ
LTNFTSQLIS NFHWSPDGKT LAIARADASS DVVLLKEK