Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_4049 |
Symbol | |
ID | 4072471 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 4785884 |
End bp | 4788799 |
Gene Length | 2916 bp |
Protein Length | 971 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637986080 |
Product | TPR repeat-containing serine/threonin protein kinase |
Protein accession | YP_593123 |
Protein GI | 94971075 |
COG category | [K] Transcription [L] Replication, recombination and repair [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG0515] Serine/threonine protein kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.553625 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCCGGCA ACAACTGGGA TCGCGTACAA GAGGTATTTC TGGAGGCGGC AGACCTGCCG CTGTACGATC GCGTGCGATT TCTGGACGAA ACGTGCGCCG ACGATCCGGA TTTGCGTACC GAAGTCGAGT CGCTGCTATG GGCGGATACC GCGGGTGCCG GCGGAATCAG CGAGGCCATC GAATCCGAAG TCAATTCCCT CCTGCATGAT GATGTCTCGC TGATAGGCAC TCGGCTTGGC CGCTATCGCC TGGTGAAAGA GATTGGTCGC GGCGGCATGG GATCGGTCTT TCTCGCCGAG CGCGACGATG AGCACTTCCA TCAGACCGTC GCCATTAAGA TCGTAAAACG CGGCATGGAT AGCGCCGAAG TTCTGGCGCG CTTCCGGCAT GAACGTCAGA TCCTCGCAGG CCTCGAACAT CCTTACATCG CACGATTAAT CGACGGCGGT ACGACCGACG ATGGACGTCC ATTCTTCGTC ATGGAGCGCG TCGAGGGCCG ACCGATTGAC GTGTACTGTC GCGAGCAAAA TCTTAGCGTC GAGGCGCGCT TGCGACTGTT TGTCCGGGTA TGCGAGGCCA TCTCGTACGC GCACCGCGCA CTGGTGGTGC ATCGCGATCT GAAGCCGAGC AATATCCTCG TAACCAGCGA AGGAATCCCG AAGCTCCTCG ACTTCGGAGT CGCGAAGCTC CTCGGTCCAA GCCTGGATCC TGGGCTGACC TCGACCTGGT CGGCGATGGG GCCGCTTACG CCGGAATACG CGAGTCCGGA ACAGATCCAG GGACTGCCGA TTACAACTGC AGCCGACACC TATGCGTTGG GCGCGATCCT CTTCGAACTG CTGACGGGGA GGAGAGCACA GAAGATTGCG GGGCACAGTC CGGCGGAAAT CGAGCGGGTA GTTTGCCACG TCGAGATTCC TGCGCCAAGT GCGGTCGAAA AAACTTCTGG TCTGTCGCTG AAAATTGACA GCGATCTCGA CAACATCGTG TTGATGGCAC TTCGCAAGGA ACCGGAGCGG CGCTATCGTT CGGTGAACCA ATTTGCCGAA GATATTGCGA AGTATCTCGC GGGCCGCCCG GTGCTAGCGC AGCAGGATTC ATTCGTCTAT CGCAGCCGGA AGTTCTTGCG GCGGCATGCT CTATTGGTTG CAGCAGCCGC GCTGGTGACG GCGAGTTTGG TCGGCGGAAC AGCACTGGCA CTGATGCAGG CCAAACGCGC AGAAACAGCC CGGGGATTGG CGGAAATGCA GCGCCAGTCA GCTGAACGCG AGCGTGCGCG CGCCGAAGCA CAAACTCAGA TCGCGGAGCA GGAACGCGTG AAAGCGGAGG CCGAGGCGCT CGTTGCGAAG ACGGAGCAAG GAATCTCGCA GCGCCGGTTG GCGCAGATGT TAGAGCTCTC CGATCACACT TTGTTCGACG TGCACTCCGC AATCGAGAAG CTTCCGGGCG CCACCGAGGC GCGTCGGAAG ATCGTCGCCA CCACGCTGAG CTTTCTCGAA GATTTGTCGA AAGATGCAGC GCATGATGAT CGCCTGCGAT TCATGTTGAG CGTGTCGTAC CTGCGCGTGG CGGATGTGCT GGGGCATCCG CTGAAACCGA ACCTTGGCGA CAGTAAGGGA GCAGACGAGA ACTATCGCAA GTCGGTGGCG ATGATCGAGC CATTGGTCAA ACAATATCCG GACAATGGTG AATATCTGCG GCAGATGATT CACGCCGAGG TGCAATGGGC GATCCTGCTC TCACGGACTG GAGAACAGGC ACGCGCGATT GCAGTGCTGA AGTCGCTTAT GCCGATGGCG CCGCGGTTGC CGAAACTCTG TCCGAAAGAT CCTGATTGCT GGATGGTGGA GAGTGAGGTT TATTCAGAAC TCCTGGAGAC CAATGAGACG ATTGATTCCG GCTCGGCGAT TGGTTACTCG GAGTTGCAGG TCGGGTCTCT CGAGAAAGCT CACAAACAAT TTCCGGACAA TTCCGAGGTC CTGCTGGAAC TGGCTTCCGC CTACAGCCAG AACGCAAAAC TGCACAATGT CCGCGGGGAA TTGCGGGAAT CAGTTGATGG CTTCCGACGC GCGATGTCGT TGCGCGAGGA AGAGGTGCGG CGGAATCCGT CGGATGTACT GCTGCAGCGC AGCCTGATGA TCACCTACGG AAACCTTGCG GGCACGCTGG GAAATCCGAT CTACCTGAAT CTTGGAGACT CCGAAGGCGC GCGCTTGTAT TACGGAAAAG CGCTGGCGAT TGCGCGTCAA CTGGCGGCGG CCGATGCGAA CAATCAACTC GCGCAGTATG ACCTCGCGAA TGCTTTGCTG TTTTCGTCGT GCCTCGATCT CCCGAAAGAA CTTTGGCCCG AAGCATTAGC TCATCTCACC GAAGCCGAGA CGATCATGAC GCGACTTGTT GCTGCAGACC CGAAAGCGGT GAAAAATCTG CGGTGGCTCA GCACGGTGCA GGAATTCCGG GGCCGACGTT TGATGTTGAT GGGGCAAAAT GACGAGGCGA TTGCCACCCT TCAGACGTCG ATGGAGAACG GGGAAAAGGG CCTCACCCGT GCAGCAAGTG ACCTCAGCAT GATGACGCAA GTGGTTGCCA GCGAGGAGGG ACTCTCGGAG GCCCTGGCGC GAAAGGGCGA TGCTGATGGC GCACTCAGTC ACGCGAGAGC TGCGGTGGCA AAAGTCGAAA AGGCAACCGC TCCAGATTCG GACAAGGACA GGTTGCTGCG GCTAGCGGCA ATTGCCTATC AGAACCTCGC GGTCGTGCAG TCGTTGCTCG GCGACTGGAA CGGTGCTCGG GCTTCTGCCG AACTTTCCAT CACGCAATGG CAGAGAATGG TCGCGATGGG CAGCCACCGA GTGGAATCCG CTAAAATGCG GGCCTCGGAA GAGCTTGTGC AGCAATCACT TGCGCACCTT AAATAA
|
Protein sequence | MPGNNWDRVQ EVFLEAADLP LYDRVRFLDE TCADDPDLRT EVESLLWADT AGAGGISEAI ESEVNSLLHD DVSLIGTRLG RYRLVKEIGR GGMGSVFLAE RDDEHFHQTV AIKIVKRGMD SAEVLARFRH ERQILAGLEH PYIARLIDGG TTDDGRPFFV MERVEGRPID VYCREQNLSV EARLRLFVRV CEAISYAHRA LVVHRDLKPS NILVTSEGIP KLLDFGVAKL LGPSLDPGLT STWSAMGPLT PEYASPEQIQ GLPITTAADT YALGAILFEL LTGRRAQKIA GHSPAEIERV VCHVEIPAPS AVEKTSGLSL KIDSDLDNIV LMALRKEPER RYRSVNQFAE DIAKYLAGRP VLAQQDSFVY RSRKFLRRHA LLVAAAALVT ASLVGGTALA LMQAKRAETA RGLAEMQRQS AERERARAEA QTQIAEQERV KAEAEALVAK TEQGISQRRL AQMLELSDHT LFDVHSAIEK LPGATEARRK IVATTLSFLE DLSKDAAHDD RLRFMLSVSY LRVADVLGHP LKPNLGDSKG ADENYRKSVA MIEPLVKQYP DNGEYLRQMI HAEVQWAILL SRTGEQARAI AVLKSLMPMA PRLPKLCPKD PDCWMVESEV YSELLETNET IDSGSAIGYS ELQVGSLEKA HKQFPDNSEV LLELASAYSQ NAKLHNVRGE LRESVDGFRR AMSLREEEVR RNPSDVLLQR SLMITYGNLA GTLGNPIYLN LGDSEGARLY YGKALAIARQ LAAADANNQL AQYDLANALL FSSCLDLPKE LWPEALAHLT EAETIMTRLV AADPKAVKNL RWLSTVQEFR GRRLMLMGQN DEAIATLQTS MENGEKGLTR AASDLSMMTQ VVASEEGLSE ALARKGDADG ALSHARAAVA KVEKATAPDS DKDRLLRLAA IAYQNLAVVQ SLLGDWNGAR ASAELSITQW QRMVAMGSHR VESAKMRASE ELVQQSLAHL K
|
| |