Gene Acid345_4049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4049 
Symbol 
ID4072471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4785884 
End bp4788799 
Gene Length2916 bp 
Protein Length971 aa 
Translation table11 
GC content59% 
IMG OID637986080 
ProductTPR repeat-containing serine/threonin protein kinase 
Protein accessionYP_593123 
Protein GI94971075 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.553625 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCCGGCA ACAACTGGGA TCGCGTACAA GAGGTATTTC TGGAGGCGGC AGACCTGCCG 
CTGTACGATC GCGTGCGATT TCTGGACGAA ACGTGCGCCG ACGATCCGGA TTTGCGTACC
GAAGTCGAGT CGCTGCTATG GGCGGATACC GCGGGTGCCG GCGGAATCAG CGAGGCCATC
GAATCCGAAG TCAATTCCCT CCTGCATGAT GATGTCTCGC TGATAGGCAC TCGGCTTGGC
CGCTATCGCC TGGTGAAAGA GATTGGTCGC GGCGGCATGG GATCGGTCTT TCTCGCCGAG
CGCGACGATG AGCACTTCCA TCAGACCGTC GCCATTAAGA TCGTAAAACG CGGCATGGAT
AGCGCCGAAG TTCTGGCGCG CTTCCGGCAT GAACGTCAGA TCCTCGCAGG CCTCGAACAT
CCTTACATCG CACGATTAAT CGACGGCGGT ACGACCGACG ATGGACGTCC ATTCTTCGTC
ATGGAGCGCG TCGAGGGCCG ACCGATTGAC GTGTACTGTC GCGAGCAAAA TCTTAGCGTC
GAGGCGCGCT TGCGACTGTT TGTCCGGGTA TGCGAGGCCA TCTCGTACGC GCACCGCGCA
CTGGTGGTGC ATCGCGATCT GAAGCCGAGC AATATCCTCG TAACCAGCGA AGGAATCCCG
AAGCTCCTCG ACTTCGGAGT CGCGAAGCTC CTCGGTCCAA GCCTGGATCC TGGGCTGACC
TCGACCTGGT CGGCGATGGG GCCGCTTACG CCGGAATACG CGAGTCCGGA ACAGATCCAG
GGACTGCCGA TTACAACTGC AGCCGACACC TATGCGTTGG GCGCGATCCT CTTCGAACTG
CTGACGGGGA GGAGAGCACA GAAGATTGCG GGGCACAGTC CGGCGGAAAT CGAGCGGGTA
GTTTGCCACG TCGAGATTCC TGCGCCAAGT GCGGTCGAAA AAACTTCTGG TCTGTCGCTG
AAAATTGACA GCGATCTCGA CAACATCGTG TTGATGGCAC TTCGCAAGGA ACCGGAGCGG
CGCTATCGTT CGGTGAACCA ATTTGCCGAA GATATTGCGA AGTATCTCGC GGGCCGCCCG
GTGCTAGCGC AGCAGGATTC ATTCGTCTAT CGCAGCCGGA AGTTCTTGCG GCGGCATGCT
CTATTGGTTG CAGCAGCCGC GCTGGTGACG GCGAGTTTGG TCGGCGGAAC AGCACTGGCA
CTGATGCAGG CCAAACGCGC AGAAACAGCC CGGGGATTGG CGGAAATGCA GCGCCAGTCA
GCTGAACGCG AGCGTGCGCG CGCCGAAGCA CAAACTCAGA TCGCGGAGCA GGAACGCGTG
AAAGCGGAGG CCGAGGCGCT CGTTGCGAAG ACGGAGCAAG GAATCTCGCA GCGCCGGTTG
GCGCAGATGT TAGAGCTCTC CGATCACACT TTGTTCGACG TGCACTCCGC AATCGAGAAG
CTTCCGGGCG CCACCGAGGC GCGTCGGAAG ATCGTCGCCA CCACGCTGAG CTTTCTCGAA
GATTTGTCGA AAGATGCAGC GCATGATGAT CGCCTGCGAT TCATGTTGAG CGTGTCGTAC
CTGCGCGTGG CGGATGTGCT GGGGCATCCG CTGAAACCGA ACCTTGGCGA CAGTAAGGGA
GCAGACGAGA ACTATCGCAA GTCGGTGGCG ATGATCGAGC CATTGGTCAA ACAATATCCG
GACAATGGTG AATATCTGCG GCAGATGATT CACGCCGAGG TGCAATGGGC GATCCTGCTC
TCACGGACTG GAGAACAGGC ACGCGCGATT GCAGTGCTGA AGTCGCTTAT GCCGATGGCG
CCGCGGTTGC CGAAACTCTG TCCGAAAGAT CCTGATTGCT GGATGGTGGA GAGTGAGGTT
TATTCAGAAC TCCTGGAGAC CAATGAGACG ATTGATTCCG GCTCGGCGAT TGGTTACTCG
GAGTTGCAGG TCGGGTCTCT CGAGAAAGCT CACAAACAAT TTCCGGACAA TTCCGAGGTC
CTGCTGGAAC TGGCTTCCGC CTACAGCCAG AACGCAAAAC TGCACAATGT CCGCGGGGAA
TTGCGGGAAT CAGTTGATGG CTTCCGACGC GCGATGTCGT TGCGCGAGGA AGAGGTGCGG
CGGAATCCGT CGGATGTACT GCTGCAGCGC AGCCTGATGA TCACCTACGG AAACCTTGCG
GGCACGCTGG GAAATCCGAT CTACCTGAAT CTTGGAGACT CCGAAGGCGC GCGCTTGTAT
TACGGAAAAG CGCTGGCGAT TGCGCGTCAA CTGGCGGCGG CCGATGCGAA CAATCAACTC
GCGCAGTATG ACCTCGCGAA TGCTTTGCTG TTTTCGTCGT GCCTCGATCT CCCGAAAGAA
CTTTGGCCCG AAGCATTAGC TCATCTCACC GAAGCCGAGA CGATCATGAC GCGACTTGTT
GCTGCAGACC CGAAAGCGGT GAAAAATCTG CGGTGGCTCA GCACGGTGCA GGAATTCCGG
GGCCGACGTT TGATGTTGAT GGGGCAAAAT GACGAGGCGA TTGCCACCCT TCAGACGTCG
ATGGAGAACG GGGAAAAGGG CCTCACCCGT GCAGCAAGTG ACCTCAGCAT GATGACGCAA
GTGGTTGCCA GCGAGGAGGG ACTCTCGGAG GCCCTGGCGC GAAAGGGCGA TGCTGATGGC
GCACTCAGTC ACGCGAGAGC TGCGGTGGCA AAAGTCGAAA AGGCAACCGC TCCAGATTCG
GACAAGGACA GGTTGCTGCG GCTAGCGGCA ATTGCCTATC AGAACCTCGC GGTCGTGCAG
TCGTTGCTCG GCGACTGGAA CGGTGCTCGG GCTTCTGCCG AACTTTCCAT CACGCAATGG
CAGAGAATGG TCGCGATGGG CAGCCACCGA GTGGAATCCG CTAAAATGCG GGCCTCGGAA
GAGCTTGTGC AGCAATCACT TGCGCACCTT AAATAA
 
Protein sequence
MPGNNWDRVQ EVFLEAADLP LYDRVRFLDE TCADDPDLRT EVESLLWADT AGAGGISEAI 
ESEVNSLLHD DVSLIGTRLG RYRLVKEIGR GGMGSVFLAE RDDEHFHQTV AIKIVKRGMD
SAEVLARFRH ERQILAGLEH PYIARLIDGG TTDDGRPFFV MERVEGRPID VYCREQNLSV
EARLRLFVRV CEAISYAHRA LVVHRDLKPS NILVTSEGIP KLLDFGVAKL LGPSLDPGLT
STWSAMGPLT PEYASPEQIQ GLPITTAADT YALGAILFEL LTGRRAQKIA GHSPAEIERV
VCHVEIPAPS AVEKTSGLSL KIDSDLDNIV LMALRKEPER RYRSVNQFAE DIAKYLAGRP
VLAQQDSFVY RSRKFLRRHA LLVAAAALVT ASLVGGTALA LMQAKRAETA RGLAEMQRQS
AERERARAEA QTQIAEQERV KAEAEALVAK TEQGISQRRL AQMLELSDHT LFDVHSAIEK
LPGATEARRK IVATTLSFLE DLSKDAAHDD RLRFMLSVSY LRVADVLGHP LKPNLGDSKG
ADENYRKSVA MIEPLVKQYP DNGEYLRQMI HAEVQWAILL SRTGEQARAI AVLKSLMPMA
PRLPKLCPKD PDCWMVESEV YSELLETNET IDSGSAIGYS ELQVGSLEKA HKQFPDNSEV
LLELASAYSQ NAKLHNVRGE LRESVDGFRR AMSLREEEVR RNPSDVLLQR SLMITYGNLA
GTLGNPIYLN LGDSEGARLY YGKALAIARQ LAAADANNQL AQYDLANALL FSSCLDLPKE
LWPEALAHLT EAETIMTRLV AADPKAVKNL RWLSTVQEFR GRRLMLMGQN DEAIATLQTS
MENGEKGLTR AASDLSMMTQ VVASEEGLSE ALARKGDADG ALSHARAAVA KVEKATAPDS
DKDRLLRLAA IAYQNLAVVQ SLLGDWNGAR ASAELSITQW QRMVAMGSHR VESAKMRASE
ELVQQSLAHL K