Gene Acid345_4391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4391 
Symbol 
ID4073297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5209086 
End bp5211833 
Gene Length2748 bp 
Protein Length915 aa 
Translation table11 
GC content62% 
IMG OID637986424 
Productserine/threonine protein kinase 
Protein accessionYP_593465 
Protein GI94971417 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAAGTC CAGCGCGCTA TAATCCTGCC ACCCTCATGG CCCTCAACCC TGGTCTGAAG 
CTTGGACCCT ACGAAATCCA GTCGCCGCTT GGTGCGGGCG GCATGGGTGA GGTGTATCGC
GCCACCGATA CCCGGCTCGA TCGCATCGTC GCAATCAAGA TTCTTCCTGC CCATCTTTCC
GCGAATCCCG AGGCACGGCA GCGTTTCGAA CGCGAGGCGC GCAGCATCTC TGCGCTCAAC
CATCCCAATA TCTGTGCGCT GTATGACATC GGCACCCAGG ACGGCACTTC GTTTCTCGTC
ATGGAATACG TGCAAGGCGA AACGCTCGAA GCCCGGCGGC AAAAAGGGCC GCTGCCGCTG
AAGCAAGTGA CCGAAATCGG CATCCAGGTC TGCGACGCGT TGGAGAAGGC GCACCGCGCG
GGCATCATCC ATCGCGATCT CAAGCCCGGT AACATCATGC TCACCGCGAG CGGGGCGAAG
CTTCTCGACT TCGGGCTTGC GAAGGCCGTA GGCGTCTTGG GCGCACAAGC CGCCACTGCA
GGCACCCACA CACCCGACAC GCCGACCATG AATGTTTCCG CATTGCGCGC ACCCGCCGCC
GGACTCACGC AGCAGGGAAC CATTGTCGGC ACATTCCAAT ACATGGCGCC CGAGGCCGCG
GAAGGGTTGG CCACAGATGC CCGCAGCGAT ATCTTCAGCC TTGGCTGCGT GCTCTACGAA
ATGGTGACCG GACGCCGCGC CTTCGAAGGC AAATCGCAGC TCAGCGTGCT CACCGCCATC
CTCGAAAGAG ATCCCGAGCC CATCAGCACC ATCCAGCCGC TTACGCCGCT CGCGCTGGAG
TACACAGTGC ACACGTGCCT GGAGAAGAAT CCCGACCAGC GCTTCCAGAC TGCGCATGAT
GTGAAACTGC AGTTGGTGTG GATCGCGAAG TCGGGATCGC AAGCCAGCGC GAAAGCGATC
GCCGGCAAGC CGCAACCGCG CGGCGGGCTG TGGCTGGCGG CAGCGGCGGG CGCGATCCTC
GCGGCGCTCC TGGTTGCGGG CGTTCTGATG TCCACGCAGA AGCAGCCACG CGTGATGCGC
ACCAACCTCG TGGCACCAAA CGGAATGGTT TTCGAAACCC TCTACCGCAA TGGACCTCCG
GAACTTTCCT CGGATGGAAC CAGGGTGGCG TTCGTCGCTC GTAAGGACGG GCAGAATTCC
ATCTGGGTGC GTTCGCTGGA CAAGCTTGAG GCGACGCAGG TGCAGGGGAC AGCGGAGGGC
TTTCGGCCCT TCTGGTCACC AGATGGAACC TCGCTCGCTT TCTTCGCGCA TGCCAAGCTC
TGGCGCGTGG ATTTGAATGG CGTAGCTCCG GTAGCCATCG CGGACGCGCC CGAAGGTCGC
GGCGGAACTT GGGGAGCGGG CAACACGATT GTGTTCGCGC CGAACACTGG TGGACCGCTG
ATGGAGGTGG ACGCCGCGGG CGGCGCCGCA ACGCCGGTGA CCAAGATGGT CGTTACGGTG
CAAGGCGGTA CCGACCGCTG GCCGCATTTC CTTCCGGACG GCAAACATTT CCTGTACCTC
CGCGTACAGA CAGGAAACTC GAGCGACCAC AATGAACTCC GCGTGGGGTC AGTGGATCAG
AGTACGGACA CACTGATTAT GCATGGGGGC GTTTACGAGA CCCGGTACGT GTCCGGCTGG
CTCCTGGTGG ACCGCACGGG TTCTCTGCTG GCATGGCGCT TCGACCCGAA GAATGCGAAG
ACCTCGGGCG AGGGCATCCA GATTGTCGGG AAGCTTGCGA CCGACGAGGT AACGTTCGCA
GGCGTCTTTT CAGTATCGCA ACAAGGGATT CTTCTCTATC AGCCGGGCTC CAGCGAGACC
GGAGACCGCC ACGTCTGGAC GGATGCATCC GGTAAACCGG GTGCGCAGAT TTCGGAGCCG
GGTTATTACG GGCCCACGCG CCTCTCACCG GATGGAACGC GCGTGGTCAC TCCAGTGTCC
TCCGCCACCG GCGATAGCGA CATCTGGATG TGGGACCTGT TGGGGGGCGC CCGTGCGCGA
CTCTCCAAGG GGGACGTCTT TGTCGACATG CCGACTTGGT CGGCGGATGC ACGCACCATC
TACTTCGGCC AGTCCGACAA AGACGGGCAT GAGCAAATCC GTGCCGTGCC GGCTGACGGG
TCCGCACCGG AACGCACATT GCTCAAAATT GACGGGGATG TGCTTCCGGT CGAAACCACC
AAGGATGGAC GGTGGCTCCT GTACGAAGAA CTTATCCCGG GAAGCCTCAA CAACAATGAG
GTGTTGAAGG CATTACCGCT GGGCAGCGGC GACGCGCCGT TCACAGTGCT CGATTCTGTC
GCGGCGTACA GTAAAGCGTC CTTAAAGCCG GAATCCAACG ACTGGCTCGC CTACCAGTCG
AACGAATCGG GCCATTCCGA GGTGTATCTC ACGCGCTTTC CGCATCCGGG GGCGAAGTAC
CAGGTGTCGC AGAGCGGCGC GACGCAGCCG TTGTGGAGCA AAGACGGGAA GCGTCTCTAT
TATCTCGACA ACTCGCAACA CTTGATCTCC GTCGACATCC AGTTGAGGGG AGACTCTCCA
CAGATCGGTG CGCCGAAGAC GCTATTCCAG ACCACGATCC GCGACTCCAT CACCGCGAAC
GGCTACGATG TCACGCGCGA CGGACGCTTC CTGCTCGTGA ACTCCGTGAT GGAGAACAAC
GCGCCCGTCG TGCTCGTAAC CAATTGGGAG ACGGAACTCA AGAAGTGA
 
Protein sequence
MGSPARYNPA TLMALNPGLK LGPYEIQSPL GAGGMGEVYR ATDTRLDRIV AIKILPAHLS 
ANPEARQRFE REARSISALN HPNICALYDI GTQDGTSFLV MEYVQGETLE ARRQKGPLPL
KQVTEIGIQV CDALEKAHRA GIIHRDLKPG NIMLTASGAK LLDFGLAKAV GVLGAQAATA
GTHTPDTPTM NVSALRAPAA GLTQQGTIVG TFQYMAPEAA EGLATDARSD IFSLGCVLYE
MVTGRRAFEG KSQLSVLTAI LERDPEPIST IQPLTPLALE YTVHTCLEKN PDQRFQTAHD
VKLQLVWIAK SGSQASAKAI AGKPQPRGGL WLAAAAGAIL AALLVAGVLM STQKQPRVMR
TNLVAPNGMV FETLYRNGPP ELSSDGTRVA FVARKDGQNS IWVRSLDKLE ATQVQGTAEG
FRPFWSPDGT SLAFFAHAKL WRVDLNGVAP VAIADAPEGR GGTWGAGNTI VFAPNTGGPL
MEVDAAGGAA TPVTKMVVTV QGGTDRWPHF LPDGKHFLYL RVQTGNSSDH NELRVGSVDQ
STDTLIMHGG VYETRYVSGW LLVDRTGSLL AWRFDPKNAK TSGEGIQIVG KLATDEVTFA
GVFSVSQQGI LLYQPGSSET GDRHVWTDAS GKPGAQISEP GYYGPTRLSP DGTRVVTPVS
SATGDSDIWM WDLLGGARAR LSKGDVFVDM PTWSADARTI YFGQSDKDGH EQIRAVPADG
SAPERTLLKI DGDVLPVETT KDGRWLLYEE LIPGSLNNNE VLKALPLGSG DAPFTVLDSV
AAYSKASLKP ESNDWLAYQS NESGHSEVYL TRFPHPGAKY QVSQSGATQP LWSKDGKRLY
YLDNSQHLIS VDIQLRGDSP QIGAPKTLFQ TTIRDSITAN GYDVTRDGRF LLVNSVMENN
APVVLVTNWE TELKK