Gene Acid345_2578 details

Gene Information

Locus tag: Acid345_2578
Symbol:
ID: 4070541
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3044539
End bp: 3047496
Gene Length: 2958 bp
Protein Length: 985 aa
Translation table: 11
GC content: 59%
IMG OID: 637984595
Product: serine/threonine protein kinase
Protein accession: YP_591653
Protein GI: 94969605
COG category: [K] Transcription; [L] Replication, recombination and repair; [R] General function prediction only; [T] Signal transduction mechanisms
COG ID: [COG0515] Serine/threonine protein kinase
TIGRFAM ID:
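
The coordinates and lengths in the table above agree with each other, which a little arithmetic confirms. The sketch below assumes the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon. The variable names are illustrative and are not part of the database record.

```python
# Values copied from the Gene Information table.
start_bp = 3044539
end_bp = 3047496

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # 2958 985
```

Under these assumptions, the 2958 bp gene holds 986 codons, one of which is the stop codon, giving the listed 985 aa.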


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.790371
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGCTCTGG CTCCCGGGAC AAAAATTGGG CCCTACCAAG TTGACGCGCT CCTCGGCAAA 
GGCGGTATGG GCGAGGTCTA CACTGCCCGC GACACGCGCC TCCAGCGGAC CGTCGCCATC
AAAATCCTTC CCTCGCACCT TTCCTACAAT CCCGATCTCC GTGCTCGGTT CGAGCAGGAA
GCCAAGTCCA TCTCCGCGCT GCAGCACCCG AACATCTGTG TCATCCACGA CGTTGGCTCG
CAGGACGACA TCGAGTTCAT GGTCATGGAG TACGTCCAGG GCGACACGCT CGACAAGCTG
ATCCCCAAGG GCGGTCTCCC CGCTGAAATC GCAATCCGCT ATGCAATCCA AATCGCCGAC
GCCATCGGAT GCGCGCACAG TGCCGGAATC GTTCACCGAG ATCTCAAGCC TTCGAACGTC
ATCGTTGACA AATCTGGCCT GGTAAAGGTT CTCGACTTCG GACTGGCAAA GACCTCCGCA
CTCGCTGCGC AAGCCGGTGC CATGGAAACG ATCACCGTTG GCACCAGTCC CGGCACAATC
GTCGGGACAG TGGCTTATAT GTCTCCGGAA CAAGCCGAAG GCAAGGCGGT GGACACCCGC
AGCGATGTCT TCTCTTTTGG CGCAGTCTTC TATGAGATGC TCAGCGGCCA CCGCGCCTTC
GAGGGTGAAT CCAGCGCCGC GCTACTTGCG TCGGTCCTCC GAGATGAACC GAAACCGCTC
ACCGAAGTTC GCCGCGATCT CGATCCCGAA ATCCGCCGCA TCGTCACGCG CTGTCTAAAG
AAGGATCCGG CCGCTCGCTA CGCGGATGGC AACGACCTCG CTCGCGACCT CAAGCGTTGT
CGCGAGACCC TCTTCCCCGA ATCCGGCACG GGAATGAGTG CCGTCCGACT GGCGCACGAA
GTGAAGCGCC CGCGTGTTTT GATTCCTGCA GTTCTGCTGT TACTCCTCAT CGCCGCCGGG
ACAATGATGC TGGTCAAACG CTCGCGCGAT GCGCATTGGG CGCGAGAGAC AGCGCTCCCG
CAAATCTCGC AACTTGTCGA CGAGGGAAAG TTTGAGACCG CCTTCCAACT CGCGACCAAG
GCGGAAAGCG CGATTCCCGG CGATCCCGCG CTCGAAAAGG TCTGGAAGCA AGTCACGTTC
GAACTGACCC TTGAGACAAG CGCTCCTGAC GTCAGCGTCT ATCGCCGGGA ATACGATGAC
CACAACGGCC CCTGGATCTT TGTCGGGAAG GCCCCGTTCA AGTCCATCCG GCAGCCCCGC
GGAACTCGGC TCTGGAAACT CGAAAAGCCC GGTTATGTGA CGGTAATTAG GACTACCAAC
TCTCTCCTCG ACCGTTACTT TGTTTCCACC GACCCGATGA CCGCTCATGT CACCATGGAT
GCTGTCGCTG ATGCCCCGCC TGGCATGGTC CGCGTCGCTC CTTCGAAGAG CCTGAAGGAG
CTACTGATCC CCGGTTTTGA GGGCATGCCT CACCTCGATC TGCCTGACTA CTGGCTCGAT
CAGTACGAAG TGACGAACCG CGCGTTCGAA GCCTTCGTGG ATGCCGGTGG TTATCGCAAT
CCATCTTTCT GGAAGCACGA CTTCATTCGC GACGGCAAGA AACTGACCTT CGATCAGGCA
ATGGCGCTGT TCCAGGACGC CACTGGACGT ACGGGACCGA AAGACTGGGT CGGCGGTCAA
TATCCGAAGG GACAGGATGA CTATCCCGTC GCTGGCATCA GTTGGTACGA AGCCGCCGCT
TACGCCGAAT ATGCAGGCAA GAGCCTGCCG TCGATCTATC ACTTCAACCG GGCAGCGGGA
CCACAGTTCT CGTACCTGAT CGAACCGGCG AGCAATTTCG GTACCGGCGG GCCGGTACCG
GTTGGCAGCC GACAAGGCAT CGGTCCGTTC GGCACCTCCG ACATGGCGGG AAATGTGAAG
GAGTGGATCT TCACCGAGGC CGAAAACGGC AAACACTACG TTTTGGGTGG CGCCTGGGAT
GAGCCTACCT ATATGTTCGT CGATCCCGAC GCGCAGGCAC CGTTTTTGCG CGCCAGCAAC
ATCGGGTTCC GTTGTGTGAA ATACATAAAT CCCGATGCAA TTCCGAAGGT CGCGTTCGAC
CGCATCCTCT CCGCGCGCCG CGATTTGACC GCCGTCAAGC CGGTTTCTGA TCAGGTCTTC
AGCGCCTATC GCAGCCTCTA CTCATACGAC AAAGCACCCC TCAACGCACA CGTCGATAAG
CTCGAGAAAA CCGAGGACGA TTGGACGGTT GAAAAGATCG CTTACGACGC GCCTTACGGG
AACGAGCGAG CATTTGCATA CCTGTTCCTT CCGACCAAGG CGAACCCGCC ATTCCAGACT
GTGCTCTTCT TCCCCGGATC GAACGCGCTT GAGTTGAGGA AGTTCAACCT GTATCCCACC
GCGGCAATCG ACGCGCTTGT GCGCAGCGGA CGCGCCGTGA TTTACCCGGT TTACAAAGGC
ACCTACGAAC GCGGCGACGG CATGGAAAGC GATGTCCCGA ATATGAGCAC CACGTGGCGG
GACCATGTCG TGATGTGGGC GAAGGATGCT TCCCGCGCGA TCGACTACGT CGAGAGCCGC
CCCGACCTCG ATCACGCAAA GGTCGCCTAC TACGGGTATA GCTGGGGCTC GGAGATGGGC
ACGATCATTC CCGCCATCGA ACCTCGCATC AAGGTCGTGA CCCTTGCCCT CGGCGGCTTC
GATCTCCACC AGTCACTGCC CGAGGTCGAC ACCGTCAACT TCGCCCAACG TATCAAGCAG
CCGACGCTGA TGCTCAATGG CCGCTACGAT TTCTTCTTCC CCATGGACGC CACCCAGGAG
CCTCTATATC GAATGCTTGG CGCTCCGAAG GACGACAAGA AGCACCTGAT TTACGACACC
AGCCACACTA TTCCGCGCAA CGAACTGATC AAGGAAAACC TGAATTGGCT GGATAAATAT
CTCGGTCCGG TGAAGTGA
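
The GC content reported in the Gene Information table can in principle be recomputed from the gene sequence above. The following is a minimal sketch; the function name `gc_content` is hypothetical, and it is only an assumption that the database computes its figure over the full gene sequence in this way.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C among the A/C/G/T bases.

    Spaces and line breaks (as in the blocks of ten shown on this page)
    are ignored, as is any other non-ACGT character.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First two blocks of the gene sequence, used only as a small example.
print(gc_content("ATGGCTCTGG CTCCCGGGAC"))  # 70.0
```

Passing the whole gene sequence as one string would give the value for the complete gene.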
 
Protein sequence
MALAPGTKIG PYQVDALLGK GGMGEVYTAR DTRLQRTVAI KILPSHLSYN PDLRARFEQE 
AKSISALQHP NICVIHDVGS QDDIEFMVME YVQGDTLDKL IPKGGLPAEI AIRYAIQIAD
AIGCAHSAGI VHRDLKPSNV IVDKSGLVKV LDFGLAKTSA LAAQAGAMET ITVGTSPGTI
VGTVAYMSPE QAEGKAVDTR SDVFSFGAVF YEMLSGHRAF EGESSAALLA SVLRDEPKPL
TEVRRDLDPE IRRIVTRCLK KDPAARYADG NDLARDLKRC RETLFPESGT GMSAVRLAHE
VKRPRVLIPA VLLLLLIAAG TMMLVKRSRD AHWARETALP QISQLVDEGK FETAFQLATK
AESAIPGDPA LEKVWKQVTF ELTLETSAPD VSVYRREYDD HNGPWIFVGK APFKSIRQPR
GTRLWKLEKP GYVTVIRTTN SLLDRYFVST DPMTAHVTMD AVADAPPGMV RVAPSKSLKE
LLIPGFEGMP HLDLPDYWLD QYEVTNRAFE AFVDAGGYRN PSFWKHDFIR DGKKLTFDQA
MALFQDATGR TGPKDWVGGQ YPKGQDDYPV AGISWYEAAA YAEYAGKSLP SIYHFNRAAG
PQFSYLIEPA SNFGTGGPVP VGSRQGIGPF GTSDMAGNVK EWIFTEAENG KHYVLGGAWD
EPTYMFVDPD AQAPFLRASN IGFRCVKYIN PDAIPKVAFD RILSARRDLT AVKPVSDQVF
SAYRSLYSYD KAPLNAHVDK LEKTEDDWTV EKIAYDAPYG NERAFAYLFL PTKANPPFQT
VLFFPGSNAL ELRKFNLYPT AAIDALVRSG RAVIYPVYKG TYERGDGMES DVPNMSTTWR
DHVVMWAKDA SRAIDYVESR PDLDHAKVAY YGYSWGSEMG TIIPAIEPRI KVVTLALGGF
DLHQSLPEVD TVNFAQRIKQ PTLMLNGRYD FFFPMDATQE PLYRMLGAPK DDKKHLIYDT
SHTIPRNELI KENLNWLDKY LGPVK
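
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a plain-Python illustration, not the database's own pipeline. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons). The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); "*" marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(sequence: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(sequence.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at a stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as shown on this page.
first_line = "ATGGCTCTGG CTCCCGGGAC AAAAATTGGG CCCTACCAAG TTGACGCGCT CCTCGGCAAA"
print(translate(clean(first_line)))  # MALAPGTKIGPYQVDALLGK
```

The output matches the first 20 residues of the protein sequence. Likewise, the last 18 bases of the gene (CTCGGTCCGG TGAAGTGA) translate to LGPVK followed by the TGA stop codon, matching the end of the protein.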