Gene Acid345_1736 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1736 
Symbol 
ID4072003 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2106348 
End bp2108561 
Gene Length2214 bp 
Protein Length737 aa 
Translation table11 
GC content60% 
IMG OID637983744 
ProductTPR repeat-containing serine/threonin protein kinase 
Protein accessionYP_590811 
Protein GI94968763 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGTGG TGTATCGCGC GGAGGACACG CGACTGGGGC GGCAGGTCGC CTTGAAGTGT 
CTCCCGGCCG AGTATGCCGC CGATCCGCAG ATGGTGGAGC GCTTCCTGCG CGAGGCGCGA
TCGGCGTCGG CGTTGAATCA TCCGAATATC TGCACGATTC ACGAGGTAGA TGTTGCCAAT
GGGCAGCACT TCCTCACTAT GGAGTTGCTC GAGGGGCAAA GTCTTCGCGA TCGCATTGCG
TCGGGGCCGA TAGCGACGGA CGAGTCGATC CGGATTGCGG TCGCAGTTGC GGACGCACTT
GATGCCGCAC ATGAGAAGGG TATTGTTCAT CGCGACATCA AGCCGGCGAA CATTTTCCTG
ACGGAGCGTG GCGAAGCGAA GGTGCTCGAT TTCGGGCTGG CGAAGCTGGA AAGCTCGAAT
CTCGCGATGT CGGCGACGGT GGATGCGAAC CTCACGAGCC CGGGGCAGGC GATCGGCACG
ATTGCGTACA TGTCGCCGGA GCAGGCGCGC GGCATCGACG TAGATGCCCG AAGCGATTTG
TTTTCGCTGG GGGTTCTGAT CTATGAGATG GCGACGGGAG TTCCACCGTT TCCGGGTGCA
ACGTCGGCGC TGATTTTTGA CGCGATTCTC AACCGGCAGG CGACACGTGC GTCGAAGGTG
AACGCTGCGA CGCCGGCGGG ATTGGAGAAC ATCATCGCGA AGCTGCTGGA GAAAGATCCG
CGGTTGCGCT ACCAGAGTGC GGCGGACCTC CTGGCGGACT TGCGTCGATT GCAGCGGGAC
GGTGGTACCG CCAGCGTTGC TGCGGCGATG CCCGCGCAGA GGGCCCGGAA GACGAGCAAG
GCGATTGATT CGATCGCGGT GCTGCCCTTC AAGAACGCGA CGGGCGATGC TGAACTGGAG
TACATCGGGG AGGCGATCGC TGAAGGTGTG CTCGACGGGC TCTCGCACCT GCCAAAGGTG
AGGATCGTTC CGCGCAGCAA GGCATTTCGT TTTCGCGACG AGGCTGAGGA TCCGCAGAGC
GTTGGAAAGA AGCTTGACGT TCGAGCGGTG TTGACCGGAC GGGTGAGCAA ACGCGGCGAT
CAACTGAACA TTCGCGCCGA ACTTGTCGAT GTAGCAAAAG ACGCGCAGCT CTGGGGCGCA
CAGTTTAGCC GCACGGTGAA CGATGCTCCC GATCTCCACG AAGAGATTGC AAAGCGCGTG
GCTGAGAAGT TAGAGGGGCC GTCGTCTGCG GGATCGAAGG GTGGGAAGAA ATCCGAGAAG
GTCGAAACCA CTTCCGTTAA TAAAGAGGCA CAGGCCTTGT ATTTACGCGG CTCGCACCAC
TTAAACAAGT GGACTGCCGA CGGTGTGCAG CTGGGCATTG AATTGTGCAA GCAGGCGATT
GACCTTGAGC CAACTTACGC CGAGCCTTAC GCGGCAATGG CGATGTCGTA TGCGGTTTCG
GGTGTCCTCG GCTCCTTGGA CGCGGAGCTT TCCCAGCGAC AAGCGAAGGC GCTTGGGCAG
AAAGCACTTC AGCTAAATGA AGCTCTTCCG GAGGCACACG CGGCGTTGGC GATTGTCTGC
TACTTTACTT TCGAGCTTAG TGCGGCCGTA CGGTTTGGCG AGCGCGCGAT CGAGTTGGCT
CCAGACCTGC CCATCGCCCG CTATGCTTTG GCGATGGGTC TCTCCACCAA AGGACGGATC
GAGGAAGCCA CTTCAATTCT GCGTGAGGCT GCCGATACGG ATCCTCTCAT GCTGCCGGTG
AACTACGCCT ACGGCCTGAT GCTCTACTAC AGTCATCGTT GGGATGAAGC CGCAGCCCAA
CTGCGGAGAG CGCTCGAGGT CCTCCCGACG ATGCAACTTG CGCAAGGGAT GAGAAGCGTC
GCGCTGGCGC GAGGCGGCCG CCACGAGGAA GCAAAAGCGC AGTACCAGGA GTTCATCCGC
GATCATCCGA CTACACCTTG GGGAACGATC GAAGCCTACC TCGCAGCGCT CGCCGGCGAA
CGAGAGAAAG CCATGCAATT GCTAGCGGTA CCCAGAACCC ATCCGATTTC CTTGTTCTTT
GCTGCCGGCG CGTACGGCGC ACTAGGAGAA CTCGACCTCG GATTCATTGA ACTGGAGCGC
GCCCGTGAGG TGCGTTTTAG CGTGTTGTGC ACTGCCCGGG CAAATCCGAT CTTCGATCCT
TACCGTTCGG ATCCTCGGTG GCCCGGGTAC CTGGAGTCGC TGCGTCTGAA CTAG
 
Protein sequence
MGVVYRAEDT RLGRQVALKC LPAEYAADPQ MVERFLREAR SASALNHPNI CTIHEVDVAN 
GQHFLTMELL EGQSLRDRIA SGPIATDESI RIAVAVADAL DAAHEKGIVH RDIKPANIFL
TERGEAKVLD FGLAKLESSN LAMSATVDAN LTSPGQAIGT IAYMSPEQAR GIDVDARSDL
FSLGVLIYEM ATGVPPFPGA TSALIFDAIL NRQATRASKV NAATPAGLEN IIAKLLEKDP
RLRYQSAADL LADLRRLQRD GGTASVAAAM PAQRARKTSK AIDSIAVLPF KNATGDAELE
YIGEAIAEGV LDGLSHLPKV RIVPRSKAFR FRDEAEDPQS VGKKLDVRAV LTGRVSKRGD
QLNIRAELVD VAKDAQLWGA QFSRTVNDAP DLHEEIAKRV AEKLEGPSSA GSKGGKKSEK
VETTSVNKEA QALYLRGSHH LNKWTADGVQ LGIELCKQAI DLEPTYAEPY AAMAMSYAVS
GVLGSLDAEL SQRQAKALGQ KALQLNEALP EAHAALAIVC YFTFELSAAV RFGERAIELA
PDLPIARYAL AMGLSTKGRI EEATSILREA ADTDPLMLPV NYAYGLMLYY SHRWDEAAAQ
LRRALEVLPT MQLAQGMRSV ALARGGRHEE AKAQYQEFIR DHPTTPWGTI EAYLAALAGE
REKAMQLLAV PRTHPISLFF AAGAYGALGE LDLGFIELER AREVRFSVLC TARANPIFDP
YRSDPRWPGY LESLRLN