Gene Acid345_3980 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3980 
Symbol 
ID4072453 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4706040 
End bp4709111 
Gene Length3072 bp 
Protein Length1023 aa 
Translation table11 
GC content61% 
IMG OID637986007 
ProductTPR repeat-containing serine/threonin protein kinase 
Protein accessionYP_593054 
Protein GI94971006 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGATCGGGC AGACGATATC GCACTACCGC ATTGTCGAAA AACTCGGCGG CGGCGGCATG 
GGAGTGGTGT ACAAGGCGGA AGACACGCGC CTGCACCGCT TTATCGCGCT CAAGTTCCTA
CCCGATAACG TAGCCGCAGA TCCGCAGGCT CTGGCGCGCT TTCAGCGCGA GGCGCAGGCC
GCATCGGCGT TGAATCATCC GAACATCTGC ACCATTCACG ACATTGGCGA AGAGAACGGC
AAGGCCTTCA TCGCCATGGA GTACCTCGAC GGCGTGACGC TGAAGCATTT GATCGAGGGG
CGTCCGCTGG ACATGGAGCG ACTGCTGCCG ATCGCCATTG ATGTAGCCGA TGCACTCGAC
GCCGCGCACG CCGTGGGCGT GGTGCATCGC GACATCAAGC CGGCGAACAT CTTCGTGACC
AAGCGCGGGC ACGCGAAAAT CCTCGACTTC GGCCTGGCGA TGGTGTCGCC GAGGAGCGAA
TCCGCGACGG TAATCGCTTC AGCGAACACC ATGTCCGCGG CCATCGCGAA GGAACAGCTC
ACTAGCCCAG GCTCGACGCT GGGGACCGTC GCATACATGT CGCCAGAGCA GGCACGCGCG
AAGGAACTCG ACGCGCGCAC CGATCTGTTC TCGTTTGGTG CGGTGCTCTA TGAGATGGCG
ACGGGCACGC TGCCCTTCCG CGGCGATAGC ACCGCGACCA TCTTTGACGC CATCCTGAAC
CGCGCGCCGG TCGCGCCGGT GCGACTCAAT CCCGATCTGC CGGCAAAGCT CGAAGACATC
ATTAACAAGG CCCTGGAGAA AGACCGCAAC CTGCGCTACC AGAGCGCCGC AGAGATGCGT
GCCGACTTGC AGCGGTTGAA GCGGGATACC GAGACGGGCC GCAGTTCGAT ACTCACCGAG
CCGACAGAGG AAGAGTCATC GCGCACCGAA GCAGTTCCCA AGGCGGCGAG CACCCCGAAA
ACCCTCGCAG CAGTAACGTC CTCGTCCACT CCCGCGAGCA GCACGCAGAT CGTAGTGCAA
CGCAGGACGG TGGGCGTGGC CGCGATCATG GTCGCGGTTC TGGTTGTGGC CGTGAGCGTG
GGCGCATTCT TCCTGTTCCG CGGCAACAAG CCGAGCAGTG GGACCGCGAG CGGTAAGCCG
CACAAAGCCG TGGCTGTTCT CTATTTCAGC AATCTCACCC AGGACCCCGC GCTGAACTGG
CTCGACAATG GCTTGACCGA CATGCTCACC ACCAACCTGG CACAGGTGAA GGGACTCGAC
GTCCTCGCCA GTGATCGTGT GATGAGCGCC GTGCAGAAAG CCAGCAAAGA TGGCAAGACG
CTCGATCCCG CGCAGGCGCA GAAGATCGCA CGAGACGCCG GCGCGGACAC ATACATTACC
GGCGCTCTGC TCAAGATTGG CCCCACGCAG TTGCGTCTGG ACGTCCGCGC GCAGGACACC
AGCACCGGCC AGATCGTTTA CAGTGACAAG CTCGAAGGCC AGGATGTGCA GAGCATCTTC
GGCATGGTGG ACCGGCTCAC GGCGAAGCTC GCCGGAAGCT TCCTGCCGGA ATCGGAGGCC
CCGGAGAAGG GTCCCGCGAT TGAGGAAGCA TCCACGTCCA ACGTGGAGGC TTACAAGCAC
TACCAGCAAG GCGTGGATGC CTCGACCCGA TACCTCTACG CAGACGCGAT CCGTGAGTTT
GAGGCGGCCG TGAAGCTGGA TCCGAACTTT GCCCTCGCTT ACATGGCGTT GGCAGACGAG
TACAACCAGG TGGGCGACAC TGAGAAGCGC TTCGAATCTT TCCAAAAAGC GCAAGCTCTT
CGCGCACGCC TGCCGCGCTA CGAGCAGCTC CGGCTGGGAG TGGCCGAGGC AGACCGTGCT
GGGGATCCGC TGGGCCTGAT CCAGGCGCAG GAGGCGCTCG TCGCCGAGTT CCCGCGCGAC
GGCTTTACGC GTGGCGTGCT TGCGTCGCAA CTGAACAATG CCGGGGAACC GGAGAAAGCA
CTGACCTACA TCGAGGAGGG CCTCAAGCTA AACCCGAAGG AGGAGGTGCT GCTTAATTTC
CGCTCGTATA TCCTCGCCAA TCTCGGCGAC TTCCCCGAAG CGTTGGCTTC GAACGAAGCC
TATATGGCGC TGCGTCCTGG AGATCCGAAC CCGCTCGATA CCCATGGCGA CATCCTGTTC
ATTGCCGGGA GGGATGACGA AGCCCTCGCC GCCTATCGCA AGGTGATGGA AGTGAGACCG
GATTTTGGGA GTTCCAGCGA ATATTTCAAG CTTGCGGTGG TGTATACCGA CCAGAAGAAG
CGAGACCTGG CAAAGATTGC CACGGACCAG TTCGCGGCGA AGACTAATGC GCTTACAAAA
GCGTACCTCC CCGCGTTCGA AGCACAGTTC CAGCAGAGTA ATGGCGACTT CGACGGCGCG
ATCGCAGGGT ATAAAGACGC GGTAAAAAGA CTGAAGGCCG CCAACCAACT CGGCCCCGCG
GGACGATTGA TAACTCCGGT GGTGATCCTT TCTGCGCTAA CCGGCAAGAC GAAGGAAGGG
CTGGCGTTTG CGCAAGCGCA GAAGCTGGAC GGCTATGAAC TTACATCCCT GGCGCTGGCG
CAACGGTTGG CAGGAGACCG CGACGCAGCC ATGCAGACTC TGCAGAAGCG TATGACCGTG
CAGCCCTGGA CCTCACCGCG CGCCATCGAG TTCAACCAGG CCGGCCAGGA AGCTGAAGCA
GCCGTGGAAA GTGGAGACGG CGCCGGCGCA CTCTCCGCTT TGGGTCCATA TGTGAAATTT
CACAATCCGC CGCTGTTGTT CATCCGTGCC CGCGCGCATC TGTTGACCAA GGACTACAGT
ACGGCAGAAG CGGAGTTCCG GGGCGCGATC CGCATTACCC GTAACATGGC TAATTTTGGG
AGCATTCTCA ATCGCTTACC CGCCATCGAA ATGCTCTCGC ACTTCTATCT CGGCCAAATC
TATGACCAGA CCGGAAAGCG CGACCAGGCG ATCAACGAGT ACCAGGACTT CCTATCCCAC
TTCGAACACT CGTCGGCAAA GTTGCCGCAA ATTGAAATCG CGCGCGCGGC CGTGAAACGG
CTGATGCAGT AA
 
Protein sequence
MIGQTISHYR IVEKLGGGGM GVVYKAEDTR LHRFIALKFL PDNVAADPQA LARFQREAQA 
ASALNHPNIC TIHDIGEENG KAFIAMEYLD GVTLKHLIEG RPLDMERLLP IAIDVADALD
AAHAVGVVHR DIKPANIFVT KRGHAKILDF GLAMVSPRSE SATVIASANT MSAAIAKEQL
TSPGSTLGTV AYMSPEQARA KELDARTDLF SFGAVLYEMA TGTLPFRGDS TATIFDAILN
RAPVAPVRLN PDLPAKLEDI INKALEKDRN LRYQSAAEMR ADLQRLKRDT ETGRSSILTE
PTEEESSRTE AVPKAASTPK TLAAVTSSST PASSTQIVVQ RRTVGVAAIM VAVLVVAVSV
GAFFLFRGNK PSSGTASGKP HKAVAVLYFS NLTQDPALNW LDNGLTDMLT TNLAQVKGLD
VLASDRVMSA VQKASKDGKT LDPAQAQKIA RDAGADTYIT GALLKIGPTQ LRLDVRAQDT
STGQIVYSDK LEGQDVQSIF GMVDRLTAKL AGSFLPESEA PEKGPAIEEA STSNVEAYKH
YQQGVDASTR YLYADAIREF EAAVKLDPNF ALAYMALADE YNQVGDTEKR FESFQKAQAL
RARLPRYEQL RLGVAEADRA GDPLGLIQAQ EALVAEFPRD GFTRGVLASQ LNNAGEPEKA
LTYIEEGLKL NPKEEVLLNF RSYILANLGD FPEALASNEA YMALRPGDPN PLDTHGDILF
IAGRDDEALA AYRKVMEVRP DFGSSSEYFK LAVVYTDQKK RDLAKIATDQ FAAKTNALTK
AYLPAFEAQF QQSNGDFDGA IAGYKDAVKR LKAANQLGPA GRLITPVVIL SALTGKTKEG
LAFAQAQKLD GYELTSLALA QRLAGDRDAA MQTLQKRMTV QPWTSPRAIE FNQAGQEAEA
AVESGDGAGA LSALGPYVKF HNPPLLFIRA RAHLLTKDYS TAEAEFRGAI RITRNMANFG
SILNRLPAIE MLSHFYLGQI YDQTGKRDQA INEYQDFLSH FEHSSAKLPQ IEIARAAVKR
LMQ