Gene Acid345_3369 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3369 
Symbol 
ID4071287 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3997048 
End bp3998415 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content61% 
IMG OID637985391 
Productserine/threonine protein kinase 
Protein accessionYP_592444 
Protein GI94970396 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGACC AGGTAATCTC GCATTACCGC ATCGTCGAAA AAATCGGCGG CGGCGGCATG 
GGCGTGGTGT ACAAGGCGGA GGACACTCGC CTGCACCGCT TCGTTGCCCT CAAATTTCTC
CCCGACACCG TCAGCACCGA TCCGCAAGCG CTCGCACGTT TCCAGCGTGA AGCCCAGGCT
GCGTCCGCGC TCAATCATCC GAACATTTGC ACCATTTACG ATATCGGCGA GGAGAACGGC
AAAGCTTTCA TCGCGATGGA GTATCTCGAC GGCCAGACGC TGAAGCACCG CGTCGACAAC
CGTCCGCTGG CGATGGACGA GTTGTTGCCC ATCGCGATTG ATGTCGCCGA CGCGCTCGAT
GCCGCACATA CCGCCGGCAT CGTTCATCGC GATATCAAGC CCGCGAACAT TTTCGTTACC
AAGCGGGGCG CTGCAAAGGT CCTCGACTTC GGGCTCGCAA AAGTTTCCGG CAGGAATCCA
TCGTCCAGCC AGATCGCTTC GGCGAACACG ATGACGATGG CGGCGGAAGA CCCGCATCTC
ACCAGTCCCG GCTCCACGCT CGGCACCGTG GCGTACATGT CACCAGAGCA GGCCCGCGCG
AAAGAACTCG ATGCTCGGAG TGATCTGTTC TCATTCGGCT CTGTGATCTA CGAGATGGCC
ACAGGCCAAC TTCCGTTCCG CGGCGACAGC ACGGCCAGTA TTTTCGATGC GATCTTGAAT
CGCCAGCCGA CGGCACCGGT GCGGCTGAAT CCCGATATCC CTGCGGAACT CGAGCGGATT
ATCAATAAAG CTCTCGAGAA GGACCGCGAC CTACGCTACC AGGTCGCGTC GGAAATGCGC
GCGGACCTCA AGCGTTTGCG TCGCGAGATC GAATCTGGCA AGAGTGCGGC GGTGCAAGCA
GCGTCCACAC CCTCGCAGCC TGCCGCGGCC TCACCGTCGC AAACTGTTTT CGCCGCCCCT
TCGTCGTCGG TCGCCGCAGC TTCAGCGCCC TCGGTGCAAG CCCAGTCGTA CGCGTCGCAG
ACCGCGGCAA GCGCGCCGGT GTCCGGGCCG AGCACGGCGG TAATTGAGCC ACCAAAGAAG
AGCAGCAAGA CGCCGCTGTT TATCGGCATT GCGGCCGTTC TTGTTGCCGT GTTGTTCGCG
GCAGGATATT TCGGCGGCTT CTTCGGCTCG CATGGGCCTT ACACCGAGGC TGTCCTCAAG
CCCCAGGCCA TTACTTCGAA CTCGTCGGAG GACCCCGTCT CGGGAGTTTC CATCTCCCCG
GATGGCAAAT ACCTCCTCTT CGCAGACCTT GAAGGACTGC ACCTTCGCCT TTTATCGAGC
GGTGAGACGC AGACCCTGCC TACGCCCGAA TCGTTTTGCT TCCGTTGA
 
Protein sequence
MIDQVISHYR IVEKIGGGGM GVVYKAEDTR LHRFVALKFL PDTVSTDPQA LARFQREAQA 
ASALNHPNIC TIYDIGEENG KAFIAMEYLD GQTLKHRVDN RPLAMDELLP IAIDVADALD
AAHTAGIVHR DIKPANIFVT KRGAAKVLDF GLAKVSGRNP SSSQIASANT MTMAAEDPHL
TSPGSTLGTV AYMSPEQARA KELDARSDLF SFGSVIYEMA TGQLPFRGDS TASIFDAILN
RQPTAPVRLN PDIPAELERI INKALEKDRD LRYQVASEMR ADLKRLRREI ESGKSAAVQA
ASTPSQPAAA SPSQTVFAAP SSSVAAASAP SVQAQSYASQ TAASAPVSGP STAVIEPPKK
SSKTPLFIGI AAVLVAVLFA AGYFGGFFGS HGPYTEAVLK PQAITSNSSE DPVSGVSISP
DGKYLLFADL EGLHLRLLSS GETQTLPTPE SFCFR