Gene Acid345_2583 details


Gene Information

Locus tag: Acid345_2583
Symbol:
ID: 4070546
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3050957
End bp: 3053092
Gene Length: 2136 bp
Protein Length: 711 aa
Translation table: 11
GC content: 57%
IMG OID: 637984600
Product: protein-tyrosine kinase
Protein accession: YP_591658
Protein GI: 94969610
COG category: [D] Cell cycle control, cell division, chromosome partitioning
              [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0489] ATPases involved in chromosome partitioning
        [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID: [TIGR01007] capsular exopolysaccharide family
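
The Start bp, End bp, Gene Length and Protein Length values above are mutually consistent if the coordinates are read as 1-based and inclusive, and if the final codon is a stop codon that is not counted in the protein. Both readings are assumptions, not something the record states. A minimal Python sketch of that arithmetic, with hypothetical function names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon is a stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3050957, 3053092)
print(length, protein_length_aa(length))
```

With the values from this record, the two results can be compared directly against the listed Gene Length and Protein Length.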


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTCCCT CGCACTCCGA ACCTCCGTTC GGTCGTTTGC ACTCCAGCAA CGACGAGTTC 
TCACTCCTCG CCTTAAGTCG AGTACTGTCC GCTCGACGCC GCACCGTCGG ATACATCACC
GGCGCTGGCA TCGCGATCGC AATTCTGATT TCCCTCTTCT CACCGACCAA ATTCGCTGCC
ACTGCAACAC TGGAGTTGAA CCCGGCAAAT GCTACCAGCT TCAATTTGTC ATCTGGGGCG
AGTTCGTCTG ATCACGTCGA AGCAGCAGTC AATCAAGCTA CCCAGGTCGG AATTCTCCAG
AGCGATGAAC TCGCGCTGCA AGTCATCACG AAGCTGAAAC TTGAGGAAGT TGCAGACCGA
CGATCCTTTT TTAAGTTTGG TCGAACAGAC GATCGTTCCG TGCCGCTCAA TCAGACCCCG
GCGCGGACCG GAAGAATGCT CCGTTCTTTT CACCAGAACT TGCGAGTGCA ACCAGTCGGC
GGCACCCGCC TCATCGAGAT CCGATACCTC GATCGTGATC CTGCTCTCAC CGCTGCGGTG
GTCAACGCCC TTGTGGACGA CTATCTCGAT CGTCATGTCC AGACTCGCTT CACCGCCACT
CGCCAAGCAT CCGATTGGCT CTCGCGGCAA CTCACCGATC TAAAGAACGA CGTTGAGACC
TCCCAGCAGA GACTCGCTGA TTATCAACGT GAAACCGGCA TTCTTGGCGA GAGCGAGACC
AACAACATTG TCACCGCAAA ACTTGAGGAC ATTAACAAGC AGCTAAGTGC AGCAGAAGCA
AATCGCATCG TGAAAGAAGC CGTCTGGCGT CTTGCCAAGA GCGGAGATCC CGAGCTCATC
TCCTCGATGG CAGGCACTTC CCTGGTGCCA GGCATTGCGA GTAGCTCCGC TCCCCTCGGA
CTTCTGCCCA CTCTCCGCGC CCAAGAGGCG CAGCTCAAAG CCGACATCGC GCAAAACTCC
GAACGCCTCG GGCCTTCATT CCCAAAACTC GTGCAGATGC GTGCCCAACT CGCCGACCTC
CAGTCCTCTA TTCAATCCGA ACTCGGCAAG ATTTCCGCCC GCGCAGAGAA CGACTACCTC
GCCGCCAAGA ATGCCGAAGA CATGGAGCGC GCCCTCTTCG CAAAGCAAAA ACAAGAAGCG
AACCAGATCA ACGACAGCGC CGTGCAATAC GGCGTCCTGA AGCGCGAAGC AGATGCCAAC
CGCGACCTCT ACCAAACATT GCTTGGCAAG CTCAAAGAGG CAGGCGTCCT CGCTGGCCTC
CATTCATCGG ACATTCTTGT CGTTGACTCA GCGCGCGTTC CGGACAAACC GGCCAGCCCC
AAACGCTTAC TCAACCTTGT GCTCGGCGCC GTCATAGGAC TGATTCTCGG CGTAAGCATT
GCCCTCCTCC AGGACAGCCT CGACCGCACC ATTCGCTCGC CCGAAGACGT TGCGCGCGTA
AGCAACATTC CGACAATTGG CGTAATTCCA GTTCACTCGG ACGACGACAA GAGCGAGGCC
GACCAGCGCT TCCGCGCCCT CCGCGGGAAC CTGCCAGCAG GATCGCCGCG GGTCACCGTC
GTTTCCGGCC CAGCGCCTGG TGAAGGCAAG ACAACCGTTG CCATTCATCT CGCTCAATCG
CTTGGGCGGC TCGGACGTCG AGTGCTCCTC GTGGACGCCG ACCTACATCG TCCCAGCGTT
CACAAGTACC TCAAACTCGA CAACTCGTCT GCAGGCCTCA GCGAATTGCT CACTGATTCA
CATCTTCTCT CGGGAGATAG CCGAACGCTG CCGGATGGAA TCGCGCTATT ACTTGCCGGT
ACAGCAACTG AGCAAGCCAT TGATCACGTC GAATCACCTC GGATGGGCGC CCTCATCGAT
CACTGGCGCT CTACATACGA CGATATCGTT ATTGATACTC CGCCCGTCCT CGCTTACAGC
AACGCGGTTT CCATCTCGAA ATTTGCTGAC GCGGTCCTCC TCGTACTCCG CGCCGGGCAA
ACCAGCAGCG ACGCCCTTGT TCGCTCACTC GAAATCTTCG AACAGTCAGG TGTGAAAGTA
AGCGGCGCCG TTCTCAACCG ACTCGATTTC CACTCGCCCT ATTACAAACA CTATTACGGC
AACGACTACC GCTCCCTGCA AGAGCCCACT TCATGA
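
The gene sequence above is printed in space-separated blocks of ten bases. A minimal Python sketch of how one might join those blocks and compute a GC fraction to compare against the reported GC content. The helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first line of the sequence is used here for illustration:

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; raises on an empty sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence shown above; paste the full sequence
# here to compare with the GC content listed in the record.
first_line = "ATGTCTCCCT CGCACTCCGA ACCTCCGTTC GGTCGTTTGC ACTCCAGCAA CGACGAGTTC"
seq = join_blocks(first_line)
print(len(seq), round(100 * gc_fraction(seq)))
```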
 
Protein sequence
MSPSHSEPPF GRLHSSNDEF SLLALSRVLS ARRRTVGYIT GAGIAIAILI SLFSPTKFAA 
TATLELNPAN ATSFNLSSGA SSSDHVEAAV NQATQVGILQ SDELALQVIT KLKLEEVADR
RSFFKFGRTD DRSVPLNQTP ARTGRMLRSF HQNLRVQPVG GTRLIEIRYL DRDPALTAAV
VNALVDDYLD RHVQTRFTAT RQASDWLSRQ LTDLKNDVET SQQRLADYQR ETGILGESET
NNIVTAKLED INKQLSAAEA NRIVKEAVWR LAKSGDPELI SSMAGTSLVP GIASSSAPLG
LLPTLRAQEA QLKADIAQNS ERLGPSFPKL VQMRAQLADL QSSIQSELGK ISARAENDYL
AAKNAEDMER ALFAKQKQEA NQINDSAVQY GVLKREADAN RDLYQTLLGK LKEAGVLAGL
HSSDILVVDS ARVPDKPASP KRLLNLVLGA VIGLILGVSI ALLQDSLDRT IRSPEDVARV
SNIPTIGVIP VHSDDDKSEA DQRFRALRGN LPAGSPRVTV VSGPAPGEGK TTVAIHLAQS
LGRLGRRVLL VDADLHRPSV HKYLKLDNSS AGLSELLTDS HLLSGDSRTL PDGIALLLAG
TATEQAIDHV ESPRMGALID HWRSTYDDIV IDTPPVLAYS NAVSISKFAD AVLLVLRAGQ
TSSDALVRSL EIFEQSGVKV SGAVLNRLDF HSPYYKHYYG NDYRSLQEPT S
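
The protein sequence above corresponds to a codon-by-codon translation of the gene sequence. A minimal Python sketch of that step follows, with hypothetical names. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code; table 11 differs in which codons may act as start codons, and this sketch does not handle that (the gene here begins with ATG). Translation stops at the first stop codon.

```python
from itertools import product

# Codon table in the compact NCBI layout: codons enumerated over T, C, A, G
# with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, as printed in the record.
print(translate("ATGTCTCCCT CGCACTCCGA ACCTCCGTTC"))
```

Passing the full gene sequence to `translate` should allow a direct comparison with the protein sequence listed in the record.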