Gene Acid345_0835 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0835 
Symbol 
ID4072361 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1036662 
End bp1038893 
Gene Length2232 bp 
Protein Length743 aa 
Translation table11 
GC content55% 
IMG OID637982844 
Productprotein-tyrosine kinase 
Protein accessionYP_589914 
Protein GI94967866 
COG category[D] Cell cycle control, cell division, chromosome partitioning
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0489] ATPases involved in chromosome partitioning
[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID[TIGR01007] capsular exopolysaccharide family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAAAAGT TCTCACAAAT CGAAAATGTT CAAGGTGCGT CACGTACCAG CTGGGCAGAC 
GTGCAGGAAG ATCGCGAAGC GGATCTCCTG CATGTGCTGT ATGTGTTGCG TCGTCACCTC
CAAATGATCA TCGGTGTCAC CGCATTCGGT GTCCTCGTTG CAATCCTCTT CTGCCTATTC
ATGAAACCGC GCTACGAAGG CATGGCAGAT TTAAACGTGC ATCCTGAAGA ATCTGCTGCG
CTCGACATGG GCGCGCTTGG AGACCTGGCG ACCGGCGCGG GTGGCTTAGA TTGGAGTTCC
AAACTCGAGA CACAAGCGCG CATCTTGAAG AGCGACACTC TGGCGTGGGA TGTGATCTCC
CAATTGCGGC TCGATCAGAA TGAGGCGTTC GCAACCAAGT CGCTCTTCGG TCGGTCTTTG
CAAACGCCCG TAGGGAAGGA CGTCAGTTCC GTCGATGAGG CACGCAAATC CAAGCTGCTT
ACGCGCTTTT CCAAGGCCCT TCGTGTTGAA GCTGTTCCGA AGACTCAGGT AATAGAGATT
CGGTTCCGCA GCACCGATCC AGCGCTCGCC GCGAAGGTCG TCAATACGCT CACCTCGTCG
TATATGCACC ACAATTTCAT GACTCGCTTC GAAGCGACGA TGCAGGCGTC AGCTTGGTTG
CAGCAGCGCT TGACGGAGCT CAAGAACAAC GTCGAGGAGT CTCAGCGAAA ACTCGCGGAG
TATCAATCGA AAGCGAACAT CATCGGAACC GACGAAACGG ACAATCTTGC AGTTTCTGAT
CTTACCGACG TCAGCAAGCA GCTGACCGAT GCAGAATCCG ACCGGATCAT GAAAGAAGCC
AAGTATCGGC TCGCCCAGAC CGGCAATCCG GAGCTGATTG GCACCATACT TCCGGATAGC
GTGTTGCCGG TTCTCCGCTC ACAAGAAGCC GATCTCAGAA ATCAATTGGC ACAGTACAGT
ACCAAGTTTG GCTCGAACTA TCCAAAAGTG ATCCAGCTCA ACAATCAGTT GGCACAAACG
GACGCCTCGT TGAAAAAAGA AATCCGAGAC ATCGAGGAAC GCTTCCGCAC TGAGTATGAA
TCGGCGAAAC GTACCGAAGA TCAACTGCGA GCTTCAGTTG AAAACCGTAA GAAGGAGGCG
TTCTCGCAAT CGGCCAAGTT CAGCCAGTAC GACATTCTCA AAAATGAAGT AGCGTCTGGC
CGGAGTCTAT ACGAAGACCT CCTTCGCAAA CTGAACGAAG CGGGAATCGC GGCCGGGTTG
AAGTCAACCA ATGTTGATGT GATCGACCCT TCGGCGGTAC CTCAGCTTCC GGTCTTGCCG
AATGTTCCGC TCTTTATCGC GCTTGGGTTG TTTGGTGGCG CATTCCTTGG GGTGTGCAGC
GCATTCGTGA AGGAGAGCAT CGACCAAACC ATCAGTTCGC CCGAGGATGC GGAGGAGATG
GCTGGCATCT CGACCATCGG CTTGATTCCT CACTTCTCGA TGGAGGGACT CAATGCTCTC
GCGACTGAAA ATCAGTCGGT TCTAGCTCGG GTACCTCTCG CCGCAGACCG CCCCCAGTCG
AAGCTCGCAG AAGCCTTCCG TGCCCTACGT ACCTCTCTCC TCCTTGCCAA TGCTGGAGCG
CCTCCGAAAG TCATCATGAT AACGAGCGCA CAACCTGGCG ATGGCAAGAG TACCGTGAGT
GTCAACATTT CAGCTGTGTT GGCACAGTCA GGTGCCCGAG TGTTACTCGT TGATGCGGAC
TTGCGCCGCG GCGTGCTCGC CAGAAATCTC AAGGTGATGC CAGAGGGAGG CCTTAGCGAG
TGCCTCGCAG GACGAAAGTC TTGGCGCGAT CTGATCATCC CGGTGAACGG CGTTGCGAAC
ATGTGGGTGC TTCCGGCCGG CCATCGGCCT CCGAGCCCGG CTGATCTTTT CACCTCCAAC
AAAATGGAGG AAATCCTGAA TGAGTGGCGC GGAGCGTTTG ACCACGTCGT GATCGACACT
CCTCCGGTTG TTGCAGTGAC AGATGGCGTC GTTCTTTCGC AAAAAACGGA TGCCGTGCTC
CTGATTGCGC GCGCGTCGAG AACTGGCCGA CACCCGCTGC GTCATGCGCG CATGCTGCTG
GCGAAGGTTC GTGCAAATGT CGTCGGCCTG GTTGTGAATG ACTTCGATGC GAAAGCGAAA
TACTACGGGT ACTCATACAG TTACGACAAG TACTACGTCG AGGATAAAAC CGAGACGACC
GTTAACAACT AA
 
Protein sequence
MEKFSQIENV QGASRTSWAD VQEDREADLL HVLYVLRRHL QMIIGVTAFG VLVAILFCLF 
MKPRYEGMAD LNVHPEESAA LDMGALGDLA TGAGGLDWSS KLETQARILK SDTLAWDVIS
QLRLDQNEAF ATKSLFGRSL QTPVGKDVSS VDEARKSKLL TRFSKALRVE AVPKTQVIEI
RFRSTDPALA AKVVNTLTSS YMHHNFMTRF EATMQASAWL QQRLTELKNN VEESQRKLAE
YQSKANIIGT DETDNLAVSD LTDVSKQLTD AESDRIMKEA KYRLAQTGNP ELIGTILPDS
VLPVLRSQEA DLRNQLAQYS TKFGSNYPKV IQLNNQLAQT DASLKKEIRD IEERFRTEYE
SAKRTEDQLR ASVENRKKEA FSQSAKFSQY DILKNEVASG RSLYEDLLRK LNEAGIAAGL
KSTNVDVIDP SAVPQLPVLP NVPLFIALGL FGGAFLGVCS AFVKESIDQT ISSPEDAEEM
AGISTIGLIP HFSMEGLNAL ATENQSVLAR VPLAADRPQS KLAEAFRALR TSLLLANAGA
PPKVIMITSA QPGDGKSTVS VNISAVLAQS GARVLLVDAD LRRGVLARNL KVMPEGGLSE
CLAGRKSWRD LIIPVNGVAN MWVLPAGHRP PSPADLFTSN KMEEILNEWR GAFDHVVIDT
PPVVAVTDGV VLSQKTDAVL LIARASRTGR HPLRHARMLL AKVRANVVGL VVNDFDAKAK
YYGYSYSYDK YYVEDKTETT VNN