Gene Acid345_3804 details

Gene Information

Locus tag: Acid345_3804
Symbol:
ID: 4071088
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4495108
End bp: 4497432
Gene Length: 2325 bp
Protein Length: 774 aa
Translation table: 11
GC content: 56%
IMG OID: 637985827
Product: protein-tyrosine kinase
Protein accession: YP_592878
Protein GI: 94970830
COG category: [D] Cell cycle control, cell division, chromosome partitioning
    [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0489] ATPases involved in chromosome partitioning
    [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID: [TIGR01007] capsular exopolysaccharide family
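
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 4495108
end_bp = 4497432

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(f"Gene length: {gene_length_bp} bp")
print(f"Protein length: {protein_length_aa} aa")
```

Under these assumptions the computed values agree with the listed Gene Length (2325 bp) and Protein Length (774 aa).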


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.684572
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACTCCT TGTTTAGAAA ATCGGAATTG CCTCCTCTCG GGGTTTCCGA ACAATCCACG 
ACAAACGCCG GACTCGAAGA CGAAGATCTG TCTCTAAGGC AGGTCATCAG GATTTTACGG
AAACGCAAGA ACCTGATTCT CGGGGCCGCC GGTGCCTGTC TTGCTTGTGC GCTTTTGTTG
TCATTTGTGA TGCGGTCCTA CTACAACAGT TCCGCGACGA TTGAGATTCA GAAGGACCAG
GATGCCTCCC TAGGAAGTGC CCTCGATACC CTGGCGAGTA CGGTGGGCGG CGGAAGCGGC
GACACGAAGA CGGAGGTTCA AACGCAAGTC GCGATTTTGC AGAGCGATGA CCTTGCGATT
GAGACGATCG AGAGAACCGG CTTTGAGAAA CACATCGGGT CGTATTGGCA CCTGTTTCCC
AGTGGGCGCG TGACTTCGGA ACAAGGGCTG CCGTTACGCA ATGCCCCGCG TCAACGTGAA
GCAATGTTGA AGAAATTCGG AAAGGCACTC ACGGTTACGC CGATTCCGGA TACGCGTTTG
ATCCAAATCG CGTTCGAAGA CCCCGACCCA AATTTTGCTG CGCAGGTACT CAACACCCTT
ATTGGCCAAT ACACACAGGA CGTGCTTACT CGGCGAAATG CATCCACAAC GCAAGCATCG
GAGTGGATGA GCGCACAGAT CAAGGACTTG AACCAGCAGG TAGAAGCGGC GCAGCAAAAA
CTGATCAATT ACGAGAAAGA GAGCGGATTG ATCGCGATTA CCGCCACTGA TCCGAAGAAT
CCCGCCGCTG CACCGGTACT GCATATACCT GCTTTGGACC GATTGACTGC ACTGAATCAA
GATCTGGTGA CAGCTGAGGC ATCGCGTGTC ACGCGAGAAG CGCTGTACCG TCTTGCGAGA
TCGGGAGACT TGGATCGCCT CACGAGTCTG GGCGCGGAGT CGCTCACTTC TCCGACGGAT
CAGGCGCAGG CGGCGATGTT TACGAACCTC CAGGCCCTGC GTCAACGCCA ATCGCAATTG
AAGCAACAGA TGGCGTCAGC AGTACAGATC TACGGAGCCA GAAATCCACA TTTGGTGGAC
GTCGACAAAC AACTATCGGA TGTCGACTCC CAAATTAAGG GCGAGCTTAA GCTGATTGTC
GATCGTACAG AGCTAGATCT CGAACTCGCG AGGCAGTCGG AAGACGCATT GCGGCAAGCC
TATGAGAAGC AGGAGCAGGA AGCGAACAAG ATCAACGACT CCCAGATTCG ACTCGCGGTG
TTGCAGCAGG AGGCCGACTC GACGCGACAG CTCTATGAGA CCCTCTACGG ACGCTTGGAG
GTTTCGAAGC TGGATGAAGG TATCAAGAGC ACGAATGTCG CAGTCATTAG CGCCGGACTA
CCTCCCGCTA AACCCTCTCA TCCGGATCCG CTCGTCAACG GAATCGTCGG TTTAGGGGGC
GGGCTCTTCG TTGGATTGAT TTTGGCGTTT GTGGTCGAAA CGTTGGATGA CTCCGTTGCC
ACCACGACTG ACGCCGAGGA ATTGACTGGG ATCGCCGTCC TCGGTGTGAT CCCGATTGTG
AATGAGAATC CGCGGGTTCT GCGCGCAGCA GGACGCGCGA AGGCCAACGG AAGCAATGGG
AGTAAGCCGG GGCGGCCGAC CGCGATGGTC TACGAGCAAG CGATGGCCAC CGAGGCATTT
CGCTCGCTGC GCACGACGCT GTTGTTGTCG CAAGCGGGAT CGGCGCCCAA GTCGCTACTC
TTAACAAGCT CATTGCCTGG AGAAGGGAAA TCCACCACGA CGTATGGATT GGGCAGATGC
TTTGGCAGCC TAGGAACACG CGTCCTACTA ATCGACGCCG ACTTGCGCAG ACCGACCCTC
CATAAGCACG CGATGAAGGA GAACGACCGC GGCCTAAGTA ATCTGTTGAC GTCGGTCGCA
GAACCGAGTG ATTTCATTCA GAAGGACTCG AGCGCGCCAA ACCTCGACAT TCTTTGTGCG
GGGCCAATTC CGCCGAATCC CGCAGAATTG CTGGCTTCGA ACGTGTTCTC GGACCTCCTC
AAGCGAGTGG TACTTGAATA CGACCTCGTG CTGATTGATA GCCCGCCGGC AATGCTGGTT
TCCGATGCGG CGATCATCTC GTCTCGAGTG GATGGCGTGG TGCTGGTCGC ACGCGCCGGA
ATAATCACTC GCGCAGCGCT GGGCAAAGCA GTCGAGGTCT TGCGCCGAAA CAAGGCGCCT
CTGCGCGGAT TGGTGCTAAA TGCCGTCAAT ACCAAGGGCA CCGACTACTA CTATTCCCAT
GGTTACTATG GGTATGACTC GTACGGCTCG AACGGGAATG CTTAA
 
Protein sequence
MDSLFRKSEL PPLGVSEQST TNAGLEDEDL SLRQVIRILR KRKNLILGAA GACLACALLL 
SFVMRSYYNS SATIEIQKDQ DASLGSALDT LASTVGGGSG DTKTEVQTQV AILQSDDLAI
ETIERTGFEK HIGSYWHLFP SGRVTSEQGL PLRNAPRQRE AMLKKFGKAL TVTPIPDTRL
IQIAFEDPDP NFAAQVLNTL IGQYTQDVLT RRNASTTQAS EWMSAQIKDL NQQVEAAQQK
LINYEKESGL IAITATDPKN PAAAPVLHIP ALDRLTALNQ DLVTAEASRV TREALYRLAR
SGDLDRLTSL GAESLTSPTD QAQAAMFTNL QALRQRQSQL KQQMASAVQI YGARNPHLVD
VDKQLSDVDS QIKGELKLIV DRTELDLELA RQSEDALRQA YEKQEQEANK INDSQIRLAV
LQQEADSTRQ LYETLYGRLE VSKLDEGIKS TNVAVISAGL PPAKPSHPDP LVNGIVGLGG
GLFVGLILAF VVETLDDSVA TTTDAEELTG IAVLGVIPIV NENPRVLRAA GRAKANGSNG
SKPGRPTAMV YEQAMATEAF RSLRTTLLLS QAGSAPKSLL LTSSLPGEGK STTTYGLGRC
FGSLGTRVLL IDADLRRPTL HKHAMKENDR GLSNLLTSVA EPSDFIQKDS SAPNLDILCA
GPIPPNPAEL LASNVFSDLL KRVVLEYDLV LIDSPPAMLV SDAAIISSRV DGVVLVARAG
IITRAALGKA VEVLRRNKAP LRGLVLNAVN TKGTDYYYSH GYYGYDSYGS NGNA
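
The gene and protein sequences can be spot-checked against each other by translating the two ends of the gene sequence. This is a rough sketch rather than a full translation tool: it assumes the sequence is given in coding orientation and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ only in permitted start codons). The helper name `translate` and the constants are illustrative.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
# Table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    usable = len(dna) - len(dna) % 3    # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 and last 15 bases of the gene sequence, copied from the record.
first_30 = "ATGGACTCCT TGTTTAGAAA ATCGGAATTG"
last_15 = "AACGGGAATG CTTAA"

print(translate(first_30))  # compare with the start of the protein sequence
print(translate(last_15))   # compare with the end, plus the stop codon
```

The first fragment yields MDSLFRKSEL and the last yields NGNA followed by a stop, matching the start and end of the protein sequence listed above.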