Gene Acid345_3100 details


Gene Information

Locus tag: Acid345_3100
Symbol:
ID: 4072664
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3684086
End bp: 3685516
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 59%
IMG OID: 637985119
Product: periplasmic sensor signal transduction histidine kinase
Protein accession: YP_592175
Protein GI: 94970127
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
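
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. The function names are hypothetical helpers for illustration and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3684086
END_BP = 3685516
GENE_LENGTH_BP = 1431
PROTEIN_LENGTH_AA = 476


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3685516 - 3684086 + 1 gives 1431 bp, and 1431 / 3 - 1 gives 476 aa, matching the listed gene and protein lengths.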


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.718748
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCTCAA AGATTCTTAG CCGTCCGAAG GACACCGCGG CGTGGCGGCT TTCGATCTGG 
ACGACGATTG CCTTCGCAGC GGGCAGCGCG ATCGCGTTTG GGATCGTGTA TTACATGGTG
TCGCTGGGCA TTCGCGAACG CAGCGACCAG TGGCTGGTTG GCGAATCGGA GACACTGAAG
GAAGTTTCAG ATGCGACGCC GCGAGACAAT CTTTACCAGC GCGTAATTGA AGAAACCGCG
CAAAACGCAG CCCACGAGAT TCCGGGTGAG CACGAAACCG AGGACGAGAA CCGGAACTCG
GTGTTCTTTT TGCAAATCGA CAACCTTGGG GAGCCGCTGT GGTATGGCCC GGAGAACAGC
CGCGACCGAT TCGTTGAAGC GGTCCGAGGT CTTCAACCGG GTTCGCCGCA GACACTGAAA
ATTCCTGAAA ATGTGGTTCC CTATCGCGTG GTGGTACAAG ACCTCAAATC CGGCGGCACA
ATCTACCTGG GACTTTCCGA TATTGGCGCC GTAGAACTGC TGCACCGGCT GATGCGCGGA
TTCTTCATTG TGTGGATCGG CATGGTGCTG CTGGGATTGA TGATCTCGTA TCTCAGCGCC
CGCCGAACGC TTTTACGCGT GGAGACGATC ACGCAGACGG TGTCGCGCAT TGGCAGCGAG
GACCTGAGCG CCCGGCTGGC GGAGGCGCAC AATGCCGATG AAATCGCGCG GCTGGCGCAG
ACTTTTAACC GCATGCTCGA TCGCATCCAG GCATCTGTGA ACCAACTGCG AACGGTCACC
GGTGCGGTCG CGCACGACAT GAAGAGTCCG GTAACGTCCA TCCGCGGCAA ATTGGAAGTT
GCCTTGCTCG AAGGCAGCGC GGCGGATTGG CGTGAGCCAG TGGCGGAAGC GGTAGAGGGC
CTTGACCGGC TGTCGCAGTT CATCAATACA ACGCTCGATC TTGCCGAGGC GGAAGCGGGA
GCCTTGCCGC TGCGAAAAGA GCCGGTGGAC TTCGGCGCGC TAGTGGAACA ATTCGTTGAC
ATCTACACGC CAGCGTTCCA CGAAAATCAT CACCAGGTTC ACGTGCAGAT ACACGAGCCG
GTAACGGTGG ACGTGGACGT GAGTCTAACC AACCGCATGC TTTCGAACCT GCTCGACAAC
GAGATGGCAC ACCTGCCGCC GGGCTGCAAG ATTGACATTG AAGTGATGGC GCGCGAGCAG
CAGGCAGAGC TCGTGATTCG CGATGACGGT CCGGGCTTCC CTGCTGAGTT GAAGGCGCAC
GCGTTCGAAC GGTTCGTGAA GGGCAAAGAG TCCAAGGGAC ACGGACTGGG CCTGGCGTTC
GTAGATGCCG TGGTGCAGGC ACATGGCGGA AATGTTGAGA TTGAAGACAC CCCGGGTGGG
GGTGCAACGA TTCGAATCTT AATGCCGCTG GTGGCCGTGA GCGTGGGATG A
 
Protein sequence
MFSKILSRPK DTAAWRLSIW TTIAFAAGSA IAFGIVYYMV SLGIRERSDQ WLVGESETLK 
EVSDATPRDN LYQRVIEETA QNAAHEIPGE HETEDENRNS VFFLQIDNLG EPLWYGPENS
RDRFVEAVRG LQPGSPQTLK IPENVVPYRV VVQDLKSGGT IYLGLSDIGA VELLHRLMRG
FFIVWIGMVL LGLMISYLSA RRTLLRVETI TQTVSRIGSE DLSARLAEAH NADEIARLAQ
TFNRMLDRIQ ASVNQLRTVT GAVAHDMKSP VTSIRGKLEV ALLEGSAADW REPVAEAVEG
LDRLSQFINT TLDLAEAEAG ALPLRKEPVD FGALVEQFVD IYTPAFHENH HQVHVQIHEP
VTVDVDVSLT NRMLSNLLDN EMAHLPPGCK IDIEVMAREQ QAELVIRDDG PGFPAELKAH
AFERFVKGKE SKGHGLGLAF VDAVVQAHGG NVEIEDTPGG GATIRILMPL VAVSVG
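
The sequences above are printed in blocks of 10 characters, 60 per line. A minimal sketch of how such a listing might be joined into one string and translated follows. It uses only the first line of the gene sequence as input. The helper names are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares with table 1 (the two differ in permitted start codons, not in codon meanings).

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# First line of the gene sequence from this record.
FIRST_GENE_LINE = (
    "ATGTTCTCAA AGATTCTTAG CCGTCCGAAG GACACCGCGG CGTGGCGGCT TTCGATCTGG"
)


def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    dna = join_blocks(FIRST_GENE_LINE)
    print(len(dna), translate(dna))
```

Translating these first 60 bases yields MFSKILSRPKDTAAWRLSIW, which agrees with the first 20 residues of the protein sequence listed above.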