Gene Acid345_2989 details


Gene Information

Locus tag: Acid345_2989
Symbol:
ID: 4068890
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3539792
End bp: 3541549
Gene Length: 1758 bp
Protein Length: 585 aa
Translation table: 11
GC content: 63%
IMG OID: 637985008
Product: multi-sensor signal transduction histidine kinase
Protein accession: YP_592064
Protein GI: 94970016
COG category: [T] Signal transduction mechanisms
COG ID: [COG5002] Signal transduction histidine kinase
TIGRFAM ID:
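
The coordinates and lengths listed above are consistent with each other, and the arithmetic can be sketched in a few lines of Python. This is a minimal illustration, not part of the database record: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 3539792
END_BP = 3541549
GENE_LENGTH_BP = 1758
PROTEIN_LENGTH_AA = 585


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon (assumed to be included).
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)       # True
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```

Under these assumptions, 3541549 - 3539792 + 1 = 1758 bp, and 1758 / 3 = 586 codons, of which one is the stop codon, leaving 585 residues.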


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.461641
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGAGGAA GGATTTTCCT CAAACTCTTT TTTGCTTATC TCGTCGTTAT CGCCGCCTGC 
ACCCTGACCC TCGATATTGC CATCCGTCGC GCTTGGGTCA ACTCTCTTCA AACTGAAATC
GAATCCTCGC TGCGCGAAAA AGTGCAGCTC TTCGCGCTGC GCGTCCAGGC TGAACGCAAT
GCCGACTTGC CGTCGATCGC CCGCGTAGTC TCCAAGGCCG CCAACGCTCG TGCCACCGTC
ATCACCTCCG ACGGCCTCGT TCTGGCCGAC TCCGACGCCA ACCCCGCCGA CATGGAGAAC
CACGCCACGC GGCCTGAGTT CGTCGCCGCA TTGCATGGCA ACATCGGCAC CAACACGCGC
CGTAGCCATA CTCTCGGCAT CAACTTTCTC TACACCGCAG CCCCCATTCC CGGTGGCGCC
GTTCGTCTCG CTTATCCGCT CTCCGCGCTC GATGCCATTA CCGGACAAGT TCGCCACACC
CTGCTGCTGA GTTCGCTGCT CGCCGCCGCG CTTGCGCTCC TGCTCGCATC ATTTGCCACC
CAAAACATCA GCCGACGCCT CAAGGCCATC GTCTACTTCG CTGATCGCAT CGCCGCCGGA
GATCTCTCGG CGCGAATCGC TGAAGATTCC ACCGACGAAA TCGGCAAGGT CGCCACCGCG
CTCGACAAGA CCGCTCGCCA GTTGCAGGAA GTATTCGCGC AAGTTGAAAG CAGTCGCCAG
GAACTCGAAG CCCTGCTCAA CTCCATGCAG GAAGCCGTCA TCGCCGTCTC CAACGATGGC
CGCCTGATGT GGGCGAATCA GCGCATGGAA CGCATGCTGC CCACCGGTAT TCGACGCAAT
GGCAAGATCA TCGAGACCGT CCGCGACCCC GAGTTCCTCC GCGGCGTGCA GGAGGCACTC
GAGAGCCGCC GTGTCACCTC GACCCGCGTG AGCACTCTTC TGCCCGGACG CACCTTCACT
GCGACCACCG CGCCTATGCC CGGCGGCGGC GCAGTGGCGG TTCTCCACGA CCTCACCGAA
ACCGAGCGCG TTGAAAAGAC TCGCCGCGAC TTCATCGCCA ACGTTTCGCA CGAATTGCGC
ACGCCGCTGA CTTCCATCCA GGGCTATGCC GAAACTTTGA GCGAGGCCTT CCGCGAAGAT
GATCCGGCCC GCGAATTCGT GGAGATCATC CGCCGCAACG CCCAGCGCAT GTACCGGCTC
ACCGAAGATC TCCTCACTCT CGCCCGCGTC GAAAGCGGGG AACAGCGCAT GGAGTTCCAC
GCCGTCACTC CCGGCGAGCT CCTTGCGGAA GCGGAATTAA ATTTCCAGGA CCACCATCGC
GGCGCGGGCA TCGAGCTCTC GGTGATGAAC ACCGCGCAAG GCGAAGTCGA AGTCGACCGC
GACGCCCTTC GCCAGGTCTT CAGCAATCTG CTCGACAACG CCGTGAAGTA CGGCGGCACC
GGCGGCAAAA TTCTCCTCGG CTCCCGCGAC GTCGAAGACC GCGTCCTCTT CTACATCCGC
GACTTCGGCC CCGGCATTCC CTCAGAGCAC TTGGCGCGTC TCTTCGAACG CTTCTACCGG
GTCGACAAAG CGCGCTCCGT GCAATCCGGC GGCACCGGCC TCGGCCTCGC CATCGCCAAA
CACATCGTCC GCGCCCACGG CGGCGAAATC CGCGCCGAAA GCGAACTCAA CCACGGCGCC
GTCTTCTCCT TCATCCTCCC GCGCAAGCAA CTCCTGTTCA TGCCCGCCCG CAGCAACAAA
ACCAGCGCCG TTTCGTAG
 
Protein sequence
MRGRIFLKLF FAYLVVIAAC TLTLDIAIRR AWVNSLQTEI ESSLREKVQL FALRVQAERN 
ADLPSIARVV SKAANARATV ITSDGLVLAD SDANPADMEN HATRPEFVAA LHGNIGTNTR
RSHTLGINFL YTAAPIPGGA VRLAYPLSAL DAITGQVRHT LLLSSLLAAA LALLLASFAT
QNISRRLKAI VYFADRIAAG DLSARIAEDS TDEIGKVATA LDKTARQLQE VFAQVESSRQ
ELEALLNSMQ EAVIAVSNDG RLMWANQRME RMLPTGIRRN GKIIETVRDP EFLRGVQEAL
ESRRVTSTRV STLLPGRTFT ATTAPMPGGG AVAVLHDLTE TERVEKTRRD FIANVSHELR
TPLTSIQGYA ETLSEAFRED DPAREFVEII RRNAQRMYRL TEDLLTLARV ESGEQRMEFH
AVTPGELLAE AELNFQDHHR GAGIELSVMN TAQGEVEVDR DALRQVFSNL LDNAVKYGGT
GGKILLGSRD VEDRVLFYIR DFGPGIPSEH LARLFERFYR VDKARSVQSG GTGLGLAIAK
HIVRAHGGEI RAESELNHGA VFSFILPRKQ LLFMPARSNK TSAVS
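
The gene sequence begins with GTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is an accepted initiator codon and is read as methionine at the start position. The following Python sketch shows how the first and last blocks of the gene sequence map onto the protein sequence. It is a hand-rolled illustration rather than the tool that produced this record: the function name `translate_cds` and its `starts_at_initiator` parameter are hypothetical, and the codon table is built from the standard NCBI base ordering (T, C, A, G).

```python
from itertools import product

# Codon assignments for NCBI translation table 11 (same residues as the
# standard code), listed in the NCBI base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiator codons permitted by table 11; each is read as Met at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str, starts_at_initiator: bool = True) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and starts_at_initiator and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate_cds("GTGAGAGGAA GGATTTTCCT CAAACTCTTT"))  # MRGRIFLKLF
    # Last 18 bases of the gene sequence above (not a start position).
    print(translate_cds("ACCAGCGCCG TTTCGTAG", starts_at_initiator=False))  # TSAVS
```

The two outputs match the first ten and last five residues of the protein sequence listed above.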