Gene Acid345_1692 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1692 
Symbol 
ID4070475 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2052366 
End bp2053697 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content58% 
IMG OID637983700 
Productperiplasmic sensor signal transduction histidine kinase 
Protein accessionYP_590767 
Protein GI94968719 
COG category[T] Signal transduction mechanisms 
COG ID[COG4585] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0719701 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGAAAA CAGTAGACCA GCGCCGAACG CCCACCCGTG GGCTGCTCCT CGCGCTCGTA 
GTCATCCTTT GTTCGGTGCT GGTCTACTCC TTTTACATTC GAGTCCAGAT CAAACATCTG
CGAGCGGTGC AAACTGATCT CGTGGATCGT AATCGTCGCG ACTCGCTGCA ACTGTTGCGA
ATCGAGAACA ACCTCAATGC GCTCGCCTTG GCGATGCGCG ACATGCTGCA GGGCGATGAG
CCGTATCCCC TGACGGCGTG GGAGAGCCAA TTCCGGCGTT TACGCGTAGA TCTGGATGAC
GCCTTGCTGA AGGAAGATCA GCTGGCAGCC GCGCATCGCA CGCCTGAACA ACGGCAATAC
CTGGCGACTT CGGTTTCCGA CTTTTGGGCG GAGGTGGACC ATGTCTTCGC AGTCGCAAAC
GGCGGTGACG AGAAGGCCGC CCGTGCTCTG ATTCCAGCGC TGCAAACCCA TGAGGCTGGA
CTGAGTTCAA CGGTCTCCCG CCAATTGGTG GAAAACAATC GCGCAGAAGA AGAGGCGGCT
CACCAGATCG AATCAATTTA TGCGGGCGTC GAGAGGAACG TCTTCGTCTT TCTCGCGGCA
ACGCTGGCGG CGATCATTGT AACCGGCGTG CTAATCACGT TGTCGAACCG CCAGCTCTTT
GCAAGGCTCG CGCACCTTTC CGAACAACGC AGTGAACTGG CGCACAAGCT CATCGCGACG
CAGGAATCCA CGCTGCAATT CGTCTCGCGC GAATTGCATG ACGAGTTTGG TCAAATTCTC
ACAGCAATCG GTTCCCTCCT CGGTCGGGCC GAGAAGCAAG CGCCCGCAGG TAGCACCTGG
GCACGAGATC TGCACGAAGT CCGTGAGATG GCGCAGTCCA CTCTTGATAG CGTACGCAGC
CTCTCCCAAG CACTCCATCC CGTGGTGCTG GATGAAGCCG GTGTGGAACA CGCGATTGAC
TGGTACTTGC CGATGATGGA GCGTCAGAAC GCGATTACGA TCCACTATGG TAAGAGCGGT
TCTTCCCAAT TGGTGGGAAG CGGCGCGGGA ATTCACATCT ACAGAATTCT TCAGGAAGCC
CTGAACAACG TAGTACGACA TGCGAGCGTG CAGGAAGCGT GGGTGCGCCT GCAGTTTGAA
CCAGGGCGGC TCCTGCTCGA AGTCGAGGAC CACGGCAAGG GGTTTCAGCC GGACCCATCA
CGGCGTGGCA TTGGCGTAGT CGCCATGCGC GAGCGCGCAG AGCTACTAGG CGGTAACATT
AAATGGCTCC CTGCTCCGGG CGGCGGAACG CTGGTCCGGC TGGCAGTACC GAAGGAACGA
TTGAACGCAT GA
 
Protein sequence
MSKTVDQRRT PTRGLLLALV VILCSVLVYS FYIRVQIKHL RAVQTDLVDR NRRDSLQLLR 
IENNLNALAL AMRDMLQGDE PYPLTAWESQ FRRLRVDLDD ALLKEDQLAA AHRTPEQRQY
LATSVSDFWA EVDHVFAVAN GGDEKAARAL IPALQTHEAG LSSTVSRQLV ENNRAEEEAA
HQIESIYAGV ERNVFVFLAA TLAAIIVTGV LITLSNRQLF ARLAHLSEQR SELAHKLIAT
QESTLQFVSR ELHDEFGQIL TAIGSLLGRA EKQAPAGSTW ARDLHEVREM AQSTLDSVRS
LSQALHPVVL DEAGVEHAID WYLPMMERQN AITIHYGKSG SSQLVGSGAG IHIYRILQEA
LNNVVRHASV QEAWVRLQFE PGRLLLEVED HGKGFQPDPS RRGIGVVAMR ERAELLGGNI
KWLPAPGGGT LVRLAVPKER LNA