Gene Acid345_2442 details


Gene Information

Locus tag: Acid345_2442
Symbol:
ID: 4072876
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2886164
End bp: 2889256
Gene Length: 3093 bp
Protein Length: 1030 aa
Translation table: 11
GC content: 58%
IMG OID: 637984458
Product: periplasmic sensor signal transduction histidine kinase
Protein accession: YP_591517
Protein GI: 94969469
COG category: [T] Signal transduction mechanisms
COG ID: [COG4585] Signal transduction histidine kinase
TIGRFAM ID:
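The start, end, gene length, and protein length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2886164, 2889256)
print(length, protein_length(length))  # 3093 1030
```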


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.223882
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.750566
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTAGTA CTGGCAGAGT CAGGATCCAA CCGTCGTTGG TTGTTTGGTT CCTGTTGTTG 
ATCTCAGCGG GCTCCTCCCT GTTCGCGTTA AATCCTGACC TCAGCATCTC GCAGTACGCG
CACTCCACCT GGCGGGTCCA GGATGGAGCT TTCCGCAGCG CGCCGAACGC GGTCGCCCAA
ACCAAAGACG GCTATCTCTG GATCGGCACT GAAGGCGGCT TGGTGCACTT CGATGGCGTG
CGCTTTGTTC CATGGGTGCC TCCCGCGGGC GTGAAGCTGC TTGACCCCAG GATCTTTTCG
CTCATGGCAG CAAGCGACGG CAGCTTGTGG ATCGGGACCG GCTACAGCAT TTCGCACTGG
CGCCGCAACG AACTCATCAA TTATTCACAA TTGAGTGGAA GAATTGAGGC CATCGCCGAA
GACCACGACG GCACGGTGTG GTTCGTTCGG ACGCAGATTA CCGATGGCGG CGGACCGGTG
TGTCGCATCA CCAACGATCA GCCACAGTGC TTCGGGAAAG CGGACGGCAT TCCCTTTCCC
ATCGCGGTGC AATTACGAGT GGGAAATTCC GGAGAGCTCT GGGTTGGCGG GTATTCCGAG
CTTTGCCGCT GGAAGCCGGC CTCGTTATCC AGTGACTGCT TTGCGAAAGG TTCGCAGGTG
CCGGAGACAT TTGCTTCCAT CAAAGCGATA GCCACCGGAA ACGATGGAAC AGTTTGGGTA
GCGCGCGAAC GTCCTGGGAG TTTCCTCCAA CTGGAACGAT TTGCGCAGGA GAAGTGGACG
ACGCTGTCTT ATCCGGAAAT TGCGATCAAC AACTCGGACG TAACGACCTT ATTTGTTGAC
CGCGACAACA CCATCTGGGT CGGGAGCGCG AACCACGGCG TATTTCGCAT TGTGGGTAAT
ACCGTGCGCA GCTTTGGACG CACCGACGGC TTGTCGAGCG ATGCGGTTGG GCGCTTCTAC
CAGGATGTGG AAGGCACGGT GTGGGTAGTG ACCTCGGCCG GTATCGATAA CTTCCGCGAC
TTGAAGGTCG TGACCTATTC CATGCGCGAG GGTCTCACTG CAGCCGGCGC TGGGACAGTG
TTGGGGACGC GCGACGGCAC GGTGTGGATC GGCAACTTCC ACGCACTCGA TTTCATGCAA
GGCAGCAAGC TGTCGTCAAT CCGCGCTGGG AACGGTCTCC CGGGCCTTTA CATCACAACG
TTTTTCGAAG ACCACGCCGG TCGCCTTTGG GTTGGCATTG ACGATGGCCT TTGGGTTTAC
GAGAACCAAA CGTTTCGTCC CGTCCGCCAT GCTGATGGCA GCAAGCTCGG CATTGTCTTC
TCCATAACGG AAGACACCCT GCATAACATT TGGGCACGAG CCGGGAAGAA CCTCGATCGC
ATTGCGGATT ACCGGCTGCA GGAAGAGACG ACCTCTCCGC AGATTTCCAC GTCGTACATT
CTCGCCGCGA GTCCGCAGGG CGGCATCTAT CTCGGACTGG TGAGTGGCGA CCTAGTGCAA
TATGACGGCG GCAAGTCGCA GACCTTCGCG TCGAACGAAG TAGGAAACAC GCGGCAAATC
CGCGATCTCC TGGTGGAACC CGACGGCTCC GTCTGGGGTA CAACGCTCGA TGAAATCGTT
CGCTGGAAAA ACGGGGAGCG CAAGAACCTC ACCACGCGCA ATGGACTTCC CTGCGATGAA
ATCTTCGCGC TGGTGGAAGA CTCGCGCGGT TCGCTCTGGA TCGAGTCGAA GTGTGGGGTG
ATTGAAATCG AGCGCGCGCA GCTCGATGCC TGGTGGGAAC ACCCTGAAAC CGTGGTGAAG
TTCGGGCTGC TCGATGGCTC CGATGGAATG CAGGCGGGGC TTACTCCGCT CAAGCCGCAG
GCAACGCGCT CCGCCGACGG AAAGTTATGG TTCGTGAATG GACGCATCCT TCAAATGCTC
GACCCGAACC ATCTGCAGAG GAACCCGGTA CCACCGCCGG TGCAGATCGA AGAAATTGTT
GCCGACCACA AGAGCCATTC GCCACAAGCG GGCCTGCGCT TGCCGGCGCT CACGCGCGAT
CTCGAGATTG ACTACACTGC TCTGAGCTTC GTCGCTCCTC AGAAAGTCCA ATTCCGCTAC
ATGCTCGAAG GACGCGACAC CGCATGGCAG GAAACCATGA CGCGTCGCCA GGCGTTCTAC
AACAATCTCG GTCCGGGCCA CTATCGCTTC CGCGTGATGG CGTCGAACAA CGACGGCGTA
TGGAACGAGG CTGGCGCTTA TCTCGATTTT TCGATCTTGC CCGCGTATTA CCAAACGGTT
TGGTTCCGAC TGCTTTGCGC CATCGCGTTT CTCATGGTGT TGTGGTCCAT CTTCCAGATC
CGCGTGCACC AATTGCGGCG GCAGTTTGAG ATCGGCGTGG AAGCTCGTGT CAACGAGCGC
ACTCGTATCG CGCGCGAACT CCACGACACC CTGCTGCAAA CGCTGCATGG GCTGATGTTC
CAATTCCAGG CGGTACGAAA TCTTTTACCG CGGCGTCCTG AGGACGCAAT GCGCTCGCTG
GATGACGCCA TCGTTGAGAC GGAGAAAGCG CTCGCGGAGG GTCGCAACGC GATCCAGGGC
ATTCGCTCCG AATCCCGCGA TGGCGACGAT CTAGCGGAAT TCCTGAAAAA CGCGAGCAAG
GACTTCGCCA GTACCGCGAA GCCCGGAGAA TCTCTGCCGA CCTTCGACTT GATCGAAGAA
GGACAGCGGC GCTCGGTCTC GTCGGACGTG AACAATGAGG TCTGCCGCAT TGCGCTCGAG
TTGCTGAGAA ATGCGTTCCG CCACGCGCAG GCAACGCGCA TTGAAGCGGA GATTCGCTAC
GACGCGCAGA TGCTGCGATT ACGAATCCGC GACAACGGTA AGGGCATCGA TCCTGTAGTG
CTCCGCGAGG GTGGCGTTGC CGGACATTGG GGATTGAAAG GTGTGCGCGA GCGTGCCGAG
CGCATTGGCG CAAAGATTGA ATTCTGGAGC GATGTCGGTC TCGGCACTGA AATCCAAGTG
ACCGTGCCGG CGGGAGTCGC ATACCAGGCA GAATCCGAAG AACGTTTATT ACAGTCGAAT
CCCAGGACAA AGAGCCGTGC CAAGCAATCA TGA
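The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any per-base statistic such as GC content is computed. The sketch below shows one way to do that; `gc_content` is a hypothetical helper name, and it assumes the input contains only A, C, G, and T plus whitespace.

```python
def gc_content(grouped_dna: str) -> float:
    """Fraction of G and C bases in a whitespace-grouped DNA string."""
    # Remove the spaces and line breaks used for display grouping.
    dna = "".join(grouped_dna.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# Small illustrative input, not the full gene sequence.
print(gc_content("ATGC GGCC"))  # 0.75
```

Applied to the full sequence above and rounded to a whole percent, the result can be compared with the GC content field in the gene information.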
 
Protein sequence
MISTGRVRIQ PSLVVWFLLL ISAGSSLFAL NPDLSISQYA HSTWRVQDGA FRSAPNAVAQ 
TKDGYLWIGT EGGLVHFDGV RFVPWVPPAG VKLLDPRIFS LMAASDGSLW IGTGYSISHW
RRNELINYSQ LSGRIEAIAE DHDGTVWFVR TQITDGGGPV CRITNDQPQC FGKADGIPFP
IAVQLRVGNS GELWVGGYSE LCRWKPASLS SDCFAKGSQV PETFASIKAI ATGNDGTVWV
ARERPGSFLQ LERFAQEKWT TLSYPEIAIN NSDVTTLFVD RDNTIWVGSA NHGVFRIVGN
TVRSFGRTDG LSSDAVGRFY QDVEGTVWVV TSAGIDNFRD LKVVTYSMRE GLTAAGAGTV
LGTRDGTVWI GNFHALDFMQ GSKLSSIRAG NGLPGLYITT FFEDHAGRLW VGIDDGLWVY
ENQTFRPVRH ADGSKLGIVF SITEDTLHNI WARAGKNLDR IADYRLQEET TSPQISTSYI
LAASPQGGIY LGLVSGDLVQ YDGGKSQTFA SNEVGNTRQI RDLLVEPDGS VWGTTLDEIV
RWKNGERKNL TTRNGLPCDE IFALVEDSRG SLWIESKCGV IEIERAQLDA WWEHPETVVK
FGLLDGSDGM QAGLTPLKPQ ATRSADGKLW FVNGRILQML DPNHLQRNPV PPPVQIEEIV
ADHKSHSPQA GLRLPALTRD LEIDYTALSF VAPQKVQFRY MLEGRDTAWQ ETMTRRQAFY
NNLGPGHYRF RVMASNNDGV WNEAGAYLDF SILPAYYQTV WFRLLCAIAF LMVLWSIFQI
RVHQLRRQFE IGVEARVNER TRIARELHDT LLQTLHGLMF QFQAVRNLLP RRPEDAMRSL
DDAIVETEKA LAEGRNAIQG IRSESRDGDD LAEFLKNASK DFASTAKPGE SLPTFDLIEE
GQRRSVSSDV NNEVCRIALE LLRNAFRHAQ ATRIEAEIRY DAQMLRLRIR DNGKGIDPVV
LREGGVAGHW GLKGVRERAE RIGAKIEFWS DVGLGTEIQV TVPAGVAYQA ESEERLLQSN
PRTKSRAKQS