Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3519 |
Symbol | |
ID | 4072778 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 4159954 |
End bp | 4162917 |
Gene Length | 2964 bp |
Protein Length | 987 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637985542 |
Product | multi-sensor signal transduction histidine kinase |
Protein accession | YP_592594 |
Protein GI | 94970546 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase [COG2203] FOG: GAF domain |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGGAT TAAATCCTGA ACTCGCTGCG CACGTCGATG CGTCCGAACA CGCCATTCGT CTTCTCGCGC GCGAAAACAA AATTCTCGAG TTCATAGCGC GCGGCGCCTC TCTGGAGAAG GTGTTAACAG AGATTGCGCA CGCCGCCGAG GAGTACTGCG ACACCGAAGT TCGTTGCTCG ATCCTCTTGC TCGACGAGTC AGGAACGCGC TTGATGCACG GCGCTGCGCC TAGCCTTCCC GATACATATA ACAAGCTGAT TCATGGCACC GCGATTGGTC CCGAAGTCGG CTCCTGCGGC AAGGCCGCGT TCTGCAAGAA GCCGGTTTTC GTCGAAGACA TTGACACTGC GCCCTCCTGG TCCGCGCTGA AGCACCTCGC GTTGCCGCTT GGACTGCGCG GCGCCTGGTC CATGCCGATC TTTGCATCGA ATGGCGATGT TCTTGGCACG CTCGGTATCT ACACCCTTGA ACCTGCGTCA CCGACCGACC GGGCGCGCCA GGCTATCGAT CTTCTCGCTC GCACTGCGGG CATCGCAATC GAGCGTCATC GCTCCGAATC GCAGCGGCTG CGCTATCAGA AACAAATTGA GACGCTCAAC GACACGGGCA TTCTCCTTGC TGCCGAGCGC GACCTCCACA AGATCGTGCA GGCGGCAACG GACACCGCGC GAGAATTCAG CGGTGCCGCT TTCGGAGCGT TTTTCTACAA CGAAATCCGC GCCGATGGCG AGAGCTACAT GTTGTACACG CTCTCTGGCG CACCCCGCGA AGCCTTCGAG AAATTTCCCA TGCCGCGCAA TACCGCCGTC TTCGGCCCAA CGTTCGCCGG CGAAGGCACG GTGCGGCTGG CGGATGTGCG CAAGGATCCG CGATATGGCA AGAACGCTCC CTACCACGGC ATGCCGGAAG GGCACCTGCC CGTGTGCAGC TATCTCGCGG TGCCGGTGGT TTCGCGCTCG GGGAAGGTGC TCGGCGGACT GTTCTTTGGC CATTCTGAGC CGAACCGGTT CACCCTGGAA GCACAGCACC TGGTCGAGAG CATCGCCGCG CAGGCAGCGG CCGCCATCGA CAACGCACAG CTCAACGATC GTATTGCGAG ACAGTTGGCT TCATCCGAGG AAGTGCAGCA GCGGCTCGCT ATCGCCCAGC AGGCGGCGCA GCTCGCGACC TGGGAACTGG ATTTCCGTAC CGATGAAATC CGGTTCTCGC CGGGCAGCTG GCCGGTGTTC GGGTGCGACC CGTCGGAGAT CAAGAGCCGC GCCGATTGGG AGCGTCAAAT CCATCCCGAC GACCGCGACA TCGTCCGCAA CGAGCTCGAG AGCTGCGTCC AGAATGCCAA AGCCTACTTC GTGGAGTATC GCGTCCAGAG CCCGCTCGGT GTGCGATGGG TGCAGGGGCG CGGCCACGTG GTGTATCACG CGGAAACCGC GCGCCCGGAG CGCCTGATCG TCCTCAGCAT CGACATAACC GAACGCAAGC TTGCCGACGA AGCCCTGCGC ATCAGCGACC AGAAGTTCCG CGAGGCGCAG AAGGCCGCCA ACATGGGCAC CTGGTTCTGG GACATCCCGA CCGACAAAGT CACGTGGTCC ATGGAGGTCC CTTCCTTCGA CGCTGCAGTC TCCGCCGACC GGCTGAAGAA CTGGGTGAAC GCGGTCCACC CCGACGACCG CCCCGCCGTT GTTGCCGAAC TCGATCGTGC GCTTCGTCAG GGCGGCCCCT TCAAGATCGA GCATCGCCTT ATCAGGCAAG ACGGCGCTCA GCGGTGGTCG TTCACGCAAG GCCAGATCAT GCTGGGTGAG GATGGCACAG CGCTTTCGGG TCTCGGCATC ACCATGGACA CGACCGCGCG GCGCGAGGCG GAGACGGAAC TCAAGCGTGC CGAGGAGCGC TTCAACCTCG CCGTGGACGC CGCTGACCTG GGCTTCTGGT ATTGCGACTT GCCGTTCGAC GTTCTCGGTT GGGATGAGCG CGTGAAGGAG CACTTCTGGT TGCCGCCCGA CGCCAAAGTC ACGGTCGACG ATTTCTATCG GATCCTCCAT CCCGAGGACC GCGAACGAAC CCGCCAGGCG ATCGAAACCT CCATCAACCA AAAGAAGCGT TACGACGTTG ACCATCGCGC TGTCTCGCCG ACAGGCGAGG TGCGCTGGGT GCGCGCGGTG GGCCGCGGGT TCTACGACGA GACCGGGAAC CCGGTGCGCT TCGACGGCGT GACCATGGAC ATCACCGAGC GGCGCAAAGC GGAAGAGGCG TTGCGCAGTT CCGAGAAGCT CGCCGCGACC GGACGCCTTG CTGCGACCAT CGCGCACGAG ATCAACAATC CTCTCGAAGC GGTCACGAAC TTCATCTATC TCGCCAAGAC GACCGACGGC GTCTCAGACC AGGTTCGCTC CTATCTGGAG ATTGCTGACC AGGAACTGGG CCGCGTATCG CACATCGCGC GGCAGACGCT CGGCTTCTAT CGCGACAGCA GCGGCCCAAT CCTGATGAGC GTTCCCGACA TCGTGCAGGA CGTCGTGAAC CTCTACCAGC GCAAGCTGCT CTACAAGTCG CTCGAACTCA AGCTCGACGT GCAGTCGGAC CTCACGATCC GCGGCCTCGC CGGCGAAATG CGCCAGGTGC TTGCGAATTT GCTCGTGAAC GCGATCGACG CGTCGAACGA CGGCGGCCGC ATCTGGATCC GAGCGCGCCG CGTGGTAGAC CTCAAGACCG GCGGCAAAGC GGTGCGGCTC ACCGTGGGCG ATTCAGGCAT CGGCATGAAT GAAGAAGTTC GCAAAAAACT TTTCACGCCG TTCTTCACCA CCAAGTCCGA CGTCGGCACC GGCCTCGGCC TGTGGGTCAC GCGCGGCATG GTCGAGAAAG CGAAGGGAAG AATCCGGGTG CGCAGCCGCC AGGGGATTGG CACCGTTTTC TCCATGCTGT TTCCGTCAAC GAAGTATCCG CCGCCGTCGG TGCAGCCGGC GTGA
|
Protein sequence | MSGLNPELAA HVDASEHAIR LLARENKILE FIARGASLEK VLTEIAHAAE EYCDTEVRCS ILLLDESGTR LMHGAAPSLP DTYNKLIHGT AIGPEVGSCG KAAFCKKPVF VEDIDTAPSW SALKHLALPL GLRGAWSMPI FASNGDVLGT LGIYTLEPAS PTDRARQAID LLARTAGIAI ERHRSESQRL RYQKQIETLN DTGILLAAER DLHKIVQAAT DTAREFSGAA FGAFFYNEIR ADGESYMLYT LSGAPREAFE KFPMPRNTAV FGPTFAGEGT VRLADVRKDP RYGKNAPYHG MPEGHLPVCS YLAVPVVSRS GKVLGGLFFG HSEPNRFTLE AQHLVESIAA QAAAAIDNAQ LNDRIARQLA SSEEVQQRLA IAQQAAQLAT WELDFRTDEI RFSPGSWPVF GCDPSEIKSR ADWERQIHPD DRDIVRNELE SCVQNAKAYF VEYRVQSPLG VRWVQGRGHV VYHAETARPE RLIVLSIDIT ERKLADEALR ISDQKFREAQ KAANMGTWFW DIPTDKVTWS MEVPSFDAAV SADRLKNWVN AVHPDDRPAV VAELDRALRQ GGPFKIEHRL IRQDGAQRWS FTQGQIMLGE DGTALSGLGI TMDTTARREA ETELKRAEER FNLAVDAADL GFWYCDLPFD VLGWDERVKE HFWLPPDAKV TVDDFYRILH PEDRERTRQA IETSINQKKR YDVDHRAVSP TGEVRWVRAV GRGFYDETGN PVRFDGVTMD ITERRKAEEA LRSSEKLAAT GRLAATIAHE INNPLEAVTN FIYLAKTTDG VSDQVRSYLE IADQELGRVS HIARQTLGFY RDSSGPILMS VPDIVQDVVN LYQRKLLYKS LELKLDVQSD LTIRGLAGEM RQVLANLLVN AIDASNDGGR IWIRARRVVD LKTGGKAVRL TVGDSGIGMN EEVRKKLFTP FFTTKSDVGT GLGLWVTRGM VEKAKGRIRV RSRQGIGTVF SMLFPSTKYP PPSVQPA
|
| |