Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1771 |
Symbol | |
ID | 4072831 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 2145208 |
End bp | 2147391 |
Gene Length | 2184 bp |
Protein Length | 727 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637983779 |
Product | CheA signal transduction histidine kinases |
Protein accession | YP_590846 |
Protein GI | 94968798 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0643] Chemotaxis protein histidine kinase and related kinases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCTTCT TCTCCGACGA GCGGGCAGCC GAACTCCGGG ACTTGTTCTT CGAGAGCGCG CAAGAACTTC TGCAAGCGCT CAACGACGAA GGTCTTGAGT TGGAGCGCCA CCCCAACGAC GCGGAGATCG TGCGCGACAT CCGTCGCACC GTGCACACGC TCAAGGGCGA CTCGGCGGCC TGCGGCTTTA AAGAACTCAG CGAACTGGCA CACGAACTCG AAAACGCTCT TACTCCCGAA ATCACGCACG CGGCGAGCGG CAACCTCGTC GAACTCGTGC TGAACGCTGC CGACACCTTC GATTCCCTGC TCACCGCATA TCAGCGCAAT TCAGCGCTCC CGGCCACCGA TCACGTCCGC AACCTGATTA AGCGGTTGAC CGAGAAGCCG CAAGCACTCG ACGGCGGCGC GCTAAAGCTT GAGTTCATGT GGAGCGAGTA CGAACAATTA CTCGCTGAGA AGGGCGCATC GTCTGGCGCA CGTCTGTTTA ACATCGCGAT TGAAGTGGAC CCGCAGTGCC CCATGCGCAT GGCCGCTCTG CAACTCGTTA AGAACGTGCT GCAGGGGCTC GGCGTCACCA TCACGATTCA GCCGGAAGGC GACATGCTGC CGGAAAACAT CTCGACGATT CGGGCCATCC TTGCGACCTT CCAGGCGCCC GAGAACATCG AGAAGAAGAT CCAGATTCCG GCGATTACGC GCCGCACTTT GCTCGAAGAC TATCTGCGCC CCGGACAACC GCGCGAAGGC ATTCTCTCGC GCGCGACCGA AGCACCGAAA GAAACTCCTA AGGCCGTCGC TCCGCCGCCC AAGCCCGCGC CGCCGGCGCC TGAGCCGAAG AAAGTCGAGC CGCCCAAGCC CGATCTCAAA ACCGTACCCG CTCTGCTCAC CGTCGAAGAG CACGACGGCG ATGCGCACGC CGATGCGCCG TCGCCTAATC CCTTTGCGAT CACGCCTGAG AACCTGCTGC GCGTGGATGC CGATCGCATT GATACCGTAC TCAATCTCGT CGGTGAGTTG ATCATCAACA AGTCGATGCT CAACCAGACG CTCTCCGATT TCGGCAAGCA GCACGCGAAA GATCCGTTGA AGGCCCGCTT TGCCGACGCG ATGGCGTTCC AGGCGCAGAT CCTGAATGAG CTCCAGCGCG CGGTGATGAA GATCCGCATG GTGCCGGTGG AGCAACTCTT CCGGCGCTTT CCTCGCATTG TTCGCGATGT CGCTCGCTCC AGTGGCAAGG AATGCGATCT GATCATCAGC GGCCAGAACA CCGATCTCGA TAAGAGCATT CTCGACGCGC TCTCTGAGCC GATGATGCAT CTAATTCGCA ACGCCGTGGA CCACGGCATC GAGCTTCCAG CCGCGCGTAT CGCCGCCGGC AAGTCGGTCA AAGGAACACT GAAACTCAAT GCGTTCTACC AGGGCAACCA GGTCGTCATC GAACTCACCG ACGACGGTGC CGGCATTGAC CGCGATCGCG TGGTTGAAAA AGCAATCGAG AACAACATCG TCTCGGAGAA AGAAGCCGAA AAGCTGACCG ATCAGGACGC GCTGAACCTG ATCTTCCGAC CCGGCCTGAG CACCGCGACC CAAGTCACTG AGATCTCCGG CCGCGGTATG GGCATGGACA TCGTAGAGAG CGTGCTGCGT CGCCTGAAGG GCTCCATCGG GATCCAGACT GAAAAAGGGC GCGGAACGAC TTTCCAGTTG CGCGTTCCGC TGACTCTGGC GATCATGCAG GCACTGCTCT TCCGCGCAGC CAACCGGCTG TATGCGGTGC CCCTCGGCTC GGTGGTCGAG ATAGCTCGCG CCACCTCGCA GCATATCCAC GTCGTGGACC ACCACGAGGT GTTGCAGCTT CGCGAACAGA TCGTCACTCT GGTTCGTCTG GACAAGCTGG AGGGCCGCAA GCGCCAGCCG CAGTCGGAGA AGCACAAGGT TTTCGTAGTC GTAGTACAAC TGGGCGACCG CCGCTTTGGG ATGGTCGTAG ACAAACTGGT TGGGGAAGAA GAACTGGTCA TCAAGGCCCT GGACGACAAT CTCGTCGCAA CCGATTTGGT TAGCGGCGCT TCCATCCTTG GTGACGGTAA GGTCGTTTTG ATCCTAAACG TTGCTTCTGT GGTCGAGCGC CTTGGGCGCG CGCCGAGCGG CAACGGGTCA AGAAAATTGG GAGCCTCGGC TTGA
|
Protein sequence | MSFFSDERAA ELRDLFFESA QELLQALNDE GLELERHPND AEIVRDIRRT VHTLKGDSAA CGFKELSELA HELENALTPE ITHAASGNLV ELVLNAADTF DSLLTAYQRN SALPATDHVR NLIKRLTEKP QALDGGALKL EFMWSEYEQL LAEKGASSGA RLFNIAIEVD PQCPMRMAAL QLVKNVLQGL GVTITIQPEG DMLPENISTI RAILATFQAP ENIEKKIQIP AITRRTLLED YLRPGQPREG ILSRATEAPK ETPKAVAPPP KPAPPAPEPK KVEPPKPDLK TVPALLTVEE HDGDAHADAP SPNPFAITPE NLLRVDADRI DTVLNLVGEL IINKSMLNQT LSDFGKQHAK DPLKARFADA MAFQAQILNE LQRAVMKIRM VPVEQLFRRF PRIVRDVARS SGKECDLIIS GQNTDLDKSI LDALSEPMMH LIRNAVDHGI ELPAARIAAG KSVKGTLKLN AFYQGNQVVI ELTDDGAGID RDRVVEKAIE NNIVSEKEAE KLTDQDALNL IFRPGLSTAT QVTEISGRGM GMDIVESVLR RLKGSIGIQT EKGRGTTFQL RVPLTLAIMQ ALLFRAANRL YAVPLGSVVE IARATSQHIH VVDHHEVLQL REQIVTLVRL DKLEGRKRQP QSEKHKVFVV VVQLGDRRFG MVVDKLVGEE ELVIKALDDN LVATDLVSGA SILGDGKVVL ILNVASVVER LGRAPSGNGS RKLGASA
|
| |