Gene Information |
Locus tag | Acid345_3357 |
Symbol | |
ID | 4071275 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 3981390 |
End bp | 3984302 |
Gene Length | 2913 bp |
Protein Length | 970 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637985379 |
Product | multi-sensor signal transduction histidine kinase |
Protein accession | YP_592432 |
Protein GI | 94970384 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
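The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is a minimal illustration, assuming that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; both assumptions match the listed values but are not stated in the record. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 3981390
END_BP = 3984302


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS.

    Assumes the CDS length is a multiple of three and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2913, matching "Gene Length"
print(protein_length(length_bp))  # 970, matching "Protein Length"
```

The strand field does not affect these lengths; it only determines which strand of the replicon the coding sequence is read from.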
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.984464 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAGTG AGATGGCAGT AGATTTGCAC ATGGATTCGC CGGAGATTAC CGTCCTTCCC ACCAGCTCTG GCAAAGGACG GAACCGGCCC ATGCTGAAGG AGAATCAAGT CCGCTTCGTT GCCGCCGTTG TGGCGCTGCT GACCGCGACT GCAATCGTAT TCTCCTTCAT TAACTTCCAG AAAGAGCGGG AGTTCGAGAC CCCTACCGAC GGCGTCTGGT GGGTGGAATC CGGCGGCCAT CTCAAGGCCG AAAAGGTCGA GGCCGACGGC CCCGGCGAGA AGGCCGGTAT CAAGCAGGGT GACGTCCTGA TCGCCATCAA CGGCGTCGAT ATTACCCGCA AAGCCGTGCA AGTGCGGCAG ATTTACCGCA CCGGAAGCTG GTCAAAGGCT ACTTTCTCGC TGACCCGCAG CAATGTGCCG ATCGAAGTCC CGGTGATTCT CACTCCGGCC GATCGCTCCC TGAATGGCGG CTTGCGCCTC ATCGCGCTGA TTTACCTTGG CATCGGCATC TACGTTCTGT TCCGGCGCTG GACGGCCCCA AAGGCGACCC ATTTCTACCT GTTCTGCCTG GCGTCGTTCG TTTACTACTC GTTCCACTTC ACCGGTAAGC TGAACCAGTT CGACTGGATC ATCTACTGGT CAAACGTGAC CGCATGGCTG TTACAGCCGG CGCTCTTCCT GCATTTCGCG CTCACATTCC CCGAGACCAA AGACGTAGTA AAGCGACACA AGTGGCTGGT GCCTGCTGTC TATGCGGTGC CCGCCGCGCT GCTTTCGCTG CATATCGTGG CGCTGAATTT CCTGCGCCCC AGTGAAGTCC TGCGCTGGAA CCTCGACCGC GGCCAGATGC TGTATCTCGC GGTGTATTTC GTGGCTGCGA CCGTGGTGCT CTGGCAGACC TACGCGAATG CCGCCTCGCC CATCCTGCGT CAGCAGATGA AGTGGGTGAC CCGCGGCACG TTCATGGCGA TCGCGCCGTT CACCATCTTC TACGTCATCC CGTACCTGCG CGGCAGCCTG CCGACCGCGG CCATGAAGAT CTCGGTGCTC TCACTGATCT TCCTCCCGTT GACCTTCGGC TACGCCATCT TCCGCTATCG CCTGATGGAC GTGGACCTCA TCTTCAAGCG CGGCATGGCC TACACACTCG CCGCCGGCAC GATCACCGGT ATTTACTTCA TGGCGATCGG CGGGGCCTCG GAAATGTTCC ACAAGAACTT CCCGAGCGCC GGCCCAGCCG GATTGATGGC GGCGATCGTC GTCACGGCCT TGCTCTTCGA TCCGTTCAAG AACTGGATCC AGGACAAGCT CGACAAGTTC TTCTATCGCA AGCGGTATGA CTACCGTAAG ACGCTCATCG AATTCGGCCG CGACCTGAAC TCCGAAACCG ATCTCGACAA GATGCTGGCG TCCATTGTGG ACCGCCTCTC GCGCACGCTG CTCGTCGATC GCATCGCCGT CTTCGTGCAC GATGAACAAA GTCGCTGGGT GCTGGCAAAG AGCTGCGGCA TCTCGCAGAC CACCGGGCTC GACATGAGCT TCATGAACGA AGAGCGTCCG GACATGGCTG CAGCCGGGCA CCTGTTCTTC GACAACACCA GCCAGGCAGT TCGCGAAAAT CCCGGCGCGC GCGAGACCAT TCGCCGCCTC GATCTGAACT ACTATCTGCC TTGCACGGTG ATGGGCCGCA CCATCGCGAT GGTCGGCCTT GGCAAGACCA CTGAGGGCGA CTTCCTCTCC AGCGAAGATG TAGAGCTGCT CGAGACGCTG GCCGGCTACA TCGGCATCGC GCTGCAGAAC 
GCGCGCCTCT ACCAGTCGCT CGCAGAAAAA ATCACCGAGT ACGAGCGGCT CAAGGAATTC AACGAGAACA TTGTCGAATC CGTAAGCGTC GGTGTGCTCG CCGTCGATCT CGAAGACAAG ATCGAATCGT GGAATGCGCA GATGGAAGTG ATGTACGCGC TGCCCCGCGC GGAGGCCCTC GGCAAGCGCC TCTCCGATGT CTTCCCGCTG AATTTCGTGG AAGAGTTCTA TCGCGTCCGC CAGGTTCCCG GCATTAACAA TCTCTATAAG TTCCGCATGG GCACTCCCGC GGGCGACACG CGCATTTGTA ACATCGCTAT CGCACCGCTG GTCACGCGCG ATTTCAACGT CATCGGCCGC ATCATCATCC TCGACGACAT GACCGATCGC GTCGAACTCG AATCGCAATT GGCGCAGGCA GAAAAGCTTT CGTCCATTGG ATTGTTGGCC GCCGGCGTGG CGCACGAAGT CAATACGCCA CTTGCGGTGA TCTCGTCCTA CGCGCAGATG CTCTCCAAGC AGTTGCAGGG CGATGAACGC CGCTCGGCGC TGCTCGAGAA GATCACCACA CAGACGTTCC GCGCCTCGGA GATCGTTAAC AACCTGCTGA ACTTCTCCCG CACCGGCAGC AGCGAATTTG CTGAAGTAGA CATCAACAAG GTTGTCAGCG ACACGCTCGC CTTACTCGAG CACCAGCTAA AGACCTCGCG CGTGAAGGTG GAAAACCACC TCGCGCCTAC GCTGCCCAAG ATCTACGGCA ACACCGGCAA GTTGCAGCAA GTGTTCCTGA ACCTCTTCCT CAACGCCAAG GATGCGATGC CCTCGGGCGG CACGCTGAGC ATCACCACGC GCAACGGCCG CGCGGTTGAG GTTGAGGTAT GCGACACCGG AAGCGGCATT GCGCCGGAAC ACATCCAACG CATCTACGAT CCATTTTTCA CCACGAAGAA ATCGCCGCGG CAGGGCCACT CCGGAGGCAC CGGCCTCGGA TTGGCTGTGA CCTACGGCAT TATCCAGGAA CATGCGGGCA AGATCCGCGT GGACAGCCGC CCCGGCGAGG GCACGCAGTT CACGATGGAG TTCCCCATGG TCAGGAAGGC TGTGAATGCC TGA
|
Protein sequence | MQSEMAVDLH MDSPEITVLP TSSGKGRNRP MLKENQVRFV AAVVALLTAT AIVFSFINFQ KEREFETPTD GVWWVESGGH LKAEKVEADG PGEKAGIKQG DVLIAINGVD ITRKAVQVRQ IYRTGSWSKA TFSLTRSNVP IEVPVILTPA DRSLNGGLRL IALIYLGIGI YVLFRRWTAP KATHFYLFCL ASFVYYSFHF TGKLNQFDWI IYWSNVTAWL LQPALFLHFA LTFPETKDVV KRHKWLVPAV YAVPAALLSL HIVALNFLRP SEVLRWNLDR GQMLYLAVYF VAATVVLWQT YANAASPILR QQMKWVTRGT FMAIAPFTIF YVIPYLRGSL PTAAMKISVL SLIFLPLTFG YAIFRYRLMD VDLIFKRGMA YTLAAGTITG IYFMAIGGAS EMFHKNFPSA GPAGLMAAIV VTALLFDPFK NWIQDKLDKF FYRKRYDYRK TLIEFGRDLN SETDLDKMLA SIVDRLSRTL LVDRIAVFVH DEQSRWVLAK SCGISQTTGL DMSFMNEERP DMAAAGHLFF DNTSQAVREN PGARETIRRL DLNYYLPCTV MGRTIAMVGL GKTTEGDFLS SEDVELLETL AGYIGIALQN ARLYQSLAEK ITEYERLKEF NENIVESVSV GVLAVDLEDK IESWNAQMEV MYALPRAEAL GKRLSDVFPL NFVEEFYRVR QVPGINNLYK FRMGTPAGDT RICNIAIAPL VTRDFNVIGR IIILDDMTDR VELESQLAQA EKLSSIGLLA AGVAHEVNTP LAVISSYAQM LSKQLQGDER RSALLEKITT QTFRASEIVN NLLNFSRTGS SEFAEVDINK VVSDTLALLE HQLKTSRVKV ENHLAPTLPK IYGNTGKLQQ VFLNLFLNAK DAMPSGGTLS ITTRNGRAVE VEVCDTGSGI APEHIQRIYD PFFTTKKSPR QGHSGGTGLG LAVTYGIIQE HAGKIRVDSR PGEGTQFTME FPMVRKAVNA
|
| |
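The gene and protein sequences above are printed in whitespace-separated blocks of ten characters. The following sketch shows one way to strip that layout, translate the nucleotide sequence, and compute a GC fraction. It assumes the gene sequence is already given in coding orientation (it begins with ATG and ends with TGA, although the gene lies on the minus strand), and it uses the standard codon assignments, which translation table 11 shares for elongation codons. Alternative start codons allowed by table 11 are not handled, and ambiguous bases such as N would raise a `KeyError`. All function names are hypothetical.

```python
# Standard codon assignments in TCAG order; translation table 11 uses the
# same amino-acid assignments and differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three display blocks of the gene sequence listed above.
first_blocks = "ATGCAAAGTG AGATGGCAGT AGATTTGCAC"
print(translate(first_blocks))  # MQSEMAVDLH, the first block of the protein sequence
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow the listed protein sequence and GC content to be compared against the nucleotide data.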