Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1494 |
Symbol | |
ID | 4071664 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 1816724 |
End bp | 1818472 |
Gene Length | 1749 bp |
Protein Length | 582 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637983503 |
Product | putative GAF sensor protein |
Protein accession | YP_590570 |
Protein GI | 94968522 |
COG category | [S] Function unknown |
COG ID | [COG5373] Predicted membrane protein |
TIGRFAM ID | [TIGR01352] TonB family C-terminal domain |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.980694 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGATT CTGTAACAAA GGACTCGGAA TTCGAGATCC AGTATCCGCC CTCAATACTC GAAGACGTCA CAAAACTCGT TCAACGGGTT CAGACGTTTA CGGGAGCGGC CGGCGCAGCG ATCGCGCTGC GCGAGGGGGA GGACATGGTT TGCCGCGGCA GCCGCGGCAA CAACGCCCCC GACGTTGGAA TGGTTCTAAG CACCGACGGA ACGTTCACTG GCCTGGCCGT TACCGGAATG AAAGCGGTGC GTTGCGACGA CACCGAAAAT GATTCGCGCG TCGACCCCGA AATCAGCCGC GCCCTCCGCA TCAAGTCGAT GGCCGTAGTG CCCGTACTGA GCGGAATGCG CGTCAGTGGC GTGATTGCGA CGTTCTCGAG CGCTGCCAGT GCCTTCAGCG ACACTCATAT GGCCGTCCTC AAGACGATGG CCGATGGCTT GGGCGCTTCG ATCCAGCGCT TCCTGGAAGT CCAGGGAATT AGCACCGGCG CACCGATGGT CTCCGCCGCG GCAGCAGCCG CACCGGCCCC GGTCGCCAGG CCGCAGGTTG CACCGCCACC TCCGCCGCCT CCGGCGCCCA AGGTGGAAGC ACCGCGTCCG GCACCGCCGC CACCACCGGC TCCGCCGAAA ATTGAAGCTC CGCGCCCAAC ACCGCCGCCT CCGCCGGCAC CGAAGATTGA AGCTCCGAAG CTGGCCCCGC CGGCGCCTGC TCCGATGCCA GTTGCCGCCG CTCCAGCGCC TGCAGTCGAG CGCGCTCCTG AGCCGCCGAA GCCACCGCCG CAGCCACCGA AGCGGCAGGA ACAGAAACAG CAGGGCAAAT GGAAACCGGT AGCGCCTCCG AAGCAGGAGG AAGAGGCTCC GGTGATCGAG AAGCCCGCGC CGAAGCCGGA ACCCAAGGCG CAACCGAAGC CGGAACCGAA ACCTGAGCCC AAACCGCAGC CGAAGGCCGA GCCGGTCATC GCTGCCCCCT CGTTCAGCTA CGAAGCGAAA ACCGAGGAAG GCGAAGGCGG CGGAAACAAG GGCATGATTT TCGGCGGCGT CGCAGCCGCA GTGCTGGTAA TCGCGATCGG CGGATACTTC ATGATGGGCA AGAAGTCGAG CCCAGCGCCT GCTCCGCCCA CTACTACGAC GCAACCGGCG CCTGAGAACA CCGCGAACAC AACGCCCGCC TCCAACGTAA CTGGAACGGT GACGACTGGA GCAAACCCCG CGGCCGCAAA CAACACCAAG CCGCAGGACC AGGGCAAGAA CAACAACAAC ACGACCGCCA GCAAGCCGGA AGAGCAGCAG CAGGCGAAGC CGGCAGCCGC TCCGCTGGTT GTAGGCTCCG CACCTTCGGC GAGCAACAAG CCGCAGCAGG TTGCAGATGT CTCCGCGCCC TCGCTGAACC TGGCAGGCGC AGCCGGTGCG GGTCCGAATC TTGACGTTCC GGTAACCAGC TCAGCGCCGA AGCTTTCCGC TCCTGCGCCG GCCAATGCCG TAATCGTTCC AAGCCGCCTG GTCCAGCGCG TGAACCCGAA CTATCCGCAG TCCGCCAAGC AGTACCGCAT TGAAGGCGCG GTAACGCTGA GCGCGACCAT CGGATCTGAC GGACACGTGA AGGACGCGAA AGTACTGAAC GGGCCGCCGA TGTTGCGCGA CTCCGCACTC AACGCAGTTC GCCAATGGAA ATATGCTCCT TCCACGGTGA ACGGGCGTCC AGTCGAATCG AGCGTACAGA TCGTGCTCCA GTTCAAGATG CCTAGCTAA
|
Protein sequence | MPDSVTKDSE FEIQYPPSIL EDVTKLVQRV QTFTGAAGAA IALREGEDMV CRGSRGNNAP DVGMVLSTDG TFTGLAVTGM KAVRCDDTEN DSRVDPEISR ALRIKSMAVV PVLSGMRVSG VIATFSSAAS AFSDTHMAVL KTMADGLGAS IQRFLEVQGI STGAPMVSAA AAAAPAPVAR PQVAPPPPPP PAPKVEAPRP APPPPPAPPK IEAPRPTPPP PPAPKIEAPK LAPPAPAPMP VAAAPAPAVE RAPEPPKPPP QPPKRQEQKQ QGKWKPVAPP KQEEEAPVIE KPAPKPEPKA QPKPEPKPEP KPQPKAEPVI AAPSFSYEAK TEEGEGGGNK GMIFGGVAAA VLVIAIGGYF MMGKKSSPAP APPTTTTQPA PENTANTTPA SNVTGTVTTG ANPAAANNTK PQDQGKNNNN TTASKPEEQQ QAKPAAAPLV VGSAPSASNK PQQVADVSAP SLNLAGAAGA GPNLDVPVTS SAPKLSAPAP ANAVIVPSRL VQRVNPNYPQ SAKQYRIEGA VTLSATIGSD GHVKDAKVLN GPPMLRDSAL NAVRQWKYAP STVNGRPVES SVQIVLQFKM PS
|
| |