Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_4155 |
Symbol | |
ID | 4072114 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 4919970 |
End bp | 4922123 |
Gene Length | 2154 bp |
Protein Length | 717 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637986186 |
Product | transcriptional regulatory protein-like |
Protein accession | YP_593229 |
Protein GI | 94971181 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0823] Periplasmic component of the Tol biopolymer transport system |
TIGRFAM ID | [TIGR02800] tol-pal system beta propeller repeat protein TolB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.297616 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGGGCT CGTTCCATAT CGCAGATTGG GAGGTCGAGC CTCAGATCAA CCAAGTAAGG CGTCAGAACC ACTCCTTCCA TCTTGAACCG AAGGTGATGC AAGTCCTGGT GGAGTTGGCC GCACACTCGG ATGAAGTTCT CTCCAAGGAA CATCTGATCC ACGCAGTTTG GTCGGACACG TTCGTCAGCG ACGACGTGTT GACGCGTTGC ATCTCGGAGA TCCGGCGAGT GCTGGACGAC GATGCGCGCG CGCCGAAGAT CATCCAGACG ATCCCGAAAT CGGGCTACCG GCTGATTGCG CCGGTGGTGT TCGACAAGGC CGCGCCTCCA AAGAATGGGT CGTCGAATGG GTCGACGGCA ACGGTTGAGG CGGCGACGGT GGAGACGCCT CCTCTGAAGA CCGCTGAGAC GCCGCGGAAG AACCTGGCGC CGTGGATTGT TGCGGTCGGC GTATTGGCGA TTGCCGTGGT TTACTTCGCG ATACGAGCTA CTCGTCCGCC GGTAGCCCAA CCGGGGCCGG AGGAGAGCTA CAGGACGATT CCCTTCACGT CGTATCCGGG ATCGCAGACG CAACCGGCGT TTTCGCCGGA TGGGAACCAA GTGGCGTTCG TGTGGAATGG CGAGGGCGGA GAGAACCGGA ACATCTACGT AAAGATCATC GGGTCGGAGA CACCGCTGCA GCTTACGCAC GACGCCGGGC AGGATTACAG CCCGACGTGG TCGCCGGATG GACGGTCCAT TGCGTTCCTA AGATATACGG ACACGGACCG CGGCATCTTT ATGGTGCCGG CGCTGGGTGG GGCGGAGCAC AAGGTTTTTA CGCCGACGGG GATCGTGGAG TGGGAGCGCA ATGCGCTCTC GTGGTCGCCG GATGGAAAAC GGCTGATCTT CTCCGATGGG AAGAGTGCGA ACTCGCCAAG CTCGATTTTC GAACTGGATC TGGAAACGGG GACGGCAAAG GCGATTACGT CACCACCGAA GTTGTGGGAC GGCGACAGCG GGCCGGCGTT CTCACCAGAT GGAAAGAAGA TTGCGTTCGT CCGCGGGTCT GAGGGATGGG TACAGGACCT GTACGTGATG GATGCGGCGG GCGGCGAACC AACGCGACTG ACGAACGATG GCCGCATGAT GGCGAGCATC AGTTGGGCGG CGGATGGGAA GTCGATTGTG TACTCGTCGA ACCGTGCGGG AAAGTTTTCG CTATGGCGGA TTCCGGCGAC AGGCGGCACG GCCGAGCGCA TTGGCGTGGG CACAGAAGAT GCATTCGGGC CTAGCGTTTC ACGGAATGGC GATCACTTGG CTTACACGCA GGGGAGTTCG ATTTACGGGA TCCACCGCCT GGATTTGAAA GCACCAAAGA GCGCGGCGAC GACGATCTTG AGTTCGACGC AGCAGGATTC TGCGCCGAAG CTTTCGCCGG ATGGAAGTCG CGTGGCGTTC CAGTCGTGGC GATCGGGGAC GCAGGAAGTT TGGGTTGCGG GCAGCGATGG GAAAAATCCG GAGCGGGTGA CTTCGTTTGA GAAGTCGCTG ACGGGGAGCC CTTCGTGGTC GCCGGATGGA AAGCAGTTGG CCTTCGACGC GCGGCCGGAG GGACGTTCGC ACATCTATGC GCTGCGATTG GATGGCGGGC AGCCGAAAGC CGTGACGGAC GGGGATTTCA ACGACATTCT GCCGAATTGG TCGAGCGATG GAAAGTGGGT TTATTTTGCG TCGAACCGCG GCGGGGCTTG GCAAATTTGG AAGGCGCCGA GCGAAGGCGG AACTCCGCAA CAGGTTACGA AGCATGGCGG ATTCGTTGGG CAGGAATCCT TCGACGGCAA GTGGCTGTAC TTCGCGAAGC CGGACGCGGT GGGACTGTTC CGGATGCCGG TAAGCGGTGG CGACGAACAG AAGATCATGA ATCAGCCGCC GGAAGCGTAC TGGGGATATT GGTCGCTGAC CCCGAATGGA ATTTATTACC TGAACGAGAC CGGCGGGAAG ATGTCGATTG AGTTCGCCGA ATTGGATGGG ACGCATCCGA CTCGCGTGCA TGTACTTGAA CGAGGGCTGC CGCCGTTTTC GGGGCTATCG GTTACGCCGG ATGGGAAGAC GCTCTTGTAC AACGATCGGG TTGAGGCGGG GAGCCATATT ACGCTGGTGG ATGGGTTCCG ATAG
|
Protein sequence | MKGSFHIADW EVEPQINQVR RQNHSFHLEP KVMQVLVELA AHSDEVLSKE HLIHAVWSDT FVSDDVLTRC ISEIRRVLDD DARAPKIIQT IPKSGYRLIA PVVFDKAAPP KNGSSNGSTA TVEAATVETP PLKTAETPRK NLAPWIVAVG VLAIAVVYFA IRATRPPVAQ PGPEESYRTI PFTSYPGSQT QPAFSPDGNQ VAFVWNGEGG ENRNIYVKII GSETPLQLTH DAGQDYSPTW SPDGRSIAFL RYTDTDRGIF MVPALGGAEH KVFTPTGIVE WERNALSWSP DGKRLIFSDG KSANSPSSIF ELDLETGTAK AITSPPKLWD GDSGPAFSPD GKKIAFVRGS EGWVQDLYVM DAAGGEPTRL TNDGRMMASI SWAADGKSIV YSSNRAGKFS LWRIPATGGT AERIGVGTED AFGPSVSRNG DHLAYTQGSS IYGIHRLDLK APKSAATTIL SSTQQDSAPK LSPDGSRVAF QSWRSGTQEV WVAGSDGKNP ERVTSFEKSL TGSPSWSPDG KQLAFDARPE GRSHIYALRL DGGQPKAVTD GDFNDILPNW SSDGKWVYFA SNRGGAWQIW KAPSEGGTPQ QVTKHGGFVG QESFDGKWLY FAKPDAVGLF RMPVSGGDEQ KIMNQPPEAY WGYWSLTPNG IYYLNETGGK MSIEFAELDG THPTRVHVLE RGLPPFSGLS VTPDGKTLLY NDRVEAGSHI TLVDGFR
|
| |