Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1501 |
Symbol | |
ID | 4069248 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1827378 |
End bp | 1830749 |
Gene Length | 3372 bp |
Protein Length | 1123 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637983510 |
Product | TonB-dependent receptor |
Protein accession | YP_590577 |
Protein GI | 94968529 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.175276 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.273267 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAGT CTTACCTGGC GATTCTGCTC GCCACGGTAT TGGTTTTGTT GTGCGCGTTT CCCGCCTTAG CACAATCCAA TACAGCAAAG TTGACCGGAA CAGTCGCAGA CGCGCAGGGT GCAGTCGTCC CCGGCGTCAG CATTACTGTC ACCAGCTCGA AAGGTCGTGC AACTACGGTC CAGAGCAATG CTTCCGGCTT CTACACGATT GCTGCGTTGG ATCCCGATAC CTATAACGTG GATGTGAAGC AGTCCGGATT CAAACCCATC ACGCAAAAGA TCACGCTTCA GACTGCCCAG CAGGCGGAAC TGAACTTCAC CATGCAATTA GGTTCGACCT CAGAGACCGT TGAGGTTACC AGCGATGTTC CGCTCGTCGA CGCCATCAGC TCGAACGTGA GCGCCGTCAT CGTCGGCCGC CAGATTACCG AGCTCCCGCT CAACGGCAAC AACTTCACGC AGCTTGCAAC CCTCGTCCCG GGCGTCACTC GCGGTATGCC GGCCAATCAA CAGTCTGGAG AGGGCAACCA GGCCGAAACA TTCCGCTATG CGGGCAACGG CGGTTCCGCC ATCTCCGTAA ACGGTCTTCG TCAGCAGGCG AACAACTTCC TGCTCGATGG TTTCGACAAC AACGAGAGCC TGGTGAACAC GATTGTGTTC TTCCCGCCGA CCGAGGCGAT CCAGGAATTC CGCGTTGACA CCAGCGTTGC GCCGGCAGAA TTCGGCCGCG CCGGCGGTGG CATCGTCAAC TCGTCTATCA AGTCAGGCAC CAACAGCTGG CACGGCTCTG CATTCGACTA TCTTCGCAAC AGCGTCCTGG ATGCGAAGAC CTACGCATTG GGTCCGAATG ACCCGAAGCC CCCCTTCCGC CGCAACCAGT TTGGCGGCGC AATTGGCGGC GCGATCATCA AGAACAAGCT TTTCATTTTC GGTGATTATC AGGGCCTTCG GCAGGCGCAG CCGGCCAGTC TCGACTTCGC GACTGTCCCG ACCGACATGA TGCGCAGTGG CGATTTCTCC GAATTGTTGG GACTGCAACG CGTGGACGGA ACGGCGGTAG CGCCCATCCA AATTCGCCAT GCGGGCGCCA ACAGTTCCGC CGATTACGTG AACTACGTTG GGAATATCAT CCCCAGCGCT GACTTCGTTG GCGCCGGCCA AGCTTATTTG AACGCCTATC CGGAGCCGAA TGTCTCCGCC ACGAACACAC ACTGCGGTCT GGTCTCGACC CTCGATGGCC TCTGCCTGCA GCAGAACTAC ATGGTCCAGC GGCAGCAGAT CCAGAACTTC GATGATTTCG ACATCCGTGC CGACTATGTC CTTCGCACCC AGGACACGAT GTTCCTGCGC TACAGCTATG GGCACGAAAC CGACGTCACG ACGTCGCGCC TGCCTGCACT GCCCGCCGGA TTTGGTTCCG GCGATCAGTT CAGTATGCCT CGCTCCTGGG CATTCGGTGA AACCCACACT TTCTCCCCAA ACATCCTGAA CGAGTTCCGC TTCGGATGGA TTCGCGCGAA CCTCGGCTAT CTGCCGCCGT TCGACGACCA GGCCTTGTCG GCAAATCTCG GAATTCCGAA CGCCAACACT TCGTCTCTGC TTGGAGGCGG CGCCCTTATC GGTGGTTACA ACGGTCAGCT GGAATACACC GGGGACTACG GTCCGTACCA GGTTCCGGAA AACACTTATC AACTCGCCGA CAATGTCAGC TGGATCAAGG GTCATCACAG CTTCAAGTTC GGCGCGAACC TAATGTGGCG ACAAGTCAAC CTCTTCCGTC CCAAGGCGGG CAAGGGTTAC TTCTTCATCT CCGGTAACGG TGGCGATGCC TTCTCGACCG GATACGAAGT CTCCGACGTT CTCGCCGGTT TCATGAACAA CTACCAGGTT GGTCCGGCAC TTGGATTCGC TGGGACCCGC AGCTGGGAAA ACGGCATTTT CGCTCAGGAC GATTGGCGCA TTACCAATCG GCTGACGTTG AACCTCGGCT TCCGCTGGGA TGTGCTCACA TGGCCCACCG AAGTCCACAA CCAGATGTCG AACTTCGACC TGAACCCCGG CAGCGGTACC TATGGACAGG TGGTTCTGCC TGGAACCCAC GGCTACAACG ATTCGTTCGT TCCGACCGAT TGGCATAACT TCGCTCCTCG CGTCGGTTTT GCCTACAACC TGTTTGGCAA CGGTAAGTCC GTGGTGCGCG GCGGTTATGG AATGTTCTAT TTCGTAGACC GTGGCGGCAT CGACAACCAG ATGGCGCAGA ACGCTCCGTT TGCGGGCGTC TCGCAATACA ACTACACCAA CGGCTATCGC TTCACGCTCG GCGGCCAGGC TCCACTGAAC TCAGTGGATC CGACGCAAGG CGGCGCGGTC GGCATGCCGG ACAAGAACGA TTTCAACATC GACTTCACCA ATCCGGCGAA CCTCTCGATG GTGTCGTACC TTAACCGCAA CGTACCGGCC AGTGTGCAGG AGTGGAACCT CCAGTTCGAA CAGGAACTTG GCGTCAACAC CTCCTTCATG CTCGCTTACG TGGGCAACAA GGGCACGCAT CTTACCTCGT ACCACGACTT CAACCGCCAG TTCTACAACC AGGCCAATGG CGACAAGAAC TTCCCAGCCA TGGGCAGCCT CACCGTTCAC GATACTAGCG GCAACTCAAA CTACAACTCG TTGCAGGCGC AGTTGACTCG CCGCATGACC AAGGGCCTGC AGTTCAACGC TTCGTACACC TGGGCCCACG CAATCGACGA TTCCCAGGGC GCATTCGACG CCAACAATGG CGTTGTCGAT TACTTCAACC TGGCGCATGA GCGCGCGAAC TCGTTGCTCG ACTTCCGTCA ACGCTTCGTG TTCAACGCTC TGTACGAGCT GCCTTTCGGC CACGGACGGC AGTGGGGCAA TAGCTGGAAC GGCGTCACCA ACGCGGTCCT TGGTGGATGG CAGTTCAACC CGATCCTGAC GCTCAGCAGC GGATCACCGT TCGACCTCGG TGGAATCGGT AACCCGCAGA CGCGTCCTGA CCTGGTCGGC CAGCTGCATC AGTTGAACAG CGTGAACGGC ATGTGGTTCG ACACCAGCGC GTTCGCGGTT CCCGCCTCGA ACGGCACCGG CGTCTTCCTC GCGCCTGGCA CCGCTCCTCG CAACCCGTTC ACCGGACCGG GAACCGAGAT CTTCGACTTC TCTGTCCAGA AGACGTTCGC GATCACTGAA CGCGTCCGGA CCGAGTTCAC GTCGCAGTTC TTCAACGCCT TCAACACCCC GCAATTCGCA CAGCCGGATG GCAACTTCTA CGATGGCAAC TTCGGTCGTG TGACCAACAC TCGCTTGAAC AGCGAACGCC AGGTCCAGTT CGGACTTCGC GTTCTCTTCT AA
|
Protein sequence | MKKSYLAILL ATVLVLLCAF PALAQSNTAK LTGTVADAQG AVVPGVSITV TSSKGRATTV QSNASGFYTI AALDPDTYNV DVKQSGFKPI TQKITLQTAQ QAELNFTMQL GSTSETVEVT SDVPLVDAIS SNVSAVIVGR QITELPLNGN NFTQLATLVP GVTRGMPANQ QSGEGNQAET FRYAGNGGSA ISVNGLRQQA NNFLLDGFDN NESLVNTIVF FPPTEAIQEF RVDTSVAPAE FGRAGGGIVN SSIKSGTNSW HGSAFDYLRN SVLDAKTYAL GPNDPKPPFR RNQFGGAIGG AIIKNKLFIF GDYQGLRQAQ PASLDFATVP TDMMRSGDFS ELLGLQRVDG TAVAPIQIRH AGANSSADYV NYVGNIIPSA DFVGAGQAYL NAYPEPNVSA TNTHCGLVST LDGLCLQQNY MVQRQQIQNF DDFDIRADYV LRTQDTMFLR YSYGHETDVT TSRLPALPAG FGSGDQFSMP RSWAFGETHT FSPNILNEFR FGWIRANLGY LPPFDDQALS ANLGIPNANT SSLLGGGALI GGYNGQLEYT GDYGPYQVPE NTYQLADNVS WIKGHHSFKF GANLMWRQVN LFRPKAGKGY FFISGNGGDA FSTGYEVSDV LAGFMNNYQV GPALGFAGTR SWENGIFAQD DWRITNRLTL NLGFRWDVLT WPTEVHNQMS NFDLNPGSGT YGQVVLPGTH GYNDSFVPTD WHNFAPRVGF AYNLFGNGKS VVRGGYGMFY FVDRGGIDNQ MAQNAPFAGV SQYNYTNGYR FTLGGQAPLN SVDPTQGGAV GMPDKNDFNI DFTNPANLSM VSYLNRNVPA SVQEWNLQFE QELGVNTSFM LAYVGNKGTH LTSYHDFNRQ FYNQANGDKN FPAMGSLTVH DTSGNSNYNS LQAQLTRRMT KGLQFNASYT WAHAIDDSQG AFDANNGVVD YFNLAHERAN SLLDFRQRFV FNALYELPFG HGRQWGNSWN GVTNAVLGGW QFNPILTLSS GSPFDLGGIG NPQTRPDLVG QLHQLNSVNG MWFDTSAFAV PASNGTGVFL APGTAPRNPF TGPGTEIFDF SVQKTFAITE RVRTEFTSQF FNAFNTPQFA QPDGNFYDGN FGRVTNTRLN SERQVQFGLR VLF
|
| |