Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3895 |
Symbol | |
ID | 4072230 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 4607876 |
End bp | 4611325 |
Gene Length | 3450 bp |
Protein Length | 1149 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637985919 |
Product | TonB-dependent receptor |
Protein accession | YP_592969 |
Protein GI | 94970921 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0561409 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGGAT CAAGAGCAAG GTACACACTT CTGTGCGCGC TGCTATTTCT GTGTTCCGCC ATGTTGTTTG GTCAGGCGGA GACAGGTTTG ATCACAGGCA CCGTCGTGGA TGTTTCCGGC GCAGTTGTCG GCGGAGCGAC GGTGACAGTG ACGGACGTGA ATACGGGCGC GCAGCGAACC GCTACCACCA ATAACGATGG TTCCTACACG GTTTCCAACC TCAAACCGTC GATGTACGAA GTCGTGATCG ACAAGCAAGG CTTCACCAAA TACACCCGCA GGATCGCGGT GACCGTCGGA TCAAGAAATG AACTTTCGGC GCAGATGAGT GTGATGGGCG GCGGTACTAC CGTCGAAGTG ACCGCGGAAT CAGGCGGCGC CGCGGTGAAC ACCGAAACCC AGACGCTTTC ATCGGTCGTG AGCGGCGCGC AGATCACCGA ACTCCCAACT CTCACCCGCA ATCCATACGA CCTCGTTGCC ACCGCCGGCA ACGTGACAGA AGACACCACC GGTACCATGC GCGGTGCAGG CTTCTCGATC AACGGTCAGC GCTCAGCATC AACCGATGTA CTGTTAGACG GTGGTGAAAA TGTCGATATG TTCACCGCTT CCGTCGGGCA GCAGGTTCCG CTCGATTCCG TGCAGGAATT TCGCGTTGTC ACCAGCAACT TCACCGCAGA ATATGGCCGC GCGGGCGGCG GCGTCCTGAA CGTCGCTACC AAATCTGGCG CCAATGCCTT CCATGGCACC GCGTATGAGT TCAACCGCAT ATCTGCGCTG GCGGCGAACA CCTGGGAGAA CGACACCAAC GATATCCCCA AGTCCACGTT CACGCGCAAT CAGTTCGGAT ATTCGGTGGG TGGGCCAATC ATCAAGAACA AACTGTTCTT CTTCTCCAAC ACCGAATGGA TCCGGGTCCG CAGCAGTTCG AACCAGATCG TCTCGATCAT TGATCCTGCG ATGTTCCCGA ACCTGGCTCC GAACTCGGTA GCTGCGCTGT CGTATGCTGA CGTGCGCTCG AACGCGACTT TGCTCGGCTC TACCTCATGC GCAGCCGATG CTCTGTGCTC CCCTCTGCTC GCGAGCAATG GCGGACCGTT GCCCAATGGC TCGCCGTTCA CCCAACAGTT GTCGTACACG GCGCCCGCAG AGGCAGGTGG CGGCCTTCCG GAAAACACCT GGATGACCGT CAACCGCTTC GACTACAACA TGACCGACAA AACCACCTTC TTCGGCCGCT ACGCTGGATA CCACGAAGAG GATTTCAACG GGACCGTCAA CAGCAGCCCG TACTCCGAGG GATTCGATAC CGGCCAGAAC ATCTTCAACA ACAACGTGCT TATCAACATG ACGCACGTGT TCACTCCCAA CATCGTGAGC CAGTCGAAGT TTGATTTCAA CCGACTGAAC TTGCTGCAAC CGCTCGGAAC CCAGCCCGTG GGGCCAACGA TGTACGTCTC CTCGCAAGGC GTGCCCACCT CCGGCGGATA CTCGCTGATT TTCCCCGGAT ATAGCGAATT CACGCCAGGT AACTCAATTC CATTCGGCGG CCCGCAGAAC CTCTATCAAT TCTTCCAGGA TGTCTCATGG ACGAAAGGTC GTCACCAGCT GCGTTTTGGC GGACAGTACA TTCACATTCG CGATAACCGC ACCTTCGGCG CTTATGAAAA CGCTGTGCAG TATCTCAGCA CGGGCGCACC CGTGACCGCG GGTGGAAATA CTTATCGCGG CAACACCGCC GGCATTTACA ACCTAGTCGC CGGCAACATC GCGAACATGC AAGTCGCGGT TGACCCTCGC GGAGCGTTCC CCGGAGACAA CATCTCTCTT CCGGCAGGCG CACCGAGCTT CTCGCGTAAC AATCGCTTCA ACGACGGCGC GTTCTACCTC CAGGATTCCT GGAAAGTAAC CAGCCGCTTG ACGCTCAACT ACGGCGTGCG CTGGGAGTAC TACGGTGTGC AGCACAATGC CGATCCCTCG CTCGATTCCA ACTTCTACGA AGGGTCCGGT GCGACGCTGC CGATCCAGGT TGAGAACGGG ACCGTGCAGA TTGCCAATCA GAGCCCGGTC GGCTCTCTCT GGGAGCCTTC CAAACACAAC TGGGGTCCGC GCCTCGGCTT CGCCTGGGAC GTCTTCGGTG ATGGCAAAAC GGCGATACGC GGTGGTTGGG GCATGAGCTA CGAGCGGAAC TTTGGCAACG TGACCTTCAA TGTCATTCAG AACCCACCGA ATTACGCAGT TCTGAATGCA GTGAACACGC CCGTGACGCT CGACAACTTC GGGCCTCTCT CCGGCAGCAG CGGCAGCGTA GTTCTTCCTC CAACCACTCT GCGTGCCGTG CAGCCCAACA TTGACAACGC GTACACCGAG TTCCGCAGCC TATCGCTGGA ACGCGAGGTA CTAAAAAATA GCCTGGTTGC CTTTGAATAC AGCGGCTCGA ACGGCGTTCA CCTGTATGAC ATCGGCAACA CGAACGTGTT TTTCCCGGGG TATGCTGGCT ACGGTGATTA CTTCGATCCT GCTACCTATC ACTCTGGAGT TGCGTGTTAT CCCGGATGCC GCCTGAACCA GCAGTATTCG AACATCAACA GCCGTGGCAG CCGCGGATTC TCGCGCTACA ACGGCCTGAA CACACGCTTC ACCACCAACA ATCTCTTCAA CAAAGGTCTG CAGCTCAACT TCAACTGGAC GTGGTCACAC TCGATTGACA ACTTGAGCTC AACCTTTAGC GAAGGCAACA ACGGCGCGTT CCAACTCGGT TACGAAAACT ACTATGCTCC GCAACTCGAC ACCGGCAATT CCGAGTTCGA CGTCCGTCAC CGCATCGCGG TTAGCGCAGT CTGGGACCTG CCCTGGATGA AGAACGCGAG CAATGCGTTC GTTCGCCAGG CGCTCGGTGG GTGGAGCTTT TCTCCCCTGA TCACCTACCA TACCGGCTAC CCGTTTTCGG TCTATGACTG CACCAACGGA ATCAGCCAGT GTCCGCGCTA CTTGCCGACG GGTGGTGAAC GCGACGGCTT CGCCAATTCA TCAACCTACG CTGGTGGAGG CGTCTTCAAT TACCTGAATG CCGGCTCGCT CGTCGCAGCT CCTGGCTTCG GAATGCCGGG TGTCGGCGGT TCGAGCCAGG TTCCGGAAGC GCCTTGCCAG GGAGCGATCG GATGCAACTG GGCCGTCGGT CCGCGCAACA TGTATACCGG CCCCGGCAAT CACCAGTTCA ACGCGGTTAT CGGGAAGACG TTCAAACTCA CCGAACGGTT CAACTTGCAG TTCCGCGGCG AGATGTACAA CGTCTTCAAC AACCACAACT ACTTCCTGCT CACGTCGAAC GCCGACGTCA GCAGCGGTGC GCTGGGAAGC CCGTTCTTCG TACAAGCTGT TAAGGGCGGC TTCGGCAATC CGACGGACGA ACGTCGTAAT GTTCAGTTCG GCTTGAAGTT GATTTTCTAA
|
Protein sequence | MIGSRARYTL LCALLFLCSA MLFGQAETGL ITGTVVDVSG AVVGGATVTV TDVNTGAQRT ATTNNDGSYT VSNLKPSMYE VVIDKQGFTK YTRRIAVTVG SRNELSAQMS VMGGGTTVEV TAESGGAAVN TETQTLSSVV SGAQITELPT LTRNPYDLVA TAGNVTEDTT GTMRGAGFSI NGQRSASTDV LLDGGENVDM FTASVGQQVP LDSVQEFRVV TSNFTAEYGR AGGGVLNVAT KSGANAFHGT AYEFNRISAL AANTWENDTN DIPKSTFTRN QFGYSVGGPI IKNKLFFFSN TEWIRVRSSS NQIVSIIDPA MFPNLAPNSV AALSYADVRS NATLLGSTSC AADALCSPLL ASNGGPLPNG SPFTQQLSYT APAEAGGGLP ENTWMTVNRF DYNMTDKTTF FGRYAGYHEE DFNGTVNSSP YSEGFDTGQN IFNNNVLINM THVFTPNIVS QSKFDFNRLN LLQPLGTQPV GPTMYVSSQG VPTSGGYSLI FPGYSEFTPG NSIPFGGPQN LYQFFQDVSW TKGRHQLRFG GQYIHIRDNR TFGAYENAVQ YLSTGAPVTA GGNTYRGNTA GIYNLVAGNI ANMQVAVDPR GAFPGDNISL PAGAPSFSRN NRFNDGAFYL QDSWKVTSRL TLNYGVRWEY YGVQHNADPS LDSNFYEGSG ATLPIQVENG TVQIANQSPV GSLWEPSKHN WGPRLGFAWD VFGDGKTAIR GGWGMSYERN FGNVTFNVIQ NPPNYAVLNA VNTPVTLDNF GPLSGSSGSV VLPPTTLRAV QPNIDNAYTE FRSLSLEREV LKNSLVAFEY SGSNGVHLYD IGNTNVFFPG YAGYGDYFDP ATYHSGVACY PGCRLNQQYS NINSRGSRGF SRYNGLNTRF TTNNLFNKGL QLNFNWTWSH SIDNLSSTFS EGNNGAFQLG YENYYAPQLD TGNSEFDVRH RIAVSAVWDL PWMKNASNAF VRQALGGWSF SPLITYHTGY PFSVYDCTNG ISQCPRYLPT GGERDGFANS STYAGGGVFN YLNAGSLVAA PGFGMPGVGG SSQVPEAPCQ GAIGCNWAVG PRNMYTGPGN HQFNAVIGKT FKLTERFNLQ FRGEMYNVFN NHNYFLLTSN ADVSSGALGS PFFVQAVKGG FGNPTDERRN VQFGLKLIF
|
| |