Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1549 |
Symbol | |
ID | 4072940 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1894436 |
End bp | 1897633 |
Gene Length | 3198 bp |
Protein Length | 1065 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637983558 |
Product | TonB-dependent receptor |
Protein accession | YP_590625 |
Protein GI | 94968577 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.114571 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCCCTT TGCGAACGCT CTGCGTCGTT CTCCTGCTCT CTTTTTCTGC GTTCGCACAG GTCTCTGCGA ATCTCTCGGG CTTTGTCACC GATCCCAGCG GCGCCGCCGT GGCCGGCGCA GAAGTACGCG CCATCAACAA TGAGACCGGT GCGGTTCGCG TCTCCGCGAC CAATGCCTCA GGACGCTACG ATATTGTCGC GTTGCCCGTT GGACAGTATG AAGTCCACGC GAGGAAGCAG GGCTTCGCCG AACAAGTGCG CACCGGCATC CTTCTCGTAG TCGGGCAGGA TGCTACCGCC GACCTCAAAC TCCGCATCGG CGACGTAAGC GAGCAGGTGA AGGTGAGCGC CGACGCCGAA ATGGTCGGCG TTACCAACCA GGACATCTCT GGCCTGATCG GCGAAAAACA GATCAAAGCG CTTCCCCTGA ACGGCCGCAG CTACGATTTG CTCGTTACGC TCAATCCCGG CATCGTTAAC TTCACGTGGG AGAAAACGGG CGGCACCGGC GTTTCCAATT CCACGACCGG CAACAACTTT TCAGTCTCGG GGAATCGTCC GCAGCAAAAT CTGTTCCTGA TGAATGGCGT CGAGTTTTCG GGCGCGGCGG AAAACAACAT GCAGCCCGGC GGCCCGAGCG GCAACGTGAT CGGTGTGGAA GCCGTCCGCG AATTCAACGT TCTTCGCGAC AACTACGGCG CGCAATACGG CAAGCGTCCC GGCGGACAGG TCGTCATCGT CACGCAGTCC GGCTCAAATC AGTGGCACGG CTCCGCCTAT GAATACCTGC GCAATAACGT CTTCGACGCC CCCAACTACT TTGATCAGGG CGACGCTCCA CCGTTCCAGC GCAATCAGTT CGGCGCGTCG TTAGGCGGTC CGCTCCGCCA TGACCACACA TTTTTCTTCC TGAACTTTGA AGGCGTAATC CAGAACTTGC ATCAGACCTC CGCCGCGTTC GTACCCGGCC TCGACTCCCG CGCCGCCGCC GTTCCCAGCG TGCAGCCGTT GCTCAATCTC TGGCCTACGC CATCGGCGAG CGCTCCGGAA TTCAACGGCA TCGCGCAGGT CTTCAGTAGT CCTCTTCAAA CCATCCGCGA ATATTTCGGC AACGCGCGCG TAGACCACAC CTTCTCTTCG CGTGATTCGC TCGCAGCGTC GTACGTCATC GACGACGGCC ACGATCTCAC CGCCACGGTT GCCAATCCTT TTAGCTCCGA CATCCTCACG CTACGCGAGC AAGTGCTCAG CCTCAACGAG ACGCACGTCT TTTCGCCGTC GCTCTTGAAC GTCGCGCGTG TCGGCTTCAC GCGCGCCGGA TACTACTTCG CCGGTGAACC CACACCCGGA ACCCCCGCCG CCGATGTCCC CGGCTTCCTC CTCGGACATC AGGTCGGCGC GGTTGTCGTC GGCGGCAGCG CCGCATCCAA TCCGCAGGCG CAGGTCGGAT TAGCCGGCAG CAACAACGGC AGCAATCTCA GCATTGCGCG CAACATCTTC ACGTACGAAG ACCAGCTCAG CCTCACCCGT GGCCGCCACC AGATCACCGT CGGCGCATGG TTCCAGCAGT TCCAGTCGAA CGAGAACATC GCGCTCAGCC AGTACGGTCA GGCCACCTTC GCGAGCCTGA CCACGTTCCT GCAAGGCACC ATCGGCACCT TCTTGTACGA TCCCGCGCCG AGCAACATGA ACTGGCGTTC GCTCTTCGGC GCCGGGTACG CCGAAGATGT CTTCCGCGTC TCGCCGAAGT TCACGCTCAC CCTCGGCTTC CGCGGCGAGT TCTCCACCGG ATGGAACGAA GCTCACGGTC GCGCCGCGAA CTACACTTAC AACAACGGCA TCATCTCCGA TCAGCCGCGC ATTGGCGATA GCGCCTTCAC CGAAAACAAT TACAAGTTCC TGGCACAACC GCGCGTCGGC GTCGCGTACA GCCCGTTTAG CGGCACGGTG TTCCGCGCCG CGTTCGGCAT CTACAACGAA CTCCAGGACG CCCTCGGCTA CCGCATGGAC CAGAACGCTC CGTTCAACCC GGTGTACAGC CTCGCGAACT ATAAAGTCTC GAACCTGCCG CTCGATCCCA CCGCCCCTGT GCCTGCCGCA GCGTTGCTAA TTCCCGGTGG CACCCAGCCC GATCTCAAGG CTCCCACGCT GTTCTCCTGG ACCTTCGGCA TCGAGCAGGA ACTCTCGCAC AATACGTCGT TGAACCTGCG CTACGTGGGC TCGCACGGAT ACCACGAACT CGTCGGCGTT GACGCCAACG TGCCCTCGAA TCCGATCATC TGCCCCGCCG ATCCATGCCC CGCGGTCTAT CCGGCGACTT TCCCGGTAGG CCTCGCGGGA ACGCCGATTC CCGCCGGAAG CTACTACATC CCGAAAGGCA CGCCGAAAGG GAATCCCACG ATCAACAACA CGTGGACGTG GTTCTCTGTC GGTACGAGCA GCTACAACGC GCTGCAACTC GACGTGAACC ATCGCTACAG CAACGGCCTC TCTCTGCGCG GCGTGTACAC ATGGTCGAAG ACGCTCGACG ACGGTGACTC GCTCAACCAG ACCACCGCCA ACAACGCACC GGGGCTGGTG TCGAATCCGT ATGACATCAA GGCCGACTGG GGACCTGCGA CCTATGACGT GCGCAATCTC GGCGTAATCA GCGCCGTTTA CGAATTGCCA TTCGGACGCG GCAAGCGCTT CCTCGCGTCA TCGAATTCGT TTGCCAACTG GACGGTCAGC GGATGGTCTG TGAACAGCAT CGCCGTAATG CAGAGCGGCT TCCCATTCAC GCCGCAACTC AGCTACAACC CATCGAACAC CGGCGACACG CGCAATCCCG TGCGGCCCTT TGCGAACCCG AACTTCACCG GACAGATCGT CACCGGCAAT CCAATCCAGT GGTTCAATCC CGCCGCATTT CTGGCACCTC CGGCGAACAG CGGCTTCTGG GGAAATCTCG GACGCAACAC GCTTACCGGG CCGGGGCTCG GTACGTGGGA CTTCTCCGCG ATCAAAGATT CGAAAATCAA CGAGCGGATG AGCCTGCAAT TTCGCGCCGA AATCTTCAAC CTGCTCAACC GCGCGAATTT CAATACGCCG AACTTGATCG TCTTTACGCC GACGGGCGTA TCCGGAACCG CAGGCGCAAT CAGCAGCACG TCCACCACCT CGCGCCAGGT ACAGTTCGCC CTCAAACTGC TTTTCTAG
|
Protein sequence | MSPLRTLCVV LLLSFSAFAQ VSANLSGFVT DPSGAAVAGA EVRAINNETG AVRVSATNAS GRYDIVALPV GQYEVHARKQ GFAEQVRTGI LLVVGQDATA DLKLRIGDVS EQVKVSADAE MVGVTNQDIS GLIGEKQIKA LPLNGRSYDL LVTLNPGIVN FTWEKTGGTG VSNSTTGNNF SVSGNRPQQN LFLMNGVEFS GAAENNMQPG GPSGNVIGVE AVREFNVLRD NYGAQYGKRP GGQVVIVTQS GSNQWHGSAY EYLRNNVFDA PNYFDQGDAP PFQRNQFGAS LGGPLRHDHT FFFLNFEGVI QNLHQTSAAF VPGLDSRAAA VPSVQPLLNL WPTPSASAPE FNGIAQVFSS PLQTIREYFG NARVDHTFSS RDSLAASYVI DDGHDLTATV ANPFSSDILT LREQVLSLNE THVFSPSLLN VARVGFTRAG YYFAGEPTPG TPAADVPGFL LGHQVGAVVV GGSAASNPQA QVGLAGSNNG SNLSIARNIF TYEDQLSLTR GRHQITVGAW FQQFQSNENI ALSQYGQATF ASLTTFLQGT IGTFLYDPAP SNMNWRSLFG AGYAEDVFRV SPKFTLTLGF RGEFSTGWNE AHGRAANYTY NNGIISDQPR IGDSAFTENN YKFLAQPRVG VAYSPFSGTV FRAAFGIYNE LQDALGYRMD QNAPFNPVYS LANYKVSNLP LDPTAPVPAA ALLIPGGTQP DLKAPTLFSW TFGIEQELSH NTSLNLRYVG SHGYHELVGV DANVPSNPII CPADPCPAVY PATFPVGLAG TPIPAGSYYI PKGTPKGNPT INNTWTWFSV GTSSYNALQL DVNHRYSNGL SLRGVYTWSK TLDDGDSLNQ TTANNAPGLV SNPYDIKADW GPATYDVRNL GVISAVYELP FGRGKRFLAS SNSFANWTVS GWSVNSIAVM QSGFPFTPQL SYNPSNTGDT RNPVRPFANP NFTGQIVTGN PIQWFNPAAF LAPPANSGFW GNLGRNTLTG PGLGTWDFSA IKDSKINERM SLQFRAEIFN LLNRANFNTP NLIVFTPTGV SGTAGAISST STTSRQVQFA LKLLF
|
| |