Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_4189 |
Symbol | |
ID | 4072148 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 4955811 |
End bp | 4959128 |
Gene Length | 3318 bp |
Protein Length | 1105 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637986220 |
Product | TonB-dependent receptor |
Protein accession | YP_593263 |
Protein GI | 94971215 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.316829 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.196574 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATCAAA AAATCGGAAC GCGAAGTGTG GCAATCGTTA GCCTGATGTT GCTCATACTC ACCTTCGCCG TTACCGGATT GGGCCAAACT AGCAAGGGCA TCATAGGTGG CACAGTTACC GATAACAGCG GAGCAGTGGT CGTAGGAGCA CAAATTACAG CAACCAATCT CGACACCGGC GATTCTCGCA CGGTGCAATC CGGACCGACC GGAGCCTTCC GTATGGAGGC GTTGAATCTC GGGAAATACA AAGTGACCGT CGGGTACCAG GGCTTCCAAA CGCAGACCGT GACGGGTCTG GAAGTGGCTG GCTCGGTCAT GACCCCGCTC GATATCAGAC TACAGATTGC GAGCGGCACC GCGACGGAAA TTACAGTTTC CGCGGATACC AACCAGGTTC AGACGGAAAA CGGCGAACTT TCCGGCAGCA TCGGTTCAAA AGAACTTTCG GATTTGCCGA TCAGCAGTCT CAACCCAATC CAACTGGCGT TGACGCAGCC TGGAGTAATC GACAACAACG GCCGTGGCAG CACGAACGGC CAGGGTTTCT CCGTCGCCGG CGGGCGTCCG CAGAGCAACA ACTTCCTGAT CGATGGTCAA GACAACAACG ACAACAGCAT CCAAGGTCAG GCGTTCCAGC CGCAGAATCC CAATGCGGTC CAGGAAGTCG CGATCATGAC GAACTCTTAC TCGGCAGAAT TCGGCCGCGG TGGATCATCG GTCACCAACG TTATCTTCAA AAGCGGCACG AACCAGTACC ATGGCACGCT GAGCGAACTC TATTCCGGTT CCGGATTAGA TGCGATCGAT GCGGCCAACG GCCTTGCCGG CATGAAAGAT GGCGGTGATT GCAACCGGGC CCCGAATTAC GCTCCCTGTA AAGGCCGCTA CGATACGCAC ACCTTCGGAT TTACCGTGGG TGGCCCGATC GTTAAGGACA AGCTGTTTGC GTTCGGCAGC GGATTGTGGA ACCGCACTTA CGGCAATGAA GTCTCGAGCA CCTTCACCAT TCCCACGGCG AACGGAGCGG CGCAGCTATC AAGTTATGGT TCGACCAATG CGAACCTGAT GCTGCAGTAT CTCGGTAACA TTCGCGGGGG CTCAAACATT CAGAACGTGG CCACGGGCAT TGCGAGCATG CCGTTCGTCG AAATGGGCGA CGCGACCCGC ATTGTTCCGG AGCAGAGCAC GGACACGCAG TGGAATGTGA AAGTGGATTA CCTGCCCCAC CAGAGCGACA GCATTACCTT CCACTACCTG CATGATCGTG GCTATTTCTC GCCCGATTGG TTTGCAAACT CGAGTTCGGT TCTGCCGAAC TTTGAAACCT ACCAGGGAGG ACCGTCCTGG ATCTCGGGTG GCTCCTGGAC GCACACATTC AGTTCCAACA AGGTGAATGA GTTCCGTGTC TCTTACGGCC ATCTCGGATT TACTTTCGCT CCAACAGCGG GAACGACTGC AAATCCTCTT TATCCATTGC CTTATCTCTC TCTGAGCAGC CCAAGCACGT TCCCGCTTCT CGGCACGGAT TCCGGTTTCC CGCAAGGGCG TAGCCATCGT ACTTTGCAAC TTCAGGAGGC GTTCTCGATC ACGAAGGGAG CGCACACCGT CAAGATGGGT GTCGACATCG CCCATATCTC GGTGACCGAC GACATTCCGA TCAACTCTCG CGGATGGATC ACCTTCTCGG CGGGTGGCGG CTACAGCGCT CTCGGCAATT TCCTCGACAA CTACACAGGG CGCAGCGGAC AGGCCCTCGA CATCCAGATT GGCAATCCGC GCGTGGAACC GACTCTCCTG CAGAGCGGAT ACTATGTGCA AGACAATTGG AAGATCAAGT CGAACCTGAC CTTGAACCTC GGTCTTCGCT ACGAATATCA GACAAATCCC GAGAATTCGC TGACGTACCC AGCCGTGAAG CCGATCATGG GCGGCGAAGT CGCGTTCCCC ACCGTGGTGA AGGCAGACCA GCAGTACATG CACTTCGCGC CACGTATCGG GTTTGCCTAC ACGCCTGATT TCTGGCCGAG CCTTTTCGGT GACGGCAAAA CAGTCATTCG CGGCGGTTAT GGAATTTTCT ACGACGCGCT GTACACCAAC ATTCTCGACA ATACGGCGTC CTCTTCACCA AACTCGATCG ATGTGCCCTT GTACGGCCGT AATGGCGGCG CGCGCGGTTA TGCAAGCGCG ACCGACCTCT TCAACTCGCT TGATCCGGTC GTCAGCCCTT TCAACACTGT CACCAGCGTT TCGCAACGCA TGACCAATCC GCGTACCACG CAGTGGAACC TGGACGTCCA GCGCGAACTG CCGTGGAACC TGCTCGCGAC CGTCGCATAT ATCGGCAGTC GCGGTCAGAA GCTGCTGGTA AACGACGACT ACAACCCCTT CGGCGGATAT GACGCCACGA CGGGCGCTTA CATTCCCCGC TATAACTCGG ATCGCGGAGC CATGGCAATC CGCACCAACG GCGGCGACTC GTACTATCAC GGCCTCGCCT TCACGGTGGA GCGCAAGTTC AACAAGGGCC TCATGCTGCG CAGCGCGTAT ACCTTCTCGA AGTCGATCGA CGACAGTTCG AACATTTTCG TGATCACGGG TGGATCGTCG TACGCGCAAA ACGTGTGGGA CCGCCAAGCC GATCGTGGCT TGTCTGCTTT CAATGCATTC CAGCGCTGGG CGTTTACTTA CGTTTGGGAC GTTCCGGGCT TTAAGTCCGA AAACAAGGCC CTCGACGTGC TTGGATACAT TTCACGTCAC TGGCAGTGGA CCGGGACCAC CAGTCTGCAG TCTGGTTTGC CCGACACGAT CTATGTCGGC TCACTCGACA GCACTGGCGA GGGTCACGGC TACAGCGGAC GTCCGGACGT GCTTAGCAGC AGAGCGCCGA TGACGAACGT GGCGATCTCC GGCCAATACT CTTATTGCTG GCCGGGCGAC GCCCAAACGG CACCCGCGTA CGACTGGGCT ACTTGCGCCC CGATCAGCCA GAGCGACCTG AATGGATACC ACTGGTTTAT TCCATTCGGT CGTCCAGGAA ACGAAGGACG TAACAGCTAC ATACTACCGG GACAGATCAA CTTCAACTTC GGCATCAATC GCAACATTCC GATCCCGAAG CATGAGTCGC AGTTCCTGCA GCTCCGCGTC GAGATGTACA ACCCGTTCAA TCACCCGAAT GAGTCGGCGA ACCCTGGCGG TTTCTGGACG ACGGACGTGA ACACGATCGC GCCCGACAAT CCAAGCCACC TCTTCGATAA GTTCTGGGCA CGCCAGGGAG GTCGCAGTAT TCGCCTCTCG GCCAAGTACC AGTTCTAA
|
Protein sequence | MYQKIGTRSV AIVSLMLLIL TFAVTGLGQT SKGIIGGTVT DNSGAVVVGA QITATNLDTG DSRTVQSGPT GAFRMEALNL GKYKVTVGYQ GFQTQTVTGL EVAGSVMTPL DIRLQIASGT ATEITVSADT NQVQTENGEL SGSIGSKELS DLPISSLNPI QLALTQPGVI DNNGRGSTNG QGFSVAGGRP QSNNFLIDGQ DNNDNSIQGQ AFQPQNPNAV QEVAIMTNSY SAEFGRGGSS VTNVIFKSGT NQYHGTLSEL YSGSGLDAID AANGLAGMKD GGDCNRAPNY APCKGRYDTH TFGFTVGGPI VKDKLFAFGS GLWNRTYGNE VSSTFTIPTA NGAAQLSSYG STNANLMLQY LGNIRGGSNI QNVATGIASM PFVEMGDATR IVPEQSTDTQ WNVKVDYLPH QSDSITFHYL HDRGYFSPDW FANSSSVLPN FETYQGGPSW ISGGSWTHTF SSNKVNEFRV SYGHLGFTFA PTAGTTANPL YPLPYLSLSS PSTFPLLGTD SGFPQGRSHR TLQLQEAFSI TKGAHTVKMG VDIAHISVTD DIPINSRGWI TFSAGGGYSA LGNFLDNYTG RSGQALDIQI GNPRVEPTLL QSGYYVQDNW KIKSNLTLNL GLRYEYQTNP ENSLTYPAVK PIMGGEVAFP TVVKADQQYM HFAPRIGFAY TPDFWPSLFG DGKTVIRGGY GIFYDALYTN ILDNTASSSP NSIDVPLYGR NGGARGYASA TDLFNSLDPV VSPFNTVTSV SQRMTNPRTT QWNLDVQREL PWNLLATVAY IGSRGQKLLV NDDYNPFGGY DATTGAYIPR YNSDRGAMAI RTNGGDSYYH GLAFTVERKF NKGLMLRSAY TFSKSIDDSS NIFVITGGSS YAQNVWDRQA DRGLSAFNAF QRWAFTYVWD VPGFKSENKA LDVLGYISRH WQWTGTTSLQ SGLPDTIYVG SLDSTGEGHG YSGRPDVLSS RAPMTNVAIS GQYSYCWPGD AQTAPAYDWA TCAPISQSDL NGYHWFIPFG RPGNEGRNSY ILPGQINFNF GINRNIPIPK HESQFLQLRV EMYNPFNHPN ESANPGGFWT TDVNTIAPDN PSHLFDKFWA RQGGRSIRLS AKYQF
|
| |