Gene Information |
Locus tag | Acid345_0783 |
Symbol | |
ID | 4068564 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 967343 |
End bp | 970468 |
Gene Length | 3126 bp |
Protein Length | 1041 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637982790 |
Product | TonB-dependent receptor |
Protein accession | YP_589862 |
Protein GI | 94967814 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1629] Outer membrane receptor proteins, mostly Fe transport |
TIGRFAM ID | |
|
|
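The coordinate fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS length includes the stop codon; both assumptions are consistent with the values listed in this record, but they are not stated by the source. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for a CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(967343, 970468)
print(length)                  # 3126, matches "Gene Length"
print(protein_length(length))  # 1041, matches "Protein Length"
```

Because the gene is on the minus strand, the start and end values describe the interval on the replicon, not the direction of transcription.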
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.149411 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.179647 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGAGAT ATATCCGTTA TGTGTTGAGC TTAGCGCTCC TGCTATGCGC GACACATATG TTATTCGCCC AGGCGACCGC TAGCGCGAAT GTCGCCGGCA CCGTGTATGA CAAGACGTCC GCGGTCATCG CCGGCGCACA AGTCACCATC ACGAGCAAAG CGACCGGACA AACGCGTACT TCCAACACCG ACAGCAACGG CGCGTACCGC TTCGACCTGT TGTCTGCCGG TCAATACAGC ATCAAGATCA CCAAGTCCGG ATTTGCAAGC CTTTCCGAAA ACGTAGAGCT TCTGGTCGGA CAAACTTCGA CCATCAACGG TGTCTTGAAT CCAGGCTCCG CCAGCGAAGT GGTGGAAGTG ACCGAAGCCG CGCCGCTCGT GGATGTCGCC AAGACCTCCG TAAGCACGCA GATCACGCCC AGCGAAGTGC AGGAAATGCC GCTTGTCGGA CGTGACGTTG CCAACCTCGC CTACCTCGCT CCGGGCGTGA AAGCCGCGGA CTCCTACGAT CCCACCAAGA ACCGTTACGC GATTCTCTCC GTGAACGGCG CCGATGGCCG TAACGTCAAC GTGACTGTCA ACGGCGTGGA TAACAAGGAC AACACGGTTG GCGGACCGGT CATGCAGCTT CCGCTCGAAG CCGTGCAGGA ATTCCAGATC AGCACCCAGC GTTTCTCGGC TGAAAACGGC CGTTCGCAGG GCGCCGCCAT CAACATGATC ACCAAGTCGG GTACCAACAT GTACCACGGC TCCGCCTTCG GCTTCTTCCG TACCTCGGCC CTCGACGCCG ACGAAATGGT TCCGGATGGC ACCGGTGGCG CCTTCCACTC ACACCCCGAC TACAGCCGTC AGCAGTTCGG CGGCTCCTTC GGCGGCCCGA TTATCAAGGA CAAGCTCTTC GGCTTCTTCG CACTCGAACA TGAGCGTGAA CGCCAGGGCC TCAGCGAGAG CGGCACTTCG TTCGACGAAC TCTCCCTCGC GGCGGGTGCT GGACTCGCAG CTCAGCCGTC AGCAGTCATT CCGCGTCCGT TCAACGAAAC TCGCTATAGC GGCCGTCTCG ACTGGAACGT CAACAGCAAG AACTCCGCGT ACCTTTCCTA CAACTCGCAG GTGAACGACA GCCTGAACGA TCAGTCGGAC GGTACCGGCG ACCTGACCAA CGGTAACTTC ACCAAGAACC ATCTGCAGTT GGCCAACTTG ACTTGGAATA CCTTGCTAAC GTCGCACCTG ATCAACCAGT TCACGTTCGG GTGGCAGTAC TGGAACAACC TGATCGACAG CGACATCAGC GCGCCGCTGG TCACGTTCCC GAACGCTTCG TTCGGCACCA ACACCAACGT TCCGCAGCAG TCGTTCCAGC GCAAGTTCCA GTTTAAGGAC GACATCAGCT GGACCCACGG AAAGCACACC TTCAAAGGTG GCGTGGACTA CATCTGGAAT CCAGTGGAGG GCGGCTTCTT CGAGTACAGC TCAACTCTCG AAATCGATTT CGGCGCCGAC CCGAGCTGCA TTTTGGCCGC CGCTACCGAC GACGTCAACA AGTGCGGTCC GGGATATTAT CCGCAACAGT TCGCCACCAA GGGCGCTGTC ACCGGCATGA CCATCGCCAA CGGCGATCCG CAGTTCATCG TGCCGACCAA GCAGCTCGGC TTCTACTTCC AGGATGATTG GAAGGTCACG CCGCGATTGA ACTTGAACCT CGGCATCCGT TGGGACAAGG ACTTCAACAC CTACGGTCAG TCCGACATCA CGAACAGCCG TACCTACCAG GAATTGGTGG CGATCAACAG CCCGATCACC 
AATCCGTATG TCGCGAGCCT CCCGCACGCC AGCAACAAGG ACTTCAGCCC GCGTGTTGGC TTCGCCTATG ACCTCACCGG TTCCGGCACG CACGTCCTGC GCGGCGGCTT CGGTCTCTAC TACGGCAACT CCTTCCAGAA CATTCCGTTG TTCATGGAAC AGCAGGCCAA CTCGACGATC TTCCAGACCC TGTTCAGTTT GAGCGATCCG GTGAACGATG TCGTTCCAGG CACCGGAATT CCTCTCGGAC AATGGCAGTA TGGAATCAGC CCGATGCCCA CAATTGCCCC GCCGTCTGCT GACCTCACTG TAGGCAGCAC CGGGCGATTG GTCGATCCTA ACTACCAGAA CCCGGTGTCG GAGGAATTTA ACTTCGGCTA CTCTTGGGGA GTAACCTCCA ACTCGGTGTT CGAGACGGAG TTCACGCACG TCCAAAACCT GCATGAAAAC CGCACCATGA ATATCGACCA GAAGGTTCCG GTCGGTGGTG TTTGCTGCTT CCGTCCTCTC GACGATGCCT TTGCCGCTGC CGGCCAACCG CGCTTGAACA GTGTGCGCGA TGAGCAGTCC ATCGGCCGAT CCCGCTACGA CGGCATCAAC TTCGGCTACC GCCAGCGCAT GACGCATCAC TTCATGTTCA ACGCGTACTA CACCCTGGCC TGGGCCGACG GTTACAACAC CAACGGCAAC TATGCCTTCC GCAACTACCC GTTCCTCGCA ACCGATCCGT TTGCCAAATA CAACTGGGGA CCGACTTATT CCGATGAACG TCACCACGTG ACCATCAGCG GTTTGGTTGA TTTCCCCTTT GGCATCCAGG CTTCGCCGAT TTTGCAATAT GGCTCGGCGC GCCCGTACGC TCTGACCAAC TCTTACAACA CGCTCAACAC CGGTAACGGC ACGGCGACCG CCGTAATCGT CCCCAAGGGA CAGACGAATA ACTACCTCTA CGGCACGAAC TACATCGCGT CGTACGTTGC CGCTCATCCG GGCGACGACA ATGCAACCTC GGAAGCGCAG CAGAACCTCC AGATGTGCTT CTACAACGGC GATTGCACCC TCGCGCAGTT CGATCCGCTT CGTGGCAAGC CGACCTTCGA GCTTGACCTC CGCTTGGCGA AGAACTTCAA GCTGGGTGAG CGCTTCAACC TGCAGATCAC CGCACAGGCG TTTAACCTGA CCAACGCCCC GAACTACGGA AACAACTTCA ACGGCAACAT CGCCTCGCCC TCCACGTTCA TGCATCCGGC GGGCTTCATC AACCCCAGCA GCACCACGAC TCCGCGTTCG CTGTGGAGTG AATACGGAGT ACGCCTCACG TTCTAA
|
Protein sequence | MRRYIRYVLS LALLLCATHM LFAQATASAN VAGTVYDKTS AVIAGAQVTI TSKATGQTRT SNTDSNGAYR FDLLSAGQYS IKITKSGFAS LSENVELLVG QTSTINGVLN PGSASEVVEV TEAAPLVDVA KTSVSTQITP SEVQEMPLVG RDVANLAYLA PGVKAADSYD PTKNRYAILS VNGADGRNVN VTVNGVDNKD NTVGGPVMQL PLEAVQEFQI STQRFSAENG RSQGAAINMI TKSGTNMYHG SAFGFFRTSA LDADEMVPDG TGGAFHSHPD YSRQQFGGSF GGPIIKDKLF GFFALEHERE RQGLSESGTS FDELSLAAGA GLAAQPSAVI PRPFNETRYS GRLDWNVNSK NSAYLSYNSQ VNDSLNDQSD GTGDLTNGNF TKNHLQLANL TWNTLLTSHL INQFTFGWQY WNNLIDSDIS APLVTFPNAS FGTNTNVPQQ SFQRKFQFKD DISWTHGKHT FKGGVDYIWN PVEGGFFEYS STLEIDFGAD PSCILAAATD DVNKCGPGYY PQQFATKGAV TGMTIANGDP QFIVPTKQLG FYFQDDWKVT PRLNLNLGIR WDKDFNTYGQ SDITNSRTYQ ELVAINSPIT NPYVASLPHA SNKDFSPRVG FAYDLTGSGT HVLRGGFGLY YGNSFQNIPL FMEQQANSTI FQTLFSLSDP VNDVVPGTGI PLGQWQYGIS PMPTIAPPSA DLTVGSTGRL VDPNYQNPVS EEFNFGYSWG VTSNSVFETE FTHVQNLHEN RTMNIDQKVP VGGVCCFRPL DDAFAAAGQP RLNSVRDEQS IGRSRYDGIN FGYRQRMTHH FMFNAYYTLA WADGYNTNGN YAFRNYPFLA TDPFAKYNWG PTYSDERHHV TISGLVDFPF GIQASPILQY GSARPYALTN SYNTLNTGNG TATAVIVPKG QTNNYLYGTN YIASYVAAHP GDDNATSEAQ QNLQMCFYNG DCTLAQFDPL RGKPTFELDL RLAKNFKLGE RFNLQITAQA FNLTNAPNYG NNFNGNIASP STFMHPAGFI NPSSTTTPRS LWSEYGVRLT F
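The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the gene sequence is given in coding orientation (it begins with ATG and ends with TAA, which suggests this) and uses the amino acid assignments of translation table 11, which are the same as those of the standard code. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and the sketch does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

# Codon order follows the usual TCAG layout; the amino acid string is the
# assignment shared by translation tables 1 and 11 ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that separate the 10-letter blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from the record above.
print(translate("ATGAGGAGAT ATATCCGTTA TGTGTTGAGC"))  # MRRYIRYVLS
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the listed GC content.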