Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2732 |
Symbol | |
ID | 4069423 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 3230136 |
End bp | 3232397 |
Gene Length | 2262 bp |
Protein Length | 753 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637984749 |
Product | polysaccharide export protein |
Protein accession | YP_591807 |
Protein GI | 94969759 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1596] Periplasmic protein involved in polysaccharide export |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.445544 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.725394 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGACGG CAACGCGTTT GCAGAAGCAT TTAGCGGTTG CGATTTTTAA GGGCTCTCTC CCAAAGGTGA TAGTGGTGGC GGTGCTCCTG GCGATCGGGG CATCGCCGCA ACTATTGGCA CAGAGGGCGG AGTGCGGCAC GGCGAACTGC GCCGCAACCG TGAAGGTGGT GGACTCTAAA CCAACCGAGA AGCCGAACCA GATTGTTACG ACCGAGACAC CGGACGTTGT GCAGGTGCAG ACCTCGCGGC CTCCAAAGCA AGTGGAGAAG CCGAGCGAGT TTGAGAGGTA CGTCGAGGAG TCGCTCGGGT ACGCGCTGCC GTTGTTCGGG CGGAAGCTGT TCGCGGACCC GCCATCAACC TTTGCGCCGC TGACCGAAGT TCCGGCGCCG GCGGAATACG TGATCGGCCC GGGCGATGAG CTGGTGATTC GCGGCTGGGG GCAGGTGCAG ATTGAAGCGC GCGTAACGGT GGATCGGTCC GGCCAAGTTT TCTTGCCCAA AGTTGGCGCT ATTTCGGTGG CGGGAGTGCA CTACGCGGAT TTACGCAGGC ATCTGCAAGC GTCGGTGCAG CGCGTATTCC ACGACTTTGA ACTTGAGGTG TCGCTCGGGC AGTTGCGCAG CATGCAGATA TTTGTCGTCG GGCAAGCGCG GCAGGCGGGC ACCTACACGG TAAGTTCGCT GAGCACTGCG GTAACGGCAG CGTTTGCGAC CGGTGGGCCA TCGCCGCAAG GATCACTCCG AAAGATCGAG GTGCGACGCG GGGGGGCAAC GATCACGACC CTGGATCTGT ATGAGTTCTT GCTAAAGGGA GATAAGTCGC ACGATGTGGC GCTGCTGCCG GGGGACGTGA TCTACATTCC GCCGGTTGGG CAGATGGTGG CGATCTCGGG AAGCGTGAAT ACGCCGGGTA TCTACGAGCT GCTGCCCGGA ACGACACTCA GCGATGCGGT GGAACTTGCT GGCGGTCTGA CGGCGACGGC GGATGACGAG CGCGTGAGCG TGGAGAGGAT CGAGGAGCAC CGCACGCGGC GCGTGTTTGA AGTCGCGCCG AGTGTTGCGG AGCCGAGGTT CGTGCTGGAA AACGGCGATG TGGTGCGGGT GCTGGCGATT TCTAAAGTGA TAGAGGATGC GGTGACGTTG CGGGGTAACG TGGCGCGCCC GGGACGGTAT CGGTGGCATC CGCGGATGCG GATTCAAGAC CTCATTCCTA ACCGTGAGTT CCTGCTTACG CCGGAATACT GGAGCCACAA GAACGCACTC GTCCGTGAGG GAGTCGGGGA AAGCAAGAAC GCGGAAGAAC GAACACTTAC GGAACTGAAG CAGAATGCGC CGCCTTTGAA TTGGGAGTAT GCGGCGGTGG AAAGGAGCGA TCCTGAGCGA TTGAATTCGG AAGTCTTGCC GTTCAACTTG GGGGGCGCGA TTGATGGCGA CAGCGAGGCG AATCTTCTGT TGAAAGCCGG GGATGTGGTC ACCATCTTCT CGGAACGCGA CATCCAGGTT TCGAACGCGA AGCGCACGAA ACTGGTGCGG CTCGAAGGTG AATTTGCGGC TCCTGGTGTC TATCGGGCCG AGCCGGGCGA GACGCTGCGA GCGCTTGTGG GCCGGGTTGG ATTGACGCCG GACAGCTATC TGTTTGGTGC AGAGTTCACG CGCGAGAGCA CACGGATGCA GCAACAACAA GGGCTCGACC GGCTGATTTC CGAGATGGAG AACGGGATCC AGGAGATCCA TACGCAACGC GCCGATGCCA ACGATGGCGC TGCAGTCCAG GAGCGCGAGG CGCAGGAAGC TAGACGGGGA CTGGTCGCAA GAGTCAAGAG TCTGCGCGCG AGCGGAAGGA TTGTGCTTGC GCTTCCGCCT GCGGCGCACG ATGTTCGGGA GATCCCGGAG ATTGTGCTCG AGGACGGTGA CCGATTCGTG GTGCCGCACA CGCCCGCCAC GGTCAGCGTG ACCGGAGAGG TCTTCAATCA GGGGGCATTC CTGTTCAATC GGAACCTGAA AGTTCGCGAC TACTTGCGCG ACGCGGGAGG CGGCACGCGA AATGCCGATG CTTCGCGGAT ATTTGTGCTT CGCGCCGATG GGACCGTGGT GAGTCGTCAG CATGAAGGGG GCCGGTTCGA TGGCCTGCCG GTTTATGCGG GGGACACGAT CATTAGTCCG GTGCGGCTGG AAAAGGGAAA CTTCATGCGT GGTCTGCGCG ACTGGTCGCA GGTGATCTCA CAGTTCGCGC TGGGTGCGGC AGCGATCAAG GTGCTGGAAT GA
|
Protein sequence | MRTATRLQKH LAVAIFKGSL PKVIVVAVLL AIGASPQLLA QRAECGTANC AATVKVVDSK PTEKPNQIVT TETPDVVQVQ TSRPPKQVEK PSEFERYVEE SLGYALPLFG RKLFADPPST FAPLTEVPAP AEYVIGPGDE LVIRGWGQVQ IEARVTVDRS GQVFLPKVGA ISVAGVHYAD LRRHLQASVQ RVFHDFELEV SLGQLRSMQI FVVGQARQAG TYTVSSLSTA VTAAFATGGP SPQGSLRKIE VRRGGATITT LDLYEFLLKG DKSHDVALLP GDVIYIPPVG QMVAISGSVN TPGIYELLPG TTLSDAVELA GGLTATADDE RVSVERIEEH RTRRVFEVAP SVAEPRFVLE NGDVVRVLAI SKVIEDAVTL RGNVARPGRY RWHPRMRIQD LIPNREFLLT PEYWSHKNAL VREGVGESKN AEERTLTELK QNAPPLNWEY AAVERSDPER LNSEVLPFNL GGAIDGDSEA NLLLKAGDVV TIFSERDIQV SNAKRTKLVR LEGEFAAPGV YRAEPGETLR ALVGRVGLTP DSYLFGAEFT RESTRMQQQQ GLDRLISEME NGIQEIHTQR ADANDGAAVQ EREAQEARRG LVARVKSLRA SGRIVLALPP AAHDVREIPE IVLEDGDRFV VPHTPATVSV TGEVFNQGAF LFNRNLKVRD YLRDAGGGTR NADASRIFVL RADGTVVSRQ HEGGRFDGLP VYAGDTIISP VRLEKGNFMR GLRDWSQVIS QFALGAAAIK VLE
|
| |