Gene Information |
Locus tag | Acid345_0834 |
Symbol | |
ID | 4072360 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1035676 |
End bp | 1036605 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637982843 |
Product | polysaccharide export protein |
Protein accession | YP_589913 |
Protein GI | 94967865 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1596] Periplasmic protein involved in polysaccharide export |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAGAATT GGTTGTTGAT CGGGATTGTG GCCCTCAGTT CCTGGACGTT TGCCCAGACT GATGCCCATC CAACTGCCTC GTCGAATGAG TTTCCAACGC GAGGGCAACA CGTTAAGACG CTGTTGATCG GGCCTGGCGA CCTATTGGAT TTGAACGTCT ATGACGTGCC GGAACTCATC CTCAAAGTTC GTGTTGACGA CAACGGGGAT GTGTCGATTC CGTTGATCGG TCAGCAGCAT TGGGGCGGCC TCACGACTGT GCAGGCGGCT GACCTTGTCA CAAAAAAGCT CATTGAGGGT GATTTTGTGA AGAACCCCGA GGTATCGATT CTGGTGGACG AGTTCGCGAC ACAAGGGATA TCCATCTCCG GCGAGGTGAA TCAGCCCGGT ATTTATCCGC TGCTCGGGCC TCATCGCCTT AACGATGCGA TCTCGGCGGC CGGTGGCCTC TCGCCCCGAG CCGGACGGAC CGTAACTATT GTCCATCGCG CCCACGTGGA CGAGCCGGTG ATCATCGATC TCCCAAATTC GCGCAACATG GTGGAAGCCA ATGTCGAGTT GGAGCCGGGT GATTCCATAC TCGTATCGAA GGCCGGTGTC GTCTATGTCA TGGGAGAAGT TATTCGTCCA GGCGCCTTTC TCATGGAAAA CAACACACGA ATGACGATCC TCCAGGCCGT CACGATGGCT CAAGGTCCAA CCAACATTGC GGCGCTTGGC GGTACGAGAA TCGTGCGCAA AACTCCTCAG GGTGTGCAGC AGTTTCCGGT CGCCCTCGAC AAGATAACCA AGGGTGTAAT TCCTGACCGA CTTCTCGATG CCGACGATAT TGTCGTCTTG CCGAAGAGTG GCGTTAAGAT CGCTGGCCAG ATTACGGCCC GCTCCGCCGT GGCTGCAGCT GCGGCCCTTG CTGTGTACGC GATACGTTGA
Protein sequence | MKNWLLIGIV ALSSWTFAQT DAHPTASSNE FPTRGQHVKT LLIGPGDLLD LNVYDVPELI LKVRVDDNGD VSIPLIGQQH WGGLTTVQAA DLVTKKLIEG DFVKNPEVSI LVDEFATQGI SISGEVNQPG IYPLLGPHRL NDAISAAGGL SPRAGRTVTI VHRAHVDEPV IIDLPNSRNM VEANVELEPG DSILVSKAGV VYVMGEVIRP GAFLMENNTR MTILQAVTMA QGPTNIAALG GTRIVRKTPQ GVQQFPVALD KITKGVIPDR LLDADDIVVL PKSGVKIAGQ ITARSAVAAA AALAVYAIR
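The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is complete, ending in a single stop codon (the gene sequence above ends in TGA). The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the record above.
START_BP = 1035676
END_BP = 1036605
GENE_LENGTH_BP = 930
PROTEIN_LENGTH_AA = 309


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: codon count minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring the spaces used for grouping."""
    bases = sequence.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    # Compare the computed values with the fields stated in the record.
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Passing the full Gene sequence string to `gc_percent` gives a value that can be compared with the GC content field; the spaces between the 10-base groups are stripped before counting.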