Gene Information |
Locus tag | Acid345_1677 |
Symbol | |
ID | 4069345 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 2028162 |
End bp | 2029388 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637983685 |
Product | Alpha-galactosidase |
Protein accession | YP_590752 |
Protein GI | 94968704 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3345] Alpha-galactosidase |
TIGRFAM ID | |
| |
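The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the stop codon. Both assumptions are consistent with the listed values but are not stated by the record. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 2028162
END_BP = 2029388


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1227
print(protein_length(length_bp))  # 408
```

Both results agree with the Gene Length (1227 bp) and Protein Length (408 aa) fields. The strand does not affect the length calculation; it only determines which end of the interval is the 5' end of the gene.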
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.343798 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCGGAGGC CCGGCCCGCA GATGAAACGC TTCGTCGCAT CCATCCCTCT CGTAATAATT TTTCTTAGCA TCATTGCTGC CGCGCAAGAA TCCCCAAAGC CGCCGGACAT CAACGCCTAC GCGCCAACCC CGCCGATGGG ATGGAACACG TGGAACAAGT TCGCCTGCAA TGTTGACGAG AAACTCATTC GTGGCGCCGC CGATGCACTC GTCTCCTCTG GCATGAAAGA CGCTGGTTAC CATTACCTCG TCATTGACGA CTGCTGGCAA GTCAGCCGCG ACGGGAAAGG CAACATCGTT GCCGATCCAC AACGATTCCC TTCCGGCATC AAGGCCCTCG CCGCGTACGT CCACCACAAA GGACTGAAAT TCGGTATCTA CTCCGACATC GGCACCAAGA CTTGCCAAGG CCGGCCCGGA AGCCGGGGCC ATGAATTTCA AGACGCCCTG CAATACGCCG CATGGGATGT CGACTATCTC AAACTCGATT GGTGCAACAC TGGCACCCAG AACGCCCCAG CCTCGTACAA AACCATGCGC GATGCGATTG ATGCCACGGG CCGTCCCATC GTTCTAAGCA TCTGCGAATG GGGCAAGAGC AAGCCGTGGC TCTGGGCCAA AGACACCGGT AATCTCTGGC GCACCACCGA CGACATTCAG GATCGCTGGG CCGGCAAGAA AAAATGGGAT GAAGCCTCAT GCTGCTCCAA CGGCATGCTC GACATTCTCG ATGAAGAAGC CGACATCGCC AGTTACGCCG GTCCCGGACA CTGGAACGAT CCCGACATGC TCGAGGTCGG CAACGGCGGC ATGACCAACG TCGAATATCG TTCCCACTTC AGCCTATGGG CAATCCTCGC TGCGCCTCTC ATGGCCGGCA ACGATCTCAC CAACATGCGC CCCGAAATTC GCGAAATCTT GACCAACAAA GAAGTGATCG CCGTAGATCA AGACCCGCTC GGCAAGCAGG GCGTTCGCGC AGCGAAGAAC GGCGATCTCG AAATCTGGAG CAAAGTGCTT GCCGACGGAT CTCGCGCCGC TGTCCTCCTC AACCGCAGTG CATCGGAACA AGAGATCACC GCGCGATGGA TCGACCTCGG TTATCCCGCG ACACTCTCCG TGAGGGTCCG AGACCTCTGG GCGCACAAAG ATCTCGGTTC TTTCAAAGAC AAGTTTTCCG CCAAAGTTCC ATCGCACGGT GTCGTTATGA TTCACCTGCA ACCGTAA
|
Protein sequence | MRRPGPQMKR FVASIPLVII FLSIIAAAQE SPKPPDINAY APTPPMGWNT WNKFACNVDE KLIRGAADAL VSSGMKDAGY HYLVIDDCWQ VSRDGKGNIV ADPQRFPSGI KALAAYVHHK GLKFGIYSDI GTKTCQGRPG SRGHEFQDAL QYAAWDVDYL KLDWCNTGTQ NAPASYKTMR DAIDATGRPI VLSICEWGKS KPWLWAKDTG NLWRTTDDIQ DRWAGKKKWD EASCCSNGML DILDEEADIA SYAGPGHWND PDMLEVGNGG MTNVEYRSHF SLWAILAAPL MAGNDLTNMR PEIREILTNK EVIAVDQDPL GKQGVRAAKN GDLEIWSKVL ADGSRAAVLL NRSASEQEIT ARWIDLGYPA TLSVRVRDLW AHKDLGSFKD KFSAKVPSHG VVMIHLQP
|
| |
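The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in plain Python. This is a minimal illustration, not the database's own pipeline. It assumes the gene sequence is printed in the coding direction (its first codons do translate to the start of the listed protein) and that the first codon is read as methionine when it is one of the initiation codons of translation table 11, which includes TTG. The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in the conventional T, C, A, G codon order; table 11 uses the
# same assignments as the standard code and differs only in start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons of translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-direction DNA sequence up to the first stop codon."""
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        amino_acid = CODON_TABLE.get(codon, "X")  # X for ambiguous codons
        if amino_acid == "*":
            break
        if pos == 0 and codon in START_CODONS:
            amino_acid = "M"  # an initiation codon is read as methionine
        residues.append(amino_acid)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence above.
PREFIX = "TTGCGGAGGC CCGGCCCGCA GATGAAACGC"
print(translate(PREFIX))  # MRRPGPQMKR
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the GC content field.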