Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3249 |
Symbol | |
ID | 4072584 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 3848975 |
End bp | 3850312 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637985270 |
Product | glycoside hydrolase, clan GH-D |
Protein accession | YP_592324 |
Protein GI | 94970276 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3345] Alpha-galactosidase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.062257 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACGAG CGATATTTGT CGCACTCATT TTCTCCGCTG TTTCGTGCGC GCAGACAGAG CTTGCCCCGA CACCGCCTAT GGGGTGGAAC AGCTGGAACT CCTACGGGCA GACGATCTCG GAGGCCGACG TGCGCGCCAC TGCCGATGTG ATGGCGCAGA AGCTGAAAGC CTTCGGATGG CAGTACGTGG TGATTGATGA AGGCTGGTAT GTGCTGAAGC CGGGCGGCGA CGCGAAGCTG TACCAGTTCC GGATGGATGA GAACGGGCGC TTTCTTCCGG CGGAGAGCCG CTTCCCTTCC GCGAAGGACG ACGGCATGAA GGCGATTGCG GATTACGTAC ATTCGCTCGG GCTGAAGTTT GGGATCCATG TGATTCGCGG GATCCCGCGC GAGGCGGTGG CGAAGAATTT GCCGATCGCG GGGAGTTCGT TTCATGCGGC GGAGGCGGCG GATACGAGCG ATGTGTGTCC GTGGAATGCG TACAACTACG GGCTGAAGGA CAACGCCGCG GGGCAGGCCT ATTACGACAG CATTGCGAAG CTGTATGCGA GCTGGGGCGT GGACTTCATC AAGATTGATT GCATTTCGTC GCACCCATAC AAGGGCGCGG AGATACGGAT GATCAGCGAG GCGCTGCATA AGAGCGGGCG TGCGATGGTG CTGAGCCTGT CGCCGGGGCC GACGCCGCTG GAGAAGGCCG ATGAGGTCGC GAAGTACGCG GAGATGTGGC GCATCTCGGA TGACTTCTGG GACCACTGGG GCGTGTGGAA GGGACATGAG TGGTCTCAAG GATTGAAGGC GCAATTCGAG AATGCGGCGA AGTGGAGCGC GCATCAGACG CCGGGACATC GGCCCGACAA TGACATGCTG CCGTTGGGAT ATCTGGGGCC GCATCCGGGT GAAGGCGAGG TCAGGAATAC GCAGTTCACG GCGGATGAGC AGAAGACGCT GATGACGCTG TGGGCGATTT CGCGGTCGCC GCTGATGATG GGCGGGAACT TGACGAAGAT GGATGAGGCG ACGCTGGGAG TGCTGACGAA TAAGTCCGTC ATAGCGCTTG ACCAGCAGTC TCACAACGGC AAGCAAGTGA TCAACGATGG GAATACTGTG GTTTGGTTCG CGGAGGGCTT GTCTCCGATC GCTGGATACG TTGCTGTGTT TAACATTTCT GATCGCGAGC AGTCCGTGAA GTATTCGTGG AAGGAACTTG GTTTAGCGGA GGGAGCACGC GTTGCGCGAG ATGTTTGGGC GGAACGTTCT TTGGGGAAAC AGGAGGGCGT TAACCTGAAG CTTGCGCCCC ACGCGTGTGT GCTGTACCAG CTGTCATTGT TGAAATAG
|
Protein sequence | MKRAIFVALI FSAVSCAQTE LAPTPPMGWN SWNSYGQTIS EADVRATADV MAQKLKAFGW QYVVIDEGWY VLKPGGDAKL YQFRMDENGR FLPAESRFPS AKDDGMKAIA DYVHSLGLKF GIHVIRGIPR EAVAKNLPIA GSSFHAAEAA DTSDVCPWNA YNYGLKDNAA GQAYYDSIAK LYASWGVDFI KIDCISSHPY KGAEIRMISE ALHKSGRAMV LSLSPGPTPL EKADEVAKYA EMWRISDDFW DHWGVWKGHE WSQGLKAQFE NAAKWSAHQT PGHRPDNDML PLGYLGPHPG EGEVRNTQFT ADEQKTLMTL WAISRSPLMM GGNLTKMDEA TLGVLTNKSV IALDQQSHNG KQVINDGNTV VWFAEGLSPI AGYVAVFNIS DREQSVKYSW KELGLAEGAR VARDVWAERS LGKQEGVNLK LAPHACVLYQ LSLLK
|
| |