Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2810 |
Symbol | |
ID | 4071813 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 3331566 |
End bp | 3334442 |
Gene Length | 2877 bp |
Protein Length | 958 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637984828 |
Product | bifunctional transaldolase/phosoglucose isomerase |
Protein accession | YP_591885 |
Protein GI | 94969837 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0166] Glucose-6-phosphate isomerase [COG0176] Transaldolase |
TIGRFAM ID | [TIGR00876] transaldolase, mycobacterial type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0413323 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAATC CGTTGATCGA ACTGCACTCT TTCGGACAGA GCGTTTGGCT GGACCAAATC GAACGCGCCC TCTTCAAGAC GGGCAAACTG GCAAAGCTGA TTAAAGAAGA CGGCCTGCGC GGCATGACGT CGAACCCAAC AATTTTCGAG AAAGCGATCA CCGGCTCCAG CGATTACCAG GAGCAGATCG ATCGCGCCGC CCGCGACGGC AAAACTGGCA ACGAGATCTA TGAAGAAGTC GTGATCGACG ATATCGCCCA CGCCGCCGAC CTCTTTCGCC CGCTCTACGA CAGTACCAAC GGCGAAGACG GCTTCGTGAG CCTGGAAGTT TCTCCGCTGC TGGCGAAGAA CACCGATGGC ACCATTCGCG AGGCGAAGAC GCTCTTCAGC CGGTTGAATC GGCCGAACGT GATGATCAAG GTTCCGGCGA CGGAAGAAGG CCTGCCGGCG ATCGAAGAAC TCATCGCGTC GGGCTTGAAC ATCAACGTGA CCCTGATCTT CTCTGTGCAC CGTTACGAAG AAGTGGCCGA AGCCTACATC CGCGGACTGG AACGCCGCGC TCAGGCGGGC CAACCGATCG ATCGCATCGG TTCGGTGGCG AGCTTCTTCG TCAGCCGCAT CGATAGCGCG GTGGACAAGC AACTCGAGGC GCTGGAGAAA GAAGCCACCG ATCCTGCGAA GAAGCAGGAA ATCCATGCGC TCCGCGGCAA AGCTGCGATT GCCAACGCGA AGCTGGCGTA TGCCTCGTTC AAGCGCATCT TTGAGAGCCC GCGTTTCGAG GCGCTCAAGC GCAAAGGCGC GCGCGTACAG CGTCCTCTGT GGGCGTCGAC CAGCACCAAG GACCCGAGCT ATCCCGATGT TCTGTACGTG ACCGAGTTGA TCGGGCCGCA CACCGTGAAC ACGCTGCCTC CGGCGACGGT GGATGCGTGG CGCGATCACG GCGTCGCGGG CGCACACCTC GAAAAGGACA TGGACAAGGC ACCAGATGTC TTCGCCAAGC TGAAGGCGCT CGGCATCGAC TTCAACAAGG TCACAGATAA GCTGACGACA GACGGCGTGC GCTCGTTCTC CGCATCATTC GTGGACCTGA TGCGTGCCAT AGAACAGCGC CGTGAGATGA GCCTGCCGGG AATCAAGGAA CGCCACGTTT CCGCACTCGG CAAGTACGAA GGCGATGTGC AGGCGGCATT GCAGGAACTC GGATCGAAGA ACGTCATCCA GCGCTTCTGG AACAAGGAAG CCGCGGTGTG GAGCGCCGAT GCTGGTGATC AGAAGATCAT CAACAACGCG CTGGGCTGGC TGACCGTCAC CGACCTGATG CAGGGGAAAG TAAAAGAGTT GAAAGCCTTC GCCGCTGAAG TGCAGGCGGC GGGCTTCAAA CATGCCGTTG TACTCGGCAT GGGTGGCTCC AGTCTCTGCC CGGAAGTTTT GCGGCAGACC TTCGGCAAAC AGCTCGATTA TCCCGAACTC CTTGTGCTCG ACTCCACCGT TCCCGCTGCC GTGCTCGCAA TCGACAAGCA GATCGATCCG GCAAAGACCC TGTTCATCGT GGCCTCGAAG TCGGGCTCTA CGACAGAGCC GCAGATGTTC TATCGCTATT ACTTCGAGAA AACGAAGCAG GTGCTCGGCG ACAAAGCGGG ACAGAATTTC GTCGCCATCA CCGATCCGAA TACGCAACTC GAGAGTGAAG CTAAGCGCGA TGGTTTCCGC AAGGTCTTCA CCAATATGGC CGACATCGGC GGGCGCTACT CCGCCCTTTC GTACTTCGGC ATGGTGCCCT TCACGGTGAT GGGTGGCGAT GTGGACTCGC TGCTGCGCCG CGCCAAAGCC GCAATGGATG CCTGCGCTGC GGGTGTTGAA CCGGCGAACA ACGCCGGCGC GAAGATCGGC GCGATTCTCG GCGCCCTCGC GCGCAAGGGC CGCGACAAAG TCACTTTCGT TACGCCGCCG CCCATCAGCT CGCTCGGGCT GTGGATCGAG CAGTTGATTG CCGAGAGCAC CGGCAAGCAC GGTAAGGGCA TAGTGCCGAT CTCAGGCGAA TCGCTTGGCG ATCCAAAGGT CTACGGTGAT GATCGCGTTT TCGTGTACAT CGGCGTCTCG GGAACGAATG GCGCCAACTA CGAAGCGCAA CTCCAGGCGC TGGAGCAGGC CGGGCATCCG GTGTTGCACC ACGTGCTGAA CAGCCCGATT GATCTTGGCG AAGAGTTCTT CCTCTGGGAA TTCGCGACAC CCATCGCCGG CGAGTTGATC GGGATCAATC CGTTCGACCA GCCGAACGTG CAGGAGTCGA AGGACAACAC CAAGCGCATC CTGAAGGAAT ACACCGACAC CGGCAAAATT ACGCAGTTGC CCGAGGTAGC CGAAGGCGAC GGCCTGACCG TGCTGACCGA CGAGAACAAT CGCAAGGCTC TGAACGGCGT TTCAACGCCG GATTCGGCGA TCACCGCGCA TCTGGGCCGC GTGCAGAAGG GCGACTACTT CGCGATCACG CAGTACATCG AGGAGACGCC GGAGATCGAG TCGCTGGTGC AGCAGATTCG TACCGCTGTC CGCGATAAGG CTTGCGTGGC GACGACAACT GGGTATGGTC CTCGCTTCCT GCATTCGACT GGGCAACTGC ACAAAGGCGG TCCGGACAGC GGCGTCTTCC TGCAACTGAT CTCCAACGAC GCGCAAGATG TTCCTCTGCC GGGCGAGAAA TTCACCTTCG GCGTGTTGAA AGACGCGCAG GCACTGGGCG ATTTCGAGTC GCTGTCGAGC CGCGGACGAC GCGCGATTCG CGTGAACTTG GGCAACAACA TCGTCGGCGG GTTGAAGAAG ATACTCGCGG CGGTGCAGCA GTTTGAAGGC GCGACAGCGG GAGCTGCGCG CAAGTAA
|
Protein sequence | MSNPLIELHS FGQSVWLDQI ERALFKTGKL AKLIKEDGLR GMTSNPTIFE KAITGSSDYQ EQIDRAARDG KTGNEIYEEV VIDDIAHAAD LFRPLYDSTN GEDGFVSLEV SPLLAKNTDG TIREAKTLFS RLNRPNVMIK VPATEEGLPA IEELIASGLN INVTLIFSVH RYEEVAEAYI RGLERRAQAG QPIDRIGSVA SFFVSRIDSA VDKQLEALEK EATDPAKKQE IHALRGKAAI ANAKLAYASF KRIFESPRFE ALKRKGARVQ RPLWASTSTK DPSYPDVLYV TELIGPHTVN TLPPATVDAW RDHGVAGAHL EKDMDKAPDV FAKLKALGID FNKVTDKLTT DGVRSFSASF VDLMRAIEQR REMSLPGIKE RHVSALGKYE GDVQAALQEL GSKNVIQRFW NKEAAVWSAD AGDQKIINNA LGWLTVTDLM QGKVKELKAF AAEVQAAGFK HAVVLGMGGS SLCPEVLRQT FGKQLDYPEL LVLDSTVPAA VLAIDKQIDP AKTLFIVASK SGSTTEPQMF YRYYFEKTKQ VLGDKAGQNF VAITDPNTQL ESEAKRDGFR KVFTNMADIG GRYSALSYFG MVPFTVMGGD VDSLLRRAKA AMDACAAGVE PANNAGAKIG AILGALARKG RDKVTFVTPP PISSLGLWIE QLIAESTGKH GKGIVPISGE SLGDPKVYGD DRVFVYIGVS GTNGANYEAQ LQALEQAGHP VLHHVLNSPI DLGEEFFLWE FATPIAGELI GINPFDQPNV QESKDNTKRI LKEYTDTGKI TQLPEVAEGD GLTVLTDENN RKALNGVSTP DSAITAHLGR VQKGDYFAIT QYIEETPEIE SLVQQIRTAV RDKACVATTT GYGPRFLHST GQLHKGGPDS GVFLQLISND AQDVPLPGEK FTFGVLKDAQ ALGDFESLSS RGRRAIRVNL GNNIVGGLKK ILAAVQQFEG ATAGAARK
|
| |