Gene Acid345_2810 details


Gene Information

Locus tag: Acid345_2810
Symbol:
ID: 4071813
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3331566
End bp: 3334442
Gene Length: 2877 bp
Protein Length: 958 aa
Translation table: 11
GC content: 60%
IMG OID: 637984828
Product: bifunctional transaldolase/phosphoglucose isomerase
Protein accession: YP_591885
Protein GI: 94969837
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0166] Glucose-6-phosphate isomerase; [COG0176] Transaldolase
TIGRFAM ID: [TIGR00876] transaldolase, mycobacterial type
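
The length fields in this record agree with each other if the coordinates are read as 1-based and inclusive, and if the final codon of the gene is a stop codon that is not counted in the protein length. Both readings are assumptions, not something the record states. A minimal Python sketch of that check, with illustrative variable names:

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3331566
end_bp = 3334442

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the coding sequence ends with one stop codon,
# which does not contribute a residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, gene_length % 3, protein_length)
```

Under these assumptions the computed values match the Gene Length and Protein Length fields.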


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0413323
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAATC CGTTGATCGA ACTGCACTCT TTCGGACAGA GCGTTTGGCT GGACCAAATC 
GAACGCGCCC TCTTCAAGAC GGGCAAACTG GCAAAGCTGA TTAAAGAAGA CGGCCTGCGC
GGCATGACGT CGAACCCAAC AATTTTCGAG AAAGCGATCA CCGGCTCCAG CGATTACCAG
GAGCAGATCG ATCGCGCCGC CCGCGACGGC AAAACTGGCA ACGAGATCTA TGAAGAAGTC
GTGATCGACG ATATCGCCCA CGCCGCCGAC CTCTTTCGCC CGCTCTACGA CAGTACCAAC
GGCGAAGACG GCTTCGTGAG CCTGGAAGTT TCTCCGCTGC TGGCGAAGAA CACCGATGGC
ACCATTCGCG AGGCGAAGAC GCTCTTCAGC CGGTTGAATC GGCCGAACGT GATGATCAAG
GTTCCGGCGA CGGAAGAAGG CCTGCCGGCG ATCGAAGAAC TCATCGCGTC GGGCTTGAAC
ATCAACGTGA CCCTGATCTT CTCTGTGCAC CGTTACGAAG AAGTGGCCGA AGCCTACATC
CGCGGACTGG AACGCCGCGC TCAGGCGGGC CAACCGATCG ATCGCATCGG TTCGGTGGCG
AGCTTCTTCG TCAGCCGCAT CGATAGCGCG GTGGACAAGC AACTCGAGGC GCTGGAGAAA
GAAGCCACCG ATCCTGCGAA GAAGCAGGAA ATCCATGCGC TCCGCGGCAA AGCTGCGATT
GCCAACGCGA AGCTGGCGTA TGCCTCGTTC AAGCGCATCT TTGAGAGCCC GCGTTTCGAG
GCGCTCAAGC GCAAAGGCGC GCGCGTACAG CGTCCTCTGT GGGCGTCGAC CAGCACCAAG
GACCCGAGCT ATCCCGATGT TCTGTACGTG ACCGAGTTGA TCGGGCCGCA CACCGTGAAC
ACGCTGCCTC CGGCGACGGT GGATGCGTGG CGCGATCACG GCGTCGCGGG CGCACACCTC
GAAAAGGACA TGGACAAGGC ACCAGATGTC TTCGCCAAGC TGAAGGCGCT CGGCATCGAC
TTCAACAAGG TCACAGATAA GCTGACGACA GACGGCGTGC GCTCGTTCTC CGCATCATTC
GTGGACCTGA TGCGTGCCAT AGAACAGCGC CGTGAGATGA GCCTGCCGGG AATCAAGGAA
CGCCACGTTT CCGCACTCGG CAAGTACGAA GGCGATGTGC AGGCGGCATT GCAGGAACTC
GGATCGAAGA ACGTCATCCA GCGCTTCTGG AACAAGGAAG CCGCGGTGTG GAGCGCCGAT
GCTGGTGATC AGAAGATCAT CAACAACGCG CTGGGCTGGC TGACCGTCAC CGACCTGATG
CAGGGGAAAG TAAAAGAGTT GAAAGCCTTC GCCGCTGAAG TGCAGGCGGC GGGCTTCAAA
CATGCCGTTG TACTCGGCAT GGGTGGCTCC AGTCTCTGCC CGGAAGTTTT GCGGCAGACC
TTCGGCAAAC AGCTCGATTA TCCCGAACTC CTTGTGCTCG ACTCCACCGT TCCCGCTGCC
GTGCTCGCAA TCGACAAGCA GATCGATCCG GCAAAGACCC TGTTCATCGT GGCCTCGAAG
TCGGGCTCTA CGACAGAGCC GCAGATGTTC TATCGCTATT ACTTCGAGAA AACGAAGCAG
GTGCTCGGCG ACAAAGCGGG ACAGAATTTC GTCGCCATCA CCGATCCGAA TACGCAACTC
GAGAGTGAAG CTAAGCGCGA TGGTTTCCGC AAGGTCTTCA CCAATATGGC CGACATCGGC
GGGCGCTACT CCGCCCTTTC GTACTTCGGC ATGGTGCCCT TCACGGTGAT GGGTGGCGAT
GTGGACTCGC TGCTGCGCCG CGCCAAAGCC GCAATGGATG CCTGCGCTGC GGGTGTTGAA
CCGGCGAACA ACGCCGGCGC GAAGATCGGC GCGATTCTCG GCGCCCTCGC GCGCAAGGGC
CGCGACAAAG TCACTTTCGT TACGCCGCCG CCCATCAGCT CGCTCGGGCT GTGGATCGAG
CAGTTGATTG CCGAGAGCAC CGGCAAGCAC GGTAAGGGCA TAGTGCCGAT CTCAGGCGAA
TCGCTTGGCG ATCCAAAGGT CTACGGTGAT GATCGCGTTT TCGTGTACAT CGGCGTCTCG
GGAACGAATG GCGCCAACTA CGAAGCGCAA CTCCAGGCGC TGGAGCAGGC CGGGCATCCG
GTGTTGCACC ACGTGCTGAA CAGCCCGATT GATCTTGGCG AAGAGTTCTT CCTCTGGGAA
TTCGCGACAC CCATCGCCGG CGAGTTGATC GGGATCAATC CGTTCGACCA GCCGAACGTG
CAGGAGTCGA AGGACAACAC CAAGCGCATC CTGAAGGAAT ACACCGACAC CGGCAAAATT
ACGCAGTTGC CCGAGGTAGC CGAAGGCGAC GGCCTGACCG TGCTGACCGA CGAGAACAAT
CGCAAGGCTC TGAACGGCGT TTCAACGCCG GATTCGGCGA TCACCGCGCA TCTGGGCCGC
GTGCAGAAGG GCGACTACTT CGCGATCACG CAGTACATCG AGGAGACGCC GGAGATCGAG
TCGCTGGTGC AGCAGATTCG TACCGCTGTC CGCGATAAGG CTTGCGTGGC GACGACAACT
GGGTATGGTC CTCGCTTCCT GCATTCGACT GGGCAACTGC ACAAAGGCGG TCCGGACAGC
GGCGTCTTCC TGCAACTGAT CTCCAACGAC GCGCAAGATG TTCCTCTGCC GGGCGAGAAA
TTCACCTTCG GCGTGTTGAA AGACGCGCAG GCACTGGGCG ATTTCGAGTC GCTGTCGAGC
CGCGGACGAC GCGCGATTCG CGTGAACTTG GGCAACAACA TCGTCGGCGG GTTGAAGAAG
ATACTCGCGG CGGTGCAGCA GTTTGAAGGC GCGACAGCGG GAGCTGCGCG CAAGTAA
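
The gene sequence is printed in space-separated groups of ten bases, so it has to be joined before any computation. The sketch below is one way to do that and to compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first printed line of the sequence is used here as sample input, so the result is not the whole-gene GC content reported in the record.

```python
def clean_sequence(block: str) -> str:
    """Remove all whitespace from a grouped sequence block and upper-case it."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First printed line of the gene sequence, used only as sample input.
first_line = "ATGAGCAATC CGTTGATCGA ACTGCACTCT TTCGGACAGA GCGTTTGGCT GGACCAAATC"
fragment = clean_sequence(first_line)
print(len(fragment), gc_fraction(fragment))
```

Passing the full sequence block to `clean_sequence` in the same way would give the input needed to compare against the GC content field.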
 
Protein sequence
MSNPLIELHS FGQSVWLDQI ERALFKTGKL AKLIKEDGLR GMTSNPTIFE KAITGSSDYQ 
EQIDRAARDG KTGNEIYEEV VIDDIAHAAD LFRPLYDSTN GEDGFVSLEV SPLLAKNTDG
TIREAKTLFS RLNRPNVMIK VPATEEGLPA IEELIASGLN INVTLIFSVH RYEEVAEAYI
RGLERRAQAG QPIDRIGSVA SFFVSRIDSA VDKQLEALEK EATDPAKKQE IHALRGKAAI
ANAKLAYASF KRIFESPRFE ALKRKGARVQ RPLWASTSTK DPSYPDVLYV TELIGPHTVN
TLPPATVDAW RDHGVAGAHL EKDMDKAPDV FAKLKALGID FNKVTDKLTT DGVRSFSASF
VDLMRAIEQR REMSLPGIKE RHVSALGKYE GDVQAALQEL GSKNVIQRFW NKEAAVWSAD
AGDQKIINNA LGWLTVTDLM QGKVKELKAF AAEVQAAGFK HAVVLGMGGS SLCPEVLRQT
FGKQLDYPEL LVLDSTVPAA VLAIDKQIDP AKTLFIVASK SGSTTEPQMF YRYYFEKTKQ
VLGDKAGQNF VAITDPNTQL ESEAKRDGFR KVFTNMADIG GRYSALSYFG MVPFTVMGGD
VDSLLRRAKA AMDACAAGVE PANNAGAKIG AILGALARKG RDKVTFVTPP PISSLGLWIE
QLIAESTGKH GKGIVPISGE SLGDPKVYGD DRVFVYIGVS GTNGANYEAQ LQALEQAGHP
VLHHVLNSPI DLGEEFFLWE FATPIAGELI GINPFDQPNV QESKDNTKRI LKEYTDTGKI
TQLPEVAEGD GLTVLTDENN RKALNGVSTP DSAITAHLGR VQKGDYFAIT QYIEETPEIE
SLVQQIRTAV RDKACVATTT GYGPRFLHST GQLHKGGPDS GVFLQLISND AQDVPLPGEK
FTFGVLKDAQ ALGDFESLSS RGRRAIRVNL GNNIVGGLKK ILAAVQQFEG ATAGAARK
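
As a consistency check between the two sequences, the sketch below translates the first 60 nucleotides of the gene sequence and compares the result with the first 20 residues of the protein sequence. It assumes that the codon assignments of translation table 11 are the same as those of the standard genetic code (the two tables differ in which start codons are permitted). The function name `translate` and the variable names are illustrative.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: translation table 11 uses these same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    residues = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First printed line of the gene sequence and the first 20 protein residues.
gene_start = "ATGAGCAATC CGTTGATCGA ACTGCACTCT TTCGGACAGA GCGTTTGGCT GGACCAAATC"
protein_start = "MSNPLIELHS FGQSVWLDQI".replace(" ", "")

print(translate(gene_start) == protein_start)
```

The same function can be applied to the full gene sequence to compare against the full protein sequence.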