Gene Acid345_1786 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1786 
Symbol 
ID4072846 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2165433 
End bp2167643 
Gene Length2211 bp 
Protein Length736 aa 
Translation table11 
GC content61% 
IMG OID637983794 
Productdehydrogenase, E1 component 
Protein accessionYP_590861 
Protein GI94968813 
COG category[C] Energy production and conversion 
COG ID[COG0022] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit
[COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACGA CGAAGGCTGA GCCGAAGCCG GCGACTGCGA GCACGAAAGC CGTGAACAAG 
ACTTACGAAG GGCTGACCCG CGAGGACCTG CTTCGCGCCT ATCGGCTCAT GTACCTCTCG
CGCCGCATTG ACGACCGCGA GATCCTGCTC AAGCGCCAGC AGCGGGTGTT CTTCCAGATC
TCCGGCGCCG GCCACGAGGC GATGCTTGTC GCCGCGGGCC TCCTTCTCAA ACCGGGCTAT
GACTGGTTCT TCCCCTACTA CCGCGATCGC GCGCTCTGCC TCGCGCTCGG TATGACTGCC
GAAGAGATGT TGCTCGGCGC CGTCGGCGCA GCTGCCGATC CCAACTCCGG CGGACGCCAG
ATGCCTTCGC ACTGGGGACA CAAAGGCCTG AACATCGTCA CTGGATCTTC GCCGACCGGA
TCGCAAATTC TGCATGCGGT CGGCTGCGCT GAAGCCGGGC GATTGTTCAA TGCGCACCCG
GATTCCGCTG CGAAGGCCGA AGGCGATTAC CGCGAATTCA AAGACGTCGT CTTCCATGGC
GACGAAGTCA GCTACGTCTC CTGTGGCGAT GGAACCACCA GCCAGGGCGA GTTCTGGGAA
GCCCTGAGTT CGGCCTCGAA CAACAAGCTG CCGGTGCTCT TCGTCGTAGA AGACAACGGC
TACGCCATCT CAACGCCCGT CGAAGTGAAT ACTCCGGGCG GCAACATCAG CAAGGTCGTC
TCCGGCTTCC CGAACTTCCA CTTTGAAGAA TGCGACGGCA CCGAGGTTCT CGAGAGTTAT
CGGGCGTTCA AGCGTGCTAT CGACTACATC CGCGCCGGCA AAGGCCCCGC GTTCGTTCAC
GGCCACGTAA TTCGTCCGTA CTCGCACTCT CTGTCAGACG ACGAAAAACT CTATCGCCCG
GAAGCCGAGC GCAAAGACGA AGCCAATCGC GACCCGATCA CCAAGTTCTA CAAGTGGCTC
GTCGCCGAAA GCCTGGCGAC CGACAAGGAA TTAAAAGACC TGCAGACGGA CGTCGACACC
GAGGTCCAGG ACTCATCCGA TCGCGCCGTC GAAGCGCCCA TTCCCGCACT CGACAGCTAC
TCTCAGCATC TGTACTCGTC CACGCTCGAC CCGGCATCGG CGGCGTTTGA AACGCGCCCT
CAGTTCCCGG TCCACGTCGA GGGCGAAACT GCGGTCGCGC CCGCCAAGAC CATGGCCGAC
CTCATCAATG CCTGCCTGAA GGACGAGATG AAGCGCGACC CGCGCATCGT GATCTTCGGC
GAAGACGTTG CCGATTGCAG TCGCGAAGAA TATTTGAAGC AGAAGCAGGT GAAAGGGAAG
GGCGGCGTCT TCAAGCTAAC TTCCGGCTTG CAGATGGAGT ATGGCGCCGA CCGCGTCTTC
AATTCACCGC TTGCGGAAGC CAACATCGTT GGCCGCGCGA CAGGCATGGC CGTTCGCGGA
TTGAAGCCTG TGGTCGAGAT CCAGTTCTTC GATTACATCT GGCCCGCAAT GCACCAGCTT
CGCAACGAGC TCCCGGTTGT CCGCTGGCGC TCGAATGGCG CGTTCTCATC GCCGGCTGTA
ATACGTGTGG CCATCGGCGG TTATCTCACC GGAGGCGCGA TCTATCACTC GCAGTGCGGC
GAGAGCATCT TCACGCACAC GCCAGGCATG CGCGTGATCT TCCCATCCAA CGCGCTTGAC
GCCAACGGCC TGCTGCGCAC TGCCATTCGT TGCGATGATC CTGTGCTCTT CCTCGAGCAC
AAGCGGCTCT ATCGCGAAAC GTTTGGCCGC TCGCCTTATC CGGGACCGGA TTACATGGTC
CCGTTCGGCA AGGCGAAGAT CGTCAAAGCC GGACACGACA TCACCGTCGT GACCTATGGT
GCGGTCGTTC CGCGCGCGTT GCAGGCGGCG CAGAAGATCG AGCGCGAAAA CGGTGTCAGT
GTGGAACTCA TCGATCTGCG AACGCTCAAT CCGTACGACT TCGAAGCGAT CGCGGAATCA
ATCCACAAGA CAAACCGCGT GATCGTTGCG CACGAAGATA CGCTGAGCTG GGGCTACGGC
GCGGAGATCG CGGCCCGCAT CGCCGACGAA CTCTTCGACG AACTCGACGC GCCCGTCAAG
CGTGTCGCCG CAAAAGATAC GTTCGTTGCC TATCAGCCTG CATTGGAAGA CGTCATCCTT
CCGCAGTCAG ATGACCTCTT CGCAGCGATG CTGGAGATGA GCAAGTACTA G
 
Protein sequence
MATTKAEPKP ATASTKAVNK TYEGLTREDL LRAYRLMYLS RRIDDREILL KRQQRVFFQI 
SGAGHEAMLV AAGLLLKPGY DWFFPYYRDR ALCLALGMTA EEMLLGAVGA AADPNSGGRQ
MPSHWGHKGL NIVTGSSPTG SQILHAVGCA EAGRLFNAHP DSAAKAEGDY REFKDVVFHG
DEVSYVSCGD GTTSQGEFWE ALSSASNNKL PVLFVVEDNG YAISTPVEVN TPGGNISKVV
SGFPNFHFEE CDGTEVLESY RAFKRAIDYI RAGKGPAFVH GHVIRPYSHS LSDDEKLYRP
EAERKDEANR DPITKFYKWL VAESLATDKE LKDLQTDVDT EVQDSSDRAV EAPIPALDSY
SQHLYSSTLD PASAAFETRP QFPVHVEGET AVAPAKTMAD LINACLKDEM KRDPRIVIFG
EDVADCSREE YLKQKQVKGK GGVFKLTSGL QMEYGADRVF NSPLAEANIV GRATGMAVRG
LKPVVEIQFF DYIWPAMHQL RNELPVVRWR SNGAFSSPAV IRVAIGGYLT GGAIYHSQCG
ESIFTHTPGM RVIFPSNALD ANGLLRTAIR CDDPVLFLEH KRLYRETFGR SPYPGPDYMV
PFGKAKIVKA GHDITVVTYG AVVPRALQAA QKIERENGVS VELIDLRTLN PYDFEAIAES
IHKTNRVIVA HEDTLSWGYG AEIAARIADE LFDELDAPVK RVAAKDTFVA YQPALEDVIL
PQSDDLFAAM LEMSKY