Gene Acid345_4042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4042 
Symbol 
ID4072463 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4775613 
End bp4776659 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content60% 
IMG OID637986072 
Productisocitrate dehydrogenase (NAD+) 
Protein accessionYP_593116 
Protein GI94971068 
COG category[C] Energy production and conversion
[E] Amino acid transport and metabolism 
COG ID[COG0473] Isocitrate/isopropylmalate dehydrogenase 
TIGRFAM ID[TIGR00175] isocitrate dehydrogenase, NAD-dependent, mitochondrial type
[TIGR00183] isocitrate dehydrogenase, NADP-dependent, prokaryotic type
[TIGR02088] isopropylmalate/isohomocitrate dehydrogenases 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTACA AGATCACACT CATTCCCGGC GACGGCATCG GCCCTGAAGT CACCTCCGCT 
GCCGTGCGCG TCCTCGAAGC CACCGGACTC AAGTTCGAGT GGGAAAGCTT CGCCGCCGGC
GCCGAGGCCT ACGAAAAATA CAAGGAATAC ATCCCGAAAG AACTGAACGA ATCCATCGAG
CGCACCAGGA TCGGCCTCAA GGGTCCAGTC ACCACTCCGA TCGGTGGCGG ATTCTCCAGC
ATCAACGTTG AACTGCGCAA GCGCTTCGAG CTCTACGCGA ACGTCCGGCC AATCCGCAAT
CTTCCAGGCG TGCACACTCG TTATCCCGGC GTCGATCTCG TCGTGGTGCG CGAGAACACC
GAAGGCCTCT ACTCCGGCAT CGAGCACGAA GTTGTCCCCG GCGTAGTCGA GAGCCTGAAG
ATCATCACCG AGAAGGCCAG CACCCGCATC TCCAAGTTCG CGTTCAACTA CGCGCGCAAG
ATGGGCCGCA AGAAGATCCA CTCCATCCAC AAAGCCAACA TCATGAAGAT GTCCGATGGC
CTCTTCATCC GCTGCTCGCG CAACATCTCG AAGGAATATC CCGAGATCAT CTACGGCGAG
CACATTGTGG ACAACACCTG CATGCAACTG GTGATGAACC CCTACCAGTA CGACATCCTG
CTCCTCGAAA ATCTCTATGG CGACATTGTC AGTGACCTCT GCGCCGGATT AGTCGGCGGC
CTCGGCCTCG CTCCCGGCGC CAACATCGGC GAACGCGCGA GCATCTTTGA AGCCGTTCAC
GGCTCCGCTC CCGACATCGC GGGCAAGAAC ATCGCCAATC CCACGGCTGT CATCCGCAGC
GGCATCCTCA TGCTCCGCCA CCTCGACGAG CAGGACGCCG CCAACCGCGT CAAAGCCGCC
GTCCACCACG TCTACCGCGA AGGCAAACAC CTCACCAGGG ACATGGGTGG CACTACGTCC
ACCAGCGAAT TCGCCGATAA AGTCGTCGAG GCCATCCACA GCAAAGACCT CGTCGTCCCC
GCACCGCCGG TACAAAGTCC AGCGTAA
 
Protein sequence
MTYKITLIPG DGIGPEVTSA AVRVLEATGL KFEWESFAAG AEAYEKYKEY IPKELNESIE 
RTRIGLKGPV TTPIGGGFSS INVELRKRFE LYANVRPIRN LPGVHTRYPG VDLVVVRENT
EGLYSGIEHE VVPGVVESLK IITEKASTRI SKFAFNYARK MGRKKIHSIH KANIMKMSDG
LFIRCSRNIS KEYPEIIYGE HIVDNTCMQL VMNPYQYDIL LLENLYGDIV SDLCAGLVGG
LGLAPGANIG ERASIFEAVH GSAPDIAGKN IANPTAVIRS GILMLRHLDE QDAANRVKAA
VHHVYREGKH LTRDMGGTTS TSEFADKVVE AIHSKDLVVP APPVQSPA