Gene Acid345_1950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1950 
Symbol 
ID4071426 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2343181 
End bp2344626 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content58% 
IMG OID637983962 
Productisocitrate dehydrogenase (NADP) 
Protein accessionYP_591025 
Protein GI94968977 
COG category[C] Energy production and conversion 
COG ID[COG0538] Isocitrate dehydrogenases 
TIGRFAM ID[TIGR00183] isocitrate dehydrogenase, NADP-dependent, prokaryotic type 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTATA ACGGGCTCCC ACTGCCTACT GACGGCGAAA AGATCACTTT CAAAGATGGC 
CGCTTCGCAA TTCCTAACAA TCCAATCATT CCTTACATCG AAGGCGATGG CACGGGCCGC
GATATTTGGA AGGCTTCCGT TCGGGTCTTC GACGCCGCTG TGCAAAAGGC CTATGGCGGT
AAGCGCAAAG TCGCATGGTA CGAAGTGTTC GCCGGCGAGA AGGCAAAAGA GAAATTCAAT
ACCTGGCTGC CCGACGACAC CGTGAGTGCC TTCCGCGAGC TGCGGGTTGG CATCAAGGGG
CCGTTGACTA CGCCGATTGG CGGCGGAATC CGTTCCCTCA ACGTCACGCT GCGGCAGGTG
CTCGATCTGT ATTCCTGCGT GCGTCCGGTG AAGTACTACC AGGGCATCCC TTCGCCGGTG
AAGCATCCCG AGCGCATGGA CATCGTGATT TTCCGCGAGA ACACCGAAGA CGTATACGCG
GGCATCGAGT GGAAGCAGGG AACTGACAAA GCGAAGAAGA TCATCGACTT CATCAACAAG
GAAATGCTCG CAGGCACGAA CAAGAGCATC CGCGAAGATA GCGGCATCGG TATTAAGCCG
ATCTCGGTCA CGGGCACAAA GCGCCTGGTG AAGATGTCGA TCGAGCACGC GCTGAAGACC
GGACGGAAGT CAGTCACGTT GGTGCATAAA GGCAACATCC AGAAGTTCAC CGAAGGCGCG
TTCCGTGAGT GGGGCTACGA AGTGGCCACG CAGGAGTTCC GCGCGCAGTG CGTGACCGAA
CGCGAGAGCT GGATCCTCGA CAACAAGGAC AAGAATCCCA ACCTCTCGAT CGAAGAGAAC
GCGCACATGG TCGAACCCGG CATCGACTTC GCGCCCGACG ATTTCAAGCA GGGCGTGTAT
GCGGAAGTGA AGCAGGTGCT GGACGCAATC TATGCCAGCC ACGGCGATCG CCGCTGGAAG
AGCATGCTGA TGATCAATGA CCGCATTGCG GACTCGATCT TCCAGCAGAT CGTGACGCGT
CCGGAAGAAT ACAGCGTGCT GGCAACACCG AACCTCAACG GCGATTACAT CTCCGATGCC
TGCGCAGCGC AGGTCGGCGG ACTCGGCATC GCACCGGGTG CGAACATCGG GCGCGAACAC
GCGATCTTCG AAGCGACGCA CGGCACAGCG CCGAAGTACG CCGATAAGGA CGTCATCAAC
CCGGGCTCAG TCATTCTCTC GGGCGTGATG ATGTTCGACT TCCTCGGTTG GGAAGAGGCG
GCACGGCTCA TCGAAGACTC GATGGAGCAG ACAATTCAAA AGAAGACGGT GACCTACGAC
TTCGAGCGCC TGATGGAAGG CGCAAAGAAG GTGACCACCA GTGGCTTTGC CAACTTGATC
ATCGGCAAAA TGGAACACCA CGGCGCGACG GACGGAAACC ATCATCAACA CGCGGCACAC
GAGTAA
 
Protein sequence
MSYNGLPLPT DGEKITFKDG RFAIPNNPII PYIEGDGTGR DIWKASVRVF DAAVQKAYGG 
KRKVAWYEVF AGEKAKEKFN TWLPDDTVSA FRELRVGIKG PLTTPIGGGI RSLNVTLRQV
LDLYSCVRPV KYYQGIPSPV KHPERMDIVI FRENTEDVYA GIEWKQGTDK AKKIIDFINK
EMLAGTNKSI REDSGIGIKP ISVTGTKRLV KMSIEHALKT GRKSVTLVHK GNIQKFTEGA
FREWGYEVAT QEFRAQCVTE RESWILDNKD KNPNLSIEEN AHMVEPGIDF APDDFKQGVY
AEVKQVLDAI YASHGDRRWK SMLMINDRIA DSIFQQIVTR PEEYSVLATP NLNGDYISDA
CAAQVGGLGI APGANIGREH AIFEATHGTA PKYADKDVIN PGSVILSGVM MFDFLGWEEA
ARLIEDSMEQ TIQKKTVTYD FERLMEGAKK VTTSGFANLI IGKMEHHGAT DGNHHQHAAH
E