Gene Acid345_2964 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2964 
Symbol 
ID4068865 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3508813 
End bp3510024 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content57% 
IMG OID637984983 
Product3-isopropylmalate dehydrogenase 
Protein accessionYP_592039 
Protein GI94969991 
COG category[C] Energy production and conversion
[E] Amino acid transport and metabolism 
COG ID[COG0473] Isocitrate/isopropylmalate dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.622363 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.574229 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCGTC CAACTGTAGT GACAATGCCC GGCGATGGGA TCGGGAACCA GGTTTTGCCG 
CAGGCGATTC GCGTCCTTGA AGCGGTGGGC TTCGAGGCTA ACTACGTGCA TGCCGATATC
GGCTGGGAGT GCTGGTGCAA CGAAGGCAAT GCACTGCCGG ACCGTACCAT TCAACTGTTG
CGTAAGCACA AGCTCGGTCT GTTCGGCGCG ATCACGAGTA AGCCGAAGAA GGCTGCCGAT
GCCGAATTGA AGCCCGAACT TCGCGGCAAA GGCCTCTCGT ACTTCAGTCC GATCGTGACC
ATGCGGCAGT TGTTCAATCT CGACGTGTGC ATGCGGCCTT GCTTGTCGTT TCCGGGAAAT
CCGCTGAATT TCATCCGTCA AACAACGTGC GGTGGATTTG AAGAGCCGCA GGTGGATGTC
GTCGTTTTCC GGCAAAACAC CGAAGGATTG TACGCGGGCG TGGAGTGGAC GAATCCACCG
GAGAACGTGC GTACTGCGCT GGCATCGCAT AAGAAGTTCG CGGCCTTCGC GAATACACCG
GGTGAAGAAC TGGCGGTGTC GGTGCGCATT ATCACTAAGA AGAATGCGCA ACGGATTTGC
GAGGCGGCAT TCAAGCACGC GAAGAAATAC CGCTACAAGA ACGTGACCAT CTGTGAGAAG
CCGAACGTGC TGCGCGAGAC GAGCGGCATG ATGGAAGAAG TGGCGAAGCA GGTACAGAAA
CAGTATCCGG AGATCGCATT GTGGTCCACG AACATTGACG CGCAAACAAT GTGGCTGACG
AAGAACCCTG AAGAGTATGG GGTGATCGTG GCCAGCAACC TGTTTGGCGA TGTGATTTCC
GACGCGTTCG CGGGACTCGT GGGTGGGTTG GGATTTGCGG CGAGCGGCAA TATCGGCGAT
GAAGTCGCGG TGTTTGAGCC GACGCATGGA TCGGCGCCGA AGTATGCCGA GTTAAATCCG
TCGATCGTAA ATCCGATCGC GATGATCCTG TCGGCAGCGA TGATGCTCGA CCACATCGGC
GAGAGCGAGA AAGCAGATCG GATCCGGAAG GCGATTGCTG ACGTGGTGAA AGAAGGCAAG
GTTCGGACCT ACGACATGAT GCGGATTGCG GGCGGGCCGA AGTCGCTGGC GCAGGGCGCG
GCGAATACGG TCCAAATAAC GGATGCGATT TTGGTGCATG TGGAGACTGC GATCGCAGCT
TTCGCGCGCT AG
 
Protein sequence
MARPTVVTMP GDGIGNQVLP QAIRVLEAVG FEANYVHADI GWECWCNEGN ALPDRTIQLL 
RKHKLGLFGA ITSKPKKAAD AELKPELRGK GLSYFSPIVT MRQLFNLDVC MRPCLSFPGN
PLNFIRQTTC GGFEEPQVDV VVFRQNTEGL YAGVEWTNPP ENVRTALASH KKFAAFANTP
GEELAVSVRI ITKKNAQRIC EAAFKHAKKY RYKNVTICEK PNVLRETSGM MEEVAKQVQK
QYPEIALWST NIDAQTMWLT KNPEEYGVIV ASNLFGDVIS DAFAGLVGGL GFAASGNIGD
EVAVFEPTHG SAPKYAELNP SIVNPIAMIL SAAMMLDHIG ESEKADRIRK AIADVVKEGK
VRTYDMMRIA GGPKSLAQGA ANTVQITDAI LVHVETAIAA FAR