Gene Acid345_4706 details

Gene Information

Locus tag: Acid345_4706
Symbol:
ID: 4070756
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 5565131
End bp: 5566447
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 11
GC content: 61%
IMG OID: 637986751
Product: isocitrate lyase
Protein accession: YP_593780
Protein GI: 94971732
COG category: [C] Energy production and conversion
COG ID: [COG2224] Isocitrate lyase
TIGRFAM ID: [TIGR01346] isocitrate lyase
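
The start, end, gene length, and protein length fields above can be cross-checked against each other with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length(5565131, 5566447)
print(length)                  # 1317
print(protein_length(length))  # 438
```

Both values agree with the Gene Length and Protein Length fields of this record.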


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.942851
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAACGG AGAACCCGAC TGCCCAGCTT CGCCTTGACT GGGAGACCAA CCCACGCTGG 
AAGGGCATCA CCCGGCCTTA CACCGCAGAA GACGTTGCGC GATTGCGCGG CACCATTCGC
ATTGACCACA CGCTGGCGCG CCTCGGCGCA GAGCGGCTGT GGAAGCTGCT TCAAGAAGAC
AAATACACCC AGGCGCTCGG TGCATTGACC GGCAACCAGG CGGTGCAGAT GGTCAAAGCC
GGATTGAAAG CGATCTACGT CTCGGGCTGG CAAGTGGCGG GCGACGCCAA TGACGCGGGC
CAAATGTATC CCGACCAAAG CTTGTATCCG GCAGATAGCG TTCCGAACCT TGTGCGCCGA
TTGAACAACG CACTGTTGCG CGCCGATCAG ATCCATCACT CCGAGGGGAA GAATGGAATG
TATTTCCTCG CGCCGATGAT CGCCGATGCC GAAGCCGGCT TCGGCGGCAA CCTCAACGCC
TTCGAGTTGA TGAAAGCGAT GATCGAGTCG GGAGCAGCCT GCGTTCACTT CGAGGACCAA
CTGTCTTCGG CGAAGAAGTG CGGGCACCTC GGCGGCAAAG TTCTGGTACC GACCGGCGAA
GCGATCCAGA AGCTGGTCGC AGCTCGCTTG GCAGCCGACA TCTGCGGCGT GCCGACGCTG
ATCCTGGCGC GTACCGATGC GAACAGCGCC CATCTGCTCA CCAGCGACAT CGATCCCTAC
GATCGCGAGT TCTGCACCGG CGAGCGCACC AGCGAAGGCT TCTTCTGCAT TCGCGGCGGG
TTGGATTCCG CCATCGCGCG CGGCTTGGCC TACGCACCGT ATGCGGACCT GATCTGGTGC
GAAACTTCGG AACCCAACAT CGGCGAGGCG CAGCGCTTCG CGGAGGCGAT CCATGCGAAG
TTTCCGGGAA AGATGCTTGC TTACAACTGC TCTCCGTCGT TCAACTGGAA GAAGAAATTG
TCGGAGGAAG ACATCGCGCG CTTCCAGCCG GCGCTCGCGG AAATGGGTTA CAAGTTCCAG
TTCATTACCC TGGCGGGGTT CCATGCGCTG AACCTCAGCA TGTACGAGTT GGCCACGGGT
TATCGTCAGA GCGGCATGAC GGCCTATTCC GCCCTGCAGC AGCGGGAATT CCAACTGGAG
CCCGAGGGTT ACGAAGCGGC GAAACACCAG CGCTTCGTAG GCACCGGCTA CTTCGACCAG
GTGCAGAACG TGGTCACCAG CGGCAAGGCC TCGACCCGGG CCCTCGAACA CTCCACGGAA
GCCGAACAGT TCCACGCTAG CGAAACGCCC GAAAAGAGCG GCGTGGCGGC CGACTAG
 
Protein sequence
MATENPTAQL RLDWETNPRW KGITRPYTAE DVARLRGTIR IDHTLARLGA ERLWKLLQED 
KYTQALGALT GNQAVQMVKA GLKAIYVSGW QVAGDANDAG QMYPDQSLYP ADSVPNLVRR
LNNALLRADQ IHHSEGKNGM YFLAPMIADA EAGFGGNLNA FELMKAMIES GAACVHFEDQ
LSSAKKCGHL GGKVLVPTGE AIQKLVAARL AADICGVPTL ILARTDANSA HLLTSDIDPY
DREFCTGERT SEGFFCIRGG LDSAIARGLA YAPYADLIWC ETSEPNIGEA QRFAEAIHAK
FPGKMLAYNC SPSFNWKKKL SEEDIARFQP ALAEMGYKFQ FITLAGFHAL NLSMYELATG
YRQSGMTAYS ALQQREFQLE PEGYEAAKHQ RFVGTGYFDQ VQNVVTSGKA STRALEHSTE
AEQFHASETP EKSGVAAD
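
The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be removed before they are processed. The following is a minimal sketch of how the gene sequence could be translated and its GC content computed. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which start codons it permits). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGGCAACGG AGAACCCGAC TGCCCAGCTT"
print(translate(first_blocks))  # MATENPTAQL
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.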