Gene Acid345_3109 details

Gene Information

Locus tag: Acid345_3109
Symbol:
ID: 4070223
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3693752
End bp: 3695473
Gene Length: 1722 bp
Protein Length: 573 aa
Translation table: 11
GC content: 63%
IMG OID: 637985128
Product: dihydroxy-acid dehydratase
Protein accession: YP_592184
Protein GI: 94970136
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
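
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the record above.
start_bp = 3693752
end_bp = 3695473
gene_length_bp = 1722
protein_length_aa = 573

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = span // 3
expected_protein_length = codons - 1

print("span (bp):", span)
print("codons:", codons)
print("expected protein length (aa):", expected_protein_length)
```

Under these assumptions the span and the derived protein length agree with the Gene Length and Protein Length fields.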


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGAAA AATCCCCAAA GCCACACAAG CGTAGCGACG CCATTACCGA AGGTCCCAAC 
CGCGCCCCAG CCCGCGCCAT GCTGCGCGCC GCCGGGTTCA CCCCGGAAGA CCTTCGTAAG
CCCATCATCG GCATCGCCAA CACCTGGATC GAAATCGGTC CCTGCAACCT GCACCTTCGT
GAGCTCGCCG AGCACATCAA GCAGGGCGTT CGCGAAGCCG GCGGTACGCC AATGGAGTTC
AACACCGTCT CCATTTCTGA CGGCATCACC ATGGGCTCGG AAGGCATGAA GGCGTCGCTG
GTCAGCCGCG AGGTCATCGC CGACTCCATC GAACTCGTCG CGCGCGGCAA TCTTTTCGAC
GGCCTGATCG CCCTATCGGG ATGCGATAAA ACCATCCCCG GGACCATCAT GGCGCTCGAG
CGCCTCGACA TCCCGGGTCT CATGCTCTAT GGCGGCTCCA TCGCGCCCGG CAAATTCCAT
GCTCAGAAGG TCACGATTCA AGACGTCTTC GAGGCAGTCG GTACGCATGC GCGCGGCAAA
ATGAGCGACG CTGATCTCGA AGAACTCGAA CACAACGCCT GTCCCGGCGC CGGCGCCTGC
GGTGGACAAT TCACCGCTAA CACCATGTCC ATGTGCGGCG AATTCCTCGG CATCTCGCCG
ATGGGCGCTA ACAGCGTCCC GGCGATGACC GTAGAGAAGC AGCAGGTCGC ACGACGCTGC
GGACACCTCG TCATGGAACT GGTCCGCCGC GACATCCGCC CCAGCCAAAT CATCACGCGC
AAGGCGATCG AAAACGCAAT CGCCAGCGTC GCCGCCTCCG GCGGGTCGAC CAACGCCGTT
CTTCACTTGC TCGCCATCGC GCACGAAATG GACGTCGAAC TGAACATCGA AGACTTCGAC
AAGATCAGTT CGCGCACGCC ACTGCTCTGC GAACTCAAGC CCGCTGGCCG CTTCACCGCC
ACCGATCTTC ATGATGCCGG CGGTATTCCG CTCGTCGCGC AACGCCTGCT CGAAGCGAAC
CTGCTGCACG CCGACGCACT GACTGTCACC GGTAAGACCA TCGCCGAAGA AGCGAAGCAG
GCGAAAGAAA CGCCGGGCCA GGAAGTGGTT CGTCCGCTTA CCGATCCCAT CAAAGCTACC
GGCGGCCTCA TGATCCTGAA AGGCAACCTC GCATCCGAAG GCTGCGTGGT CAAACTCGTC
GGACACAAGA AGCTCTTCTT TGAAGGCCCC GCTCGCGTCT TCGAGTCGGA AGAAGAAGCC
TTTGCGGGCG TGGAAGACCG CACCATCCAG GCGGGCGAAG TGGTCGTGGT CCGATATGAA
GGCCCGAAGG GCGGCCCTGG CATGCGCGAA ATGCTTGGCG TGACGGCGGC CATCGCCGGC
ACCGAACTCG CCGAGACCGT CGCGCTCATC ACCGACGGAC GTTTCTCCGG CGCTACCCGC
GGCTTGAGCG TGGGCCACGT TGCGCCCGAA GCCGCGAATG GCGGCGCGAT CGCTGTGGTA
CGCAATGGCG ACATCATCAC TCTCGACGTG GAACGCCGCG AACTGCGCGT CCACCTCACC
GACGCAGAAC TCGAAGCGCG CCTCCGCAAC TGGCGCGCGC CGGAGCCACG ATACAAGCGC
GGCGTCTTCG CCAAGTATGC GAGCACCGTT TCGTCGGCGT CGTTCGGGGC CGTTACCGGC
TCCACCATCG AAAACAAGAC CTTAGCCGGG AGTACGAAGT AG
 
Protein sequence
MTEKSPKPHK RSDAITEGPN RAPARAMLRA AGFTPEDLRK PIIGIANTWI EIGPCNLHLR 
ELAEHIKQGV REAGGTPMEF NTVSISDGIT MGSEGMKASL VSREVIADSI ELVARGNLFD
GLIALSGCDK TIPGTIMALE RLDIPGLMLY GGSIAPGKFH AQKVTIQDVF EAVGTHARGK
MSDADLEELE HNACPGAGAC GGQFTANTMS MCGEFLGISP MGANSVPAMT VEKQQVARRC
GHLVMELVRR DIRPSQIITR KAIENAIASV AASGGSTNAV LHLLAIAHEM DVELNIEDFD
KISSRTPLLC ELKPAGRFTA TDLHDAGGIP LVAQRLLEAN LLHADALTVT GKTIAEEAKQ
AKETPGQEVV RPLTDPIKAT GGLMILKGNL ASEGCVVKLV GHKKLFFEGP ARVFESEEEA
FAGVEDRTIQ AGEVVVVRYE GPKGGPGMRE MLGVTAAIAG TELAETVALI TDGRFSGATR
GLSVGHVAPE AANGGAIAVV RNGDIITLDV ERRELRVHLT DAELEARLRN WRAPEPRYKR
GVFAKYASTV SSASFGAVTG STIENKTLAG STK
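
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of one way to do that and to compute a GC percentage like the one reported in the GC content field. The helper names `join_blocks` and `gc_percent` are hypothetical, and only the first line of the gene sequence is used as input here.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence above, used as a short example input.
first_line = "ATGACCGAAA AATCCCCAAA GCCACACAAG CGTAGCGACG CCATTACCGA AGGTCCCAAC"
dna = join_blocks(first_line)

print("bases:", len(dna))
print("starts with ATG:", dna.startswith("ATG"))
print("GC percent:", round(gc_percent(dna), 1))
```

Passing the full gene sequence through `join_blocks` and `gc_percent` should give a figure comparable to the reported GC content, allowing for rounding in the record.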