Gene Acid345_2187 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2187 
Symbol 
ID4071439 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2609862 
End bp2610971 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content59% 
IMG OID637984203 
ProductO-succinylbenzoate synthase 
Protein accessionYP_591262 
Protein GI94969214 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID[TIGR01928] o-succinylbenzoic acid (OSB) synthetase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATCG AAGCAATTAC ACTGCGCGAA ATTGAAATGC CGCTGGTCAA CTTTTTCGAG 
ACCAGCTTTG GGCGTATCTA CAGCCGCCGC ATGCTACTGG TGACGATGCA TTGCGATGGC
GTGGACGGCT GGGGCGAGTG CGTAGCTGAC GAGGCGCCGT TTTATAGCCC GGAGTCGGTG
GATACGGCGT GGCTGATTAT TCGCATGTAC CTGGCGCCGA TGCTGCTCGG CAAAGAAGTG
GAGCGAGGCG CCGACGTTCA GCCGCTGCTG GCGCGCGTGC GCGGCCATCG CATGGCCAAG
GGCGTCCTTG AGAACGCAAT GTGGGACGCC GAAGCGAAGG CGAAGAATCT GCCGATATGG
AAGCTGCTCG GCGGTTCCCG TGAAAAGATT CCGTGTGGCG TGTCGATCGG CATCCAGGAC
TCGCACGAGC AGTTGCTCGA CAAGATTGAG ACCGAACTCG CCGCTGGCTA TCAGCGCATC
AAGGTGAAGG TGAAGCCGGG ATGGGACGTC GAAGTGCTCG AGAAAATTCG CAAGCGCTGG
CCCGACATCC TGCTGAGCTG CGATGCGAAT TCGGCATACA CGCTGAGCGA CTTCGAGCAC
CTCAAAGAGT TCGAGCAATT CAAGCTGCTG ATGATCGAGC AGCCCCTCTG GAACGACGAC
TTCTACTTCC ATGCCGCGCT ACAGAAGCAA CTGAAGACAG CCCTCTGTCT CGATGAGGCA
ATCGAAAGCT GGCGCGATGC GCAGGCCGCG CTGGAACTGG GCGCCTGCAG GATCGTCAAC
ATCAAGGTCG GGCGCGTTGG CGGATTCAGC GAAGCCATCG CAGTCCACGA CATCGCGCAA
CGCTTCGGCG TGCCGGTGTG GTGCGGCGGC ATGCTCGAGT GCGGCCTCGG ACGGTCGCAT
AACATCGCGC TCTCGACGCT GCCAAATTTC TCGCTACCTG GCGATGTGTC GGCGTCCAAA
CGATATTGGA AGGAAGACGT AATCGAGCCC GAAGTCACGG TTTCGCCGGA TGGCTTTATT
CCGATTCGCG ACGTGCCGGG TACGGGATAT ACGTTGCGCG AAGATCAGAT TGAGAGAATC
ACCACGAAGA AAGAGACGGT GAGGGCATAG
 
Protein sequence
MKIEAITLRE IEMPLVNFFE TSFGRIYSRR MLLVTMHCDG VDGWGECVAD EAPFYSPESV 
DTAWLIIRMY LAPMLLGKEV ERGADVQPLL ARVRGHRMAK GVLENAMWDA EAKAKNLPIW
KLLGGSREKI PCGVSIGIQD SHEQLLDKIE TELAAGYQRI KVKVKPGWDV EVLEKIRKRW
PDILLSCDAN SAYTLSDFEH LKEFEQFKLL MIEQPLWNDD FYFHAALQKQ LKTALCLDEA
IESWRDAQAA LELGACRIVN IKVGRVGGFS EAIAVHDIAQ RFGVPVWCGG MLECGLGRSH
NIALSTLPNF SLPGDVSASK RYWKEDVIEP EVTVSPDGFI PIRDVPGTGY TLREDQIERI
TTKKETVRA