Gene Acid345_1856 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1856 
Symbol 
ID4069198 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2232682 
End bp2233680 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content61% 
IMG OID637983865 
Productpyridoxal biosynthesis lyase PdxS 
Protein accessionYP_590931 
Protein GI94968883 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0214] Pyridoxine biosynthesis enzyme 
TIGRFAM ID[TIGR00343] pyridoxal 5'-phosphate synthase, synthase subunit Pdx1 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGAAA ACAACGGACA CTACGCACTG CGCCTGAAGA CCGGCCTAGC CGAAATGCTC 
AAGGGCGGCG TAATCATGGA TGTAATGAAC GCCGCGCAGG CCGAAATCGC GCAGGAATCC
GGCGCCACGG CCGTCATGGC TCTGGAGAAG GTACCCGCCA TGCTTCGCGC CGAAGGCGGC
GTCGCCCGCA TGGCCAGTCC CAAGGTGATC AAAGAGATCA TGCGCACCGT GGACATCCCG
GTCATGGCCA AGTGCCGCAT CGGCCACTTC GCCGAGGCGC AGATACTCCA GGAACTCGGC
GTCGATTACA TCGACGAGTC CGAAGTCCTC ACCCCCGCCG ACGAAGAACA CCATGTCGAC
AAGCACGCCT TCAAGACGCC GTTTGTCTGC GGCGCCCGCA ACCTCGGTGA AGCTCTCCGC
CGCATCGCCG AAGGCGCAGC CATGATCCGC ACCAAGGGCG AAGCTGGCAC CGGCGACGTC
ATCCATGCCG TCAAGCACAT GCGTCAAATC TTGAAAGAAA TGAAGCAGTT GACGGTGATG
CCTGAAGAAG AACTTTATGC GGCAGCCAAG AACCACCAAG CACCCTACGA GCTCATTCGC
ACGGTCGCCA AAGCTGGCAA GCTCCCCGTG CCGAACTTCT CCGCTGGCGG CATCGCCACT
CCGGCAGACG CCGCTCTCAT GATGCAGCTG GGCGCCGAGG CCGTCTTCGT AGGCTCCGGC
ATCTTCATGA AGGAGCGAGC TACCCCGCTC GAGGTAGGAA AGTGGAGTGA ACAGGACGCC
AAGCAAGGTC GCTGCTCGAG CAATCAGGTA GGTACCCCAC GAAATCCTGA AGAGCGCGAA
GAAGCCATGA CTCGCGCCAA GGCGATCGTC GAGGCCACTG CACACTTCAA CGACCCCAAG
GTCTTGGCCG AAGTTAGCGA GAAGGTCACG GGAACGATGA AGGGCCTCGC CGTCTCGGCT
ATCGAAGAGA AGGAACTGCT ACAGACCCGT GGCTGGTAA
 
Protein sequence
MAENNGHYAL RLKTGLAEML KGGVIMDVMN AAQAEIAQES GATAVMALEK VPAMLRAEGG 
VARMASPKVI KEIMRTVDIP VMAKCRIGHF AEAQILQELG VDYIDESEVL TPADEEHHVD
KHAFKTPFVC GARNLGEALR RIAEGAAMIR TKGEAGTGDV IHAVKHMRQI LKEMKQLTVM
PEEELYAAAK NHQAPYELIR TVAKAGKLPV PNFSAGGIAT PADAALMMQL GAEAVFVGSG
IFMKERATPL EVGKWSEQDA KQGRCSSNQV GTPRNPEERE EAMTRAKAIV EATAHFNDPK
VLAEVSEKVT GTMKGLAVSA IEEKELLQTR GW